在一行中兩次降低Hadoop中

2014-02-11 60 views 0 likes

我希望做的是採取單詞作爲輸入序列一個Hadoop作業和輸出線如下：

小寫字母序列頻率的小寫字母序序列頻率的序列

我想到一個例子是最好的解釋：

假設我輸入的數據是：

the sun 
the sun 
the sun 
The sun 
The sun 
The Sun

我想

the sun 6 the sun 3 
the sun 6 The sun 2 
the sun 6 The Sun 1

結束了，我怎樣才能減少兩個小寫序列頻和原序列頻？

來源

2014-02-11 antares

回答

在地圖功能：輸出鍵： sequence.toLowerCase（）輸出值：序列（按原樣）

在用於每個值的降低功能：

Map<String, Integer> occurrences = new HashMap<String, Integer>(); 
occurrences.put(key, occurrences.get(key) + 1); 
if(!key.equals(value)){ 
occurrences.put(value, occurrences.get(key) + 1); 
}

這只是僞代碼。您將收到NPE，因爲occurrences.get（key/value）將返回null首次。只需爲此添加檢查。因此，您將得到您的出現次數和不同大小寫不同的相同序列的地圖。

來源

2014-02-11 18:46:43 Andrew

謝謝@安德魯。我做了一些非常類似的事情，結果很好。 – antares

相關問題

11. 如何降低行和列
12. 如何降低行SQL
13. 正在降低document.domain
14. 在php中降低圖像質量
15. 在Android中降低或淡入聲音
16. 在OpenCV中降低圖像分辨率
17. 如何在ListActivity（Android）中降低高度？
18. 在Android中降低圖像分辨率
19. 兩次動態下降
20. 降低python降價能力
21. 多次運行imagemagick會降低圖像質量嗎？
22. 降低功耗
23. 降低UITableView
24. 降低幀率
25. 降低C++
26. 如何降低
27. 降低幀率
28. 降低分數
29. setAngularVelocity在onGameResume降低，在onPause
30. 在減速類中運行，並降低mehods