0
A
回答
0
您可以使用排序依據,並限制: -
A =使用PigStorage()作爲(名稱:chararray,count:int)加載'file';
B =按數量排序的A; - 默認情況下,它將上升或下降
C =限制B 1;
D = Foreach C生成名稱;
dump D;
B = order by desc;
C =極限B 1;
D = Foreach C生成名稱;
dump D;
0
下面的例子將幫助你獲得前五名計數
infiles = load '/hdfs/bhavesh/Youtube_POC/Youtube/0222/{0,1,2,3,4}.txt' using PigStorage('\t') as
(videoid:chararray,uploader:chararray,age:int,category:chararray,length:int,views:int,rate:int,rating:int,comments:int,related_id:chararray);
files = FILTER infiles BY category is not null;
grpn_for_catagories = group files by category;
cnt_for_catagories = foreach grpn_for_catagories generate group, COUNT(files.videoid) as counting;
sorted_for_catagories_desc = order cnt_for_catagories by counting desc;
top5_for_catagories = limit sorted_for_catagories_desc 5;
詳細說明,請在
http://ybhavesh.blogspot.in/2015/08/proof-of-concept-or-poc-on-youtube-data.html
希望它可以幫助!!! ...
0
一=使用PigStorage()作爲(名稱:chararray,count:int)加載'文件';
B =按數量排序的A;
C =極限B 1;
D = foreach C生成名稱;
dump D;
相關問題
- 1. MAX(計數)功能的Apache的Pig Latin
- 2. 生成一個子字符串(Apache Pig)的計數
- 3. 用Apache Pig擴展數組
- 4. Apache PIG - GROUP BY
- 5. Apache PIG問題
- 6. apache pig命令
- 7. Apache Pig的基本統計信息
- 8. Pig中的字段值計數
- 9. Apache Pig LOAD錯誤
- 10. Apache Pig 0.8.1 double NaN
- 11. Apache Pig中的UnGroup
- 12. Apache Pig錯誤:java.lang.reflect.InvocationTargetException
- 13. Apache Pig出現字符降序
- 14. Apache Pig:用字符串替換null
- 15. 如何用Apache Pig正確聚合唯一計數?
- 16. Apache PIG - 如何在小數點後削減數字
- 17. 計數值 - 阿帕奇PIG
- 18. Apache Pig中的數學公式
- 19. 從Apache Pig中的數據獲取FileName
- 20. 從apache-pig讀取數據到R
- 21. 加載數據時apache pig錯誤
- 22. Apache Pig和用戶定義函數
- 23. apache PIG中的超前/滯後函數
- 24. Pig Mapreduce來計算連續的字母
- 25. Apache Pig UDF和outputSchema定製
- 26. Apache Pig逃生欄名稱
- 27. Apache Pig棄用錯誤
- 28. Apache Pig的樞軸表
- 29. Apache Pig中的時差?
- 30. Apache Pig和Hadoop的實現
謝謝Bhavesh。 – Naveen
歡迎納文!!! ... – Bhavesh