0
我從'n'列數列中生成了兩列(起點和終點)。現在我想爲這兩列組合計數。我無法得到結果。我得到錯誤的,錯誤1070:無法解析使用計數進口: 下面是我的腳本,如何計算豬列中的兩列組的數量
mydata = load '/Projects/Flightdata/1987/Rawdata' using PigStorage(',') as (year:int, month:int, dom:int, dow:int, deptime:long, crsdeptime:long, arrtime:long, crsarrtime:long, uniqcarcode:chararray, flightnum:long, tailnum:chararray, actelaptime:long, crselaptime:long, airtime:long, arrdeltime:long, depdeltime:long, origcode:chararray, destcode:chararray, dist:long, taxintime:long, taxiouttime:long, flightcancl:int, canclcode:chararray, diverted:int, carrierdel:long, weatherdel:long, nasdel:long, securitydel:long, lateaircraftdel:long);
Step2 = foreach mydata generate origcode, destcode;
grpby = group Step2 by (origcode, destcode) ;
step3 = foreach grpby generate group.origcode as source, group.destcode as destination, Count(step2);
在這裏我要生成地和目的地的每個組合計數。 任何指導都會有所幫助。
嗨,謝謝。我嘗試過相同的情況,但仍然無法工作。 – user3836231
錯誤是,錯誤1070:無法使用導入來解析計數: – user3836231
http://pig.apache.org/docs/r0.12.0/func.html#count – Frederic