0
我承認這個問題的標題是不明確的。如果有人能看我的問題後改寫它,那將是巨大的。如何避免同樣加入了兩個領域?
反正我有一對是詞語的ID的字段。現在我想用他們的文本替換它們。現在我做了兩聯接和foreach像如下:
WordIDs = LOAD wordID.txt AS (wordID1:long, wordID2:long);
WordTexts = LOAD wordText.txt AS (wordID:long, wordText:chararray);
Join1 = JOIN WordIDs BY wordID1, WordTexts BY wordID;
Replaced1 = FOREACH Join1 GENERATE WordTexts::wordText As wordText1, WordIDs::wordID2;
Join2 = JOIN Replaced1 BY wordID2, WordTexts BY wordID;
Replaced2 = FOREACH Join2 GENERATE Replaced1::wordText1 As wordText1, WordTexts::wordText::wordText2;
有沒有用更少的語句的數量這樣做的任何方式(如一個連接,而不是兩個連接)?