我正在嘗試實施Twitter情緒分析。我需要獲取所有積極的推文和消極推文,並將它們存儲在特定的文本文件中。PIG:Twitter情緒分析
sample.json
{"id": 252479809098223616, "created_at": "Wed Apr 12 08:23:20 +0000 2016", "text": "google is a good company", "user_id": 450990391}{"id": 252479809098223616, "created_at": "Wed Apr 12 08:23:20 +0000 2016", "text": "facebook is a bad company","user_id": 450990391}
dictionary.text讓所有的正面和negetive單詞列表
weaksubj 1 bad adj n negative
strongsubj 1 good adj n positive
豬腳本: -
tweets = load 'new.json' using JsonLoader('id:chararray,text:chararray,user_id:chararray,created_at:chararray');
dictionary = load 'dictionary.text' AS (type:chararray,length:chararray,word:chararray,pos:chararray,stemmed:chararray,polarity:chararray);
words = foreach tweets generate FLATTEN(TOKENIZE(text)) AS word,id,text,user_id,created_at;
sentiment = join words by word left outer, dictionary by word;
senti2 = foreach sentiment generate words::id as id,words::created_at as created_at,words::text as text,words::user_id as user_id,dictionary::polarity as polarity;
res = FILTER senti2 BY polarity MATCHES '.*possitive.*';
描述RES: -
res: {id: chararray,created_at: chararray,text: chararray,user_id: chararray,polarity: chararray}
但是,當我傾倒RES我沒有看到任何輸出,但它沒有任何錯誤執行罰款。
我在這裏做了什麼錯誤。
請給我建議。
Mohan.V
感謝您的回覆@Sandesh。 – Bunny
我想什麼ü建議。 但仍然,它的運行成功,但沒有輸出。 – Bunny
我已經去掉空格編輯字典文件。 – Bunny