2017-04-20 98 views
1

crwaling(刮,流)效果非常好如何保存抓取(抓取,流式)結果?

ex。 973:{'text':'RT @ 1111:hihihihihihi'}

但是!無法保存。

我該如何解決?

import tweepy 
import time 
import os 
import json 
import simplejson 

search_term = '5555' 
search_term2= '4444' 
search_term3='3333' 
search_term4='2222' 
search_term5='1111' 

lat = "11.11" 
lon = "11.11" 
radius = "100km" 


API_key = "0" 
API_secret = "0" 
Access_token = "0" 
Access_token_secret = "0" 

location = "%s,%s,%s" % (lat, lon, radius) 

auth = tweepy.OAuthHandler(API_key, API_secret) 
auth.set_access_token(Access_token, Access_token_secret) 

api = tweepy.API(auth) 

c=tweepy.Cursor(api.search, 
       q="{}+OR+{}".format(search_term, search_term2, search_term3, search_term4, search_term5), 
       rpp=1000, 
       geocode=location, 
       include_entities=True) 

data = {} 
i = 1 
for tweet in c.items(): 
    data['text'] = tweet.text 
    print(i, ":", data) 
    time.sleep(0.4) 
    i += 1 

txt文件沒有生成。 - >沒有錯誤信息。

或者,製作txt文件。但是,txt中沒有「tweet text and tweet date」。 - >沒有錯誤信息。

(不一定是一個txt file.Save Excel文件。)

wfile = open(os.getcwd()+"/tqtq.txt", mode='w') 
data = {} 
i = 0  

for tweet in c.items(): 
    data['text'] = tweet.text 
    data['date']= tweet.text 
    wfile.write(data['text','date']+'\n') 
    i += 1 
    time.sleep(0.4) 
wfile.close() 

回答

0

您可以嘗試使用泡菜

import pickle 
pickle.dump(obj, filename) 

加載它回到result = pickle.load(filename)