2017-05-28 23 views
0

模塊即時通訊使用「https://github.com/aivarsk/scrapy-proxies」 設置如指定和個人電腦工作完美,如果是鏈接到PC上現有的txt磁貼。proxylist無法加載Scrapy雲

我在settings.py文件中嘗試了幾種不同的Scrapy Cloud方法。

我在同一個文件夾中添加文件「proxylist.txt」比低於除了該項目的設置我把它上傳到「https://dl.dropboxusercontent.com/s/esdm19mnvz2yguf/proxylist.txt

我的替代名稱: PROXY_LIST =「https://dl.dropboxusercontent.com/s/esdm19mnvz2yguf/proxylist.txt」 或 PROXY_LIST = 'proxylist.txt' 或 PROXY_LIST = '/proxylist.txt' PROXY_LIST = '../proxylist.txt'

,如果我不喜歡它在我的電腦PROXY_LIST = 'proxylist.txt',它的功能就像一個魅力,但不是我一次加載Scrapy Cloud。

我收到錯誤。

Traceback (most recent call last): File "/usr/local/lib/python2.7/site-packages/twisted/internet/defer.py", line 1299, in _inlineCallbacks result = g.send(result) File "/usr/local/lib/python2.7/site-packages/scrapy/crawler.py", line 90, in crawl six.reraise(*exc_info) File "/usr/local/lib/python2.7/site-packages/scrapy/crawler.py", line 72, in crawl self.engine = self._create_engine() File "/usr/local/lib/python2.7/site-packages/scrapy/crawler.py", line 97, in _create_engine return ExecutionEngine(self, lambda _: self.stop()) File "/usr/local/lib/python2.7/site-packages/scrapy/core/engine.py", line 69, in init self.downloader = downloader_cls(crawler) File "/usr/local/lib/python2.7/site-packages/scrapy/core/downloader/init.py", line 88, in init self.middleware = DownloaderMiddlewareManager.from_crawler(crawler) File "/usr/local/lib/python2.7/site-packages/scrapy/middleware.py", line 58, in from_crawler return cls.from_settings(crawler.settings, crawler) File "/usr/local/lib/python2.7/site-packages/scrapy/middleware.py", line 36, in from_settings mw = mwcls.from_crawler(crawler) File "/app/python/lib/python2.7/site-packages/scrapy_proxies/randomproxy.py", line 55, in from_crawler return cls(crawler.settings) File "/app/python/lib/python2.7/site-packages/scrapy_proxies/randomproxy.py", line 35, in init fin = open(self.proxy_list) IOError: [Errno 2] No such file or directory: '../proxylist.txt'

請我需要一些幫助。

回答

0

你很可能不在您的setup.py說明中包含該文件。

提供此功能的機制是MANIFEST.in文件。這是相對比較簡單:MANIFEST.in真的只是一個指定的文件或水珠相對文件路徑的列表,包括:

include README.rst 
include docs/*.txt 
include funniest/data.json 

爲了使在安裝時包的文件夾站點包內被複制這些文件,您需要爲setup()函數提供include_package_data=True

請參閱http://python-packaging.readthedocs.io/en/latest/non-code-files.html

+0

我補充一下。但仍然不會工作。 PROXY_LIST ='資源/ proxylist.txt'在manifest.in'包括proxylist.txt 包括資源/ proxylist.txt 包括base_de_datos /資源/ proxylist.txt' –

+0

但不工作[鏈接與我所有的scrapycould項目與所有文件](https://dl.dropbox.com/s/b84d2sk8zu8mj34/bases_de_datos.zip)這裏是我的文件,我已經嘗試了大約40次,但沒有工作,但我在我的電腦中運行它完美的作品,我想要一些指導 –