2014-02-23 40 views
-1
import mechanize 
br=mechanize.Browser() 
r=br.open("http://www.drugs.com/search-wildcard-phonetic.html") 
br.select_form(nr=0) 
br.form['searchterm']='panadol' 
br.submit() 
print br.response().read() 

error in above code: 
Traceback (most recent call last): 
    File "mech2.py", line 6, in <module> 
    br.submit() 
    File "/usr/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 541, in submit 
    return self.open(self.click(*args, **kwds)) 
    File "/usr/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 203, in open 
    return self._mech_open(url, data, timeout=timeout) 
    File "/usr/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 255, in _mech_open 
    raise response 
mechanize._response.httperror_seek_wrapper: HTTP Error 403: request disallowed by robots.txt 

請幫忙糾正上面的代碼網頁抓取使用代碼機械化錯誤

回答

0

它看起來像有什麼不對您的代碼。你的錯誤信息

mechanize._response.httperror_seek_wrapper:HTTP錯誤403:請求被robots.txt

不允許您似乎違反了robots.txt文件。如果您不想看到此錯誤消息,請停止濫用此網站,並可能與他們聯繫,以獲取可接受的使用他們的數據的方式。

+0

此代碼是否工作?您是否嘗試執行它? – FathimaBeevi