I want to scrape https://www.crowdcube.com/investments?sector=technology with BeautifulSoup in Python 3, but I can't get Beautiful Soup to scrape the page. The request fails with:
Traceback (most recent call last):
File "D:\DataVisualization\lib\urllib\request.py", line 163, in urlopen
return opener.open(url, data, timeout)
File "D:\DataVisualization\lib\urllib\request.py", line 472, in open
response = meth(req, response)
File "D:\DataVisualization\lib\urllib\request.py", line 582, in http_response
'http', request, response, code, msg, hdrs)
File "D:\DataVisualization\lib\urllib\request.py", line 510, in error
return self._call_chain(*args)
File "D:\DataVisualization\lib\urllib\request.py", line 444, in _call_chain
result = func(*args)
File "D:\DataVisualization\lib\urllib\request.py", line 590, in http_error_default
raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 403: Forbidden
Can you post the Beautiful Soup code you are using? – bejado
from bs4 import BeautifulSoup import urllib, re data = { 'title': [], 'description': [] } l = ('https://www.crowdcube.com/investment') tree = BeautifulSoup(l, 'lxml') #title title = tree.find_all('div', {'cc-cardOpportunity__body'}) data['title'] = tree.find('h1') #description description = tree.find_all('div', {'class': 'cc-cardOpportunity__body'}) data['description'].append(description[1].find('p').get_text() data – Mart
I can't scrape this website :( – Mart
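A 403 from urllib.request often means the server refuses the default "Python-urllib" User-Agent. Below is a minimal sketch assuming that is the cause here; the names BROWSER_UA, build_request and fetch_html are made up for illustration, and the User-Agent string is an arbitrary browser-like value, not something the site documents. If the site relies on JavaScript rendering or other bot protection, this alone may not be enough.

```python
import urllib.request

# Assumption: the site rejects urllib's default "Python-urllib/x.y" User-Agent.
# This browser-like value is an arbitrary example, not a documented requirement.
BROWSER_UA = "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"


def build_request(url, user_agent=BROWSER_UA):
    """Return a Request that carries an explicit User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_html(url, timeout=10):
    """Download the page and return its body decoded as text."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

The string returned by fetch_html(url) is what should be handed to BeautifulSoup(html, 'lxml'). Note that the code in the comment above passes the URL itself to BeautifulSoup, which parses the URL text rather than the page.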