下載的所有文件在網站

我需要下載在這個環節只有郊區的名字保持在每一個環節下載的所有文件在網站

只是一個參考改變下這個搜索鏈接 https://www.data.vic.gov.au/data/dataset/2014-town-and-community-profile-for-thornbury-suburb

的所有文件中的所有文件： https://www.data.vic.gov.au/data/dataset?q=2014+town+and+community+profile

任何可能性？

謝謝:)

來源

2017-08-07 Bharath

我建議寫一些代碼，將做到這一點 –

您可以下載這樣

import urllib2 
response = urllib2.urlopen('http://www.example.com/file_to_download') 
html = response.read()

文件，以獲取所有的頁面

from bs4 import BeautifulSoup 

import requests 
r = requests.get("http://site-to.crawl") 
data = r.text 
soup = BeautifulSoup(data) 

for link in soup.find_all('a'): 
    print(link.get('href'))

來源

2017-08-07 06:59:23 naren

感謝@naren做。它的工作完美:) – Bharath

你可以做到這一點的聯繫方式有兩種1.by代碼2.by software

1.by code

使用Web Scrapper類庫如BeautifulSoup，JSoup

如果它不起作用，或者您需要執行點擊按鈕，那麼您可以使用Selenium。

2.by軟件

使用互聯網下載管理器的抓取功能這個

來源

2017-08-07 11:09:11

感謝您的回答 – Bharath

下載的所有文件在網站

回答

相關問題