Trouble with my image scraper

I'm trying to build an image scraper and was wondering if anyone could help with the example below:

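import requests
from lxml import html

# requests needs an explicit scheme such as http://
page = requests.get('http://www.example.com/image1')
tree = html.fromstring(page.text)

pic = tree.xpath(Copied XPath)  # placeholder: paste the XPath copied from the browser here

print(pic[0].attrib['src'])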

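In `page`, my image is at the URL 'www.example.com/image1' in this case. I'd like to know whether I can loop this process if I have a list of image names, e.g. image2, image3, image4, etc.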

Source

2014-07-11 user3450524

Yes, this is possible:

list_of_image_names = ['image1', 'image2', 'image3']

for image_name in list_of_image_names:
    # requests needs an explicit scheme such as http://
    page = requests.get('http://www.example.com/' + image_name)
    tree = html.fromstring(page.text)

    pic = tree.xpath(Copied XPath)  # placeholder: the XPath copied from the browser

    print(pic[0].attrib['src'])

Source

2014-07-11 14:38:22 python

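Thanks for your reply :) – user3450524

No problem. Remember that lists are iterable in Python, so there's no need to write `for picture in range(len(pictureList)):` for the `for` loop as @The2ndSon did. It's more complicated than it needs to be. – python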
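To illustrate the comment above, here is a small sketch comparing the two loop styles. It only builds the page URLs and makes no network requests. The `BASE_URL` name and the `http://` scheme are assumptions added for illustration.

```python
# Assumed base URL; the scheme is included because requests requires one.
BASE_URL = 'http://www.example.com/'
picture_list = ['image1', 'image2', 'image3']

# Index-based loop, as in the range(len(...)) version.
urls_by_index = []
for i in range(len(picture_list)):
    urls_by_index.append(BASE_URL + picture_list[i])

# Direct iteration: the loop variable is the list item itself.
urls_direct = []
for name in picture_list:
    urls_direct.append(BASE_URL + name)

# Both loops build the same list of URLs.
print(urls_by_index == urls_direct)
```

If the index is also needed, for example for progress reporting, `enumerate(picture_list)` yields both the index and the item.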

Assuming the code posted above works, you can replicate the same functionality inside a loop. Here is an example of how that could work.

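def picLooper():
    pictureList = ['image1', 'image2', 'image3']  # list of image names
    pictureURL = dict()  # dictionary mapping each image name to its XPath result
    for picture in range(len(pictureList)):
        # requests needs an explicit scheme such as http://
        page = requests.get('http://www.example.com/' + pictureList[picture])
        tree = html.fromstring(page.text)

        pic = tree.xpath(Copied XPath)  # placeholder: the XPath copied from the browser
        pictureURL[pictureList[picture]] = pic  # store the result under the image name
    return pictureURL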

Note that this implementation assumes you already know the names of the images you want to fetch. Hope this helps as a starting point! :D
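One thing a loop like this may need in practice is a way to keep going when a single page fails, for instance when the XPath matches nothing and indexing the result raises `IndexError`. The sketch below is only one possible approach, not part of the answer above. `collect_image_urls` and `fake_fetch_src` are hypothetical names, and the fake fetcher stands in for the real request and XPath lookup so the example runs without network access.

```python
def collect_image_urls(names, fetch_src):
    """Call fetch_src(name) for each name; keep successes and failures apart."""
    found = {}
    failed = []
    for name in names:
        try:
            found[name] = fetch_src(name)
        except Exception as exc:  # deliberately broad for this sketch
            failed.append((name, str(exc)))
    return found, failed


# Hypothetical stand-in for the real request + XPath lookup.
def fake_fetch_src(name):
    if name == 'image2':
        raise IndexError('no matching <img> element')
    return '/static/' + name + '.jpg'


found, failed = collect_image_urls(['image1', 'image2', 'image3'], fake_fetch_src)
print(found)   # names that produced a src value
print(failed)  # names that raised, with the error message
```

In real use, `fetch_src` would wrap the `requests.get` and `tree.xpath` calls for a single image name.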

Source

2014-07-11 14:40:45 The2ndSon

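I have a list of image names (over 7000) in a CSV. This will be defined as a function. – user3450524

OK, so pictureList is something you can generate beforehand? Sounds like you've got it. – The2ndSon

Thank you very much for your help, I really appreciate it. – user3450524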
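For the CSV case mentioned in this comment, a minimal sketch of loading the names into a list might look like the following. It assumes one image name in the first column of each row and no header row; the thread does not describe the actual file layout. `read_image_names` and the file name in the comment are hypothetical.

```python
import csv
import io


def read_image_names(csv_file):
    """Return the first column of each non-empty row as a list of names."""
    names = []
    for row in csv.reader(csv_file):
        if row and row[0].strip():
            names.append(row[0].strip())
    return names


# In-memory stand-in for a real file. With a file on disk you could pass
# open('image_names.csv', newline='') instead (hypothetical file name).
sample = io.StringIO('image1\nimage2\n\nimage3\n')
list_of_image_names = read_image_names(sample)
print(list_of_image_names)
```

The resulting list can then be passed to whichever loop is used to request each page.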
