0
我在玩imacross
來刮取網站內容,但我一直在試圖從markdown中刪除圖像網址,如下所示。如何使用iMacros颳去圖像的網址
<div class="dpimages-icons-box">
<a href="http://host1.com/1.jpg" class="lightbox" title="9558" rel="dpimages"><img src="//host2.com/9558.jpg" alt="9558" title="9558" width="80" height="54" /></a>
<a href="http://host1.com/2.jpg" class="lightbox" title="9559" rel="dpimages"><img src="//host2.com/9559.jpg" alt="9559" title="9559" width="80" height="67" /></a>
<a href="http://host1.com/3.jpg" class="lightbox" title="9560" rel="dpimages"><img src="//host2.com/9560.jpg" alt="9560" title="9560" width="78" height="80" /></a>
<a href="http://host1.com/4.jpg" class="lightbox" title="9561" rel="dpimages"><img src="//host2.com/9561.jpg" alt="9561" title="9561" width="53" height="80" /></a>
<a href="http://host1.com/5.jpg" class="lightbox" title="9562" rel="dpimages"><img src="//host2.com/9562.jpg" alt="9562" title="9562" width="52" height="80" /></a>
<a href="http://host1.com/6.jpg" class="lightbox" title="9562" rel="dpimages"><img src="//host2.com/9562.jpg" alt="9562" title="9562" width="52" height="80" /></a>
<a href="http://host1.com/7.jpg" class="lightbox" title="9562" rel="dpimages"><img src="//host2.com/9562.jpg" alt="9562" title="9562" width="52" height="80" /></a>
<div class="clearing"></div>
</div>
我怎樣才能提取第一n
圖像的像網址:
http://host1.com/1.jpg
http://host1.com/2.jpg
http://host1.com/3.jpg
http://host1.com/4.jpg
http://host1.com/5.jpg
與imacros
並保存到一個文件.csv
?
請註明您目前嘗試的代碼。 –
我使用 'TAG POS = 1 TYPE = DIV ATTR = CLASS:dpimages-icons-box EXTRACT = HTM SAVEAS TYPE = EXTRACT FOLDER = D:\ Scrape \ FILE = pic.csv' 提取精確的hml標籤,其中需要額外的工作來清除代碼 –
語言和源代碼格式。 –