2017-08-02 68 views
-1

我需要刮取下面的代碼,以檢索「SCRAPE THIS」和「SCRAPE THIS ASWELL」部分。我一直在玩它幾個小時,沒有運氣!有誰知道這可以做到嗎?使用BeautifulSoup進行網頁掃描 - Python

<div class="mod-body add-border"> <div class="mod-inline mod-body-A-F"> <h4>SCRAPE THIS</h4> <div class="mod-body"> <ul class="list"> <li>SCRAPE THIS AS WELL</li> </ul> </div> </div>

+1

哪裏是你的代碼? – gobrewers14

回答

1

試試這個代碼:

from bs4 import BeautifulSoup 
text = """<div class="mod-body add-border"> <div class="mod-inline mod-body-A-F"> <h4>SCRAPE THIS</h4> <div class="mod-body"> <ul class="list"> <li>SCRAPE THIS AS WELL</li> </ul> </div> </div>""" 
x = BeautifulSoup(text, 'lxml') 
print(x.find('h4').get_text()) 
print(x.find('li').get_text())