2013-05-28 67 views
1

Extracting data from div tags in Python

I want to scrape some data that is nested inside div tags on a web page.

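import urllib2
from bs4 import BeautifulSoup  # assumes BeautifulSoup 4 is installed

url = 'http://london2012.fiba.com/pages/eng/fe/12/olym/p/gid/26/grid/A/rid/9087/sid/6233/game.html'
boxurl = urllib2.urlopen(url).read()  # download the raw HTML of the page
soup = BeautifulSoup(boxurl)

# Find the first <div class="scoreA"> element on the page
linescoreA = soup.find("div", {"class": "scoreA"})

print linescoreA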

This prints the following:

<div class="scoreA"> 
<div class="period">19</div> 
<div class="period">22</div> 
<div class="period">22</div><div class="period">26</div> 
<div class="final">89</div> 
<div class="clear"></div> 
</div> 

This is where I'm stuck. How do I get the data out of the div tags?

Answers

2

To get just the text data, use .stripped_strings:

print list(linescoreA.stripped_strings) 
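
As a rough illustration of what that call returns, here is a minimal sketch that parses the HTML from the question offline instead of fetching the page. It assumes Python 3 and BeautifulSoup 4 with the built-in "html.parser" backend, unlike the Python 2 snippets in this thread. The names `html`, `scores`, `periods` and `final` are made up for the example.

```python
from bs4 import BeautifulSoup

# The markup printed in the question, pasted in as a string (no network access needed)
html = """
<div class="scoreA">
<div class="period">19</div>
<div class="period">22</div>
<div class="period">22</div><div class="period">26</div>
<div class="final">89</div>
<div class="clear"></div>
</div>
"""

soup = BeautifulSoup(html, "html.parser")
linescoreA = soup.find("div", {"class": "scoreA"})

# stripped_strings yields each text node with surrounding whitespace removed
# and skips nodes that are whitespace only
scores = [int(s) for s in linescoreA.stripped_strings]

# Assumption for this example: the last number is the final score,
# and everything before it is a period score
periods, final = scores[:-1], scores[-1]

print(scores)
print(periods, final)
```

Because the empty "clear" div contains no text, it contributes nothing to the list, so no extra filtering should be needed here.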

Thanks, it seems like it should have been easy. – user2333196

1

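Try:

# Loop over the child <div> tags only. Iterating over the tag itself also
# yields the whitespace text nodes between them, which have no findAll().
for node in soup.find("div", {"class": "scoreA"}).findAll("div"):
    print ''.join(node.findAll(text=True))

Or what about:

# .string is the tag's single text child, or None if the tag is empty
for node in soup.find("div", {"class": "scoreA"}).findAll("div"):
    print node.string

Sorry, I haven't been able to test this for you.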
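
One thing worth keeping in mind with loops like these: iterating directly over a tag walks all of its children, including the whitespace between the nested divs, so selecting the child tags by class can be more predictable. The sketch below is only one possible way to do that, assuming Python 3 and BeautifulSoup 4 with the "html.parser" backend. It works on the HTML from the question pasted in as a string, and the variable names are hypothetical.

```python
from bs4 import BeautifulSoup

# The markup printed in the question, pasted in as a string
html = """
<div class="scoreA">
<div class="period">19</div>
<div class="period">22</div>
<div class="period">22</div><div class="period">26</div>
<div class="final">89</div>
<div class="clear"></div>
</div>
"""

soup = BeautifulSoup(html, "html.parser")
linescoreA = soup.find("div", class_="scoreA")

# Select only the nested divs with class "period" and read their text
periods = [int(div.get_text()) for div in linescoreA.find_all("div", class_="period")]

# The final score sits in its own div with class "final"
final = int(linescoreA.find("div", class_="final").get_text())

# An empty tag has no single text child, so .string is None
empty = linescoreA.find("div", class_="clear").string

print(periods, final, empty)
```

Selecting by class also keeps the period scores and the final score apart without relying on their position in the markup.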


This approach works for the header tags, whereas the one above does not. – user2333196