我正在使用BeautifulSoup，我想要獲得img標籤的alt值

我想得到一支棒球隊今天比賽的對手隊伍。我正在使用BeautifulSoup，我想要獲得img標籤的alt值

所以我編碼這個。

此代碼從網站獲取今日遊戲的信息。

from bs4 import BeautifulSoup 
import datetime 
import urllib.request 

req = urllib.request.Request("http://www.hanwhaeagles.co.kr/html/game/1st_schedule_list1.asp") 
data = urllib.request.urlopen(req).read() 

bs = BeautifulSoup(data, 'html.parser') 

l = bs.find_all('div') 
idx = 0 

for s in l: 
    try: 
     prop = s.get('class') 
     if prop != None and prop[0] == "box" and len(prop) == 2: 
      l = s 
      break 
    except UnicodeEncodeError: 
     print("Error") 
    finally: 
     idx += 1 

print(l)

和'變量l'是今天的遊戲信息。

img標籤的alt值是對方球隊的球隊名稱。

我想打印...幫助我

來源

2017-08-10 황준필

既然你更感興趣的是這裏面box類存在的數據。可以直接提取類和進一步處理它：

from bs4 import BeautifulSoup 
import datetime 
import urllib.request 

req = urllib.request.Request("http://www.hanwhaeagles.co.kr/html/game/1st_schedule_list1.asp") 
data = urllib.request.urlopen(req).read() 
bs = BeautifulSoup(data, 'html.parser') 

for item in bs.select('.box'): 
    team_name = item.find('img')['alt'] 
    print(team_name) 

'NC' 
'NC' 
...

來源

2017-08-10 09:35:58

試過，但.. –

TEAM_NAME = chunck [0] .find（ 'IMG'）[ 'alt'] IndexError：列表索引超出範圍 –

和我打印的查克，它的值是[] –

from bs4 import BeautifulSoup 
import urllib.request 

req = urllib.request.Request("http://www.hanwhaeagles.co.kr/html/game/1st_schedule_list1.asp") 
data = urllib.request.urlopen(req).read() 

bs = BeautifulSoup(data, 'html.parser') 

table = bs.find('table') 

for tr in table.find_all('tr'): 
    for td in tr.find_all('td'): 
     if td.find('img'): 
      if 'alt' in td.find('img').attrs: 
       print(td.find('img')['alt'])

輸出：

NC 
NC 
NC 
KIA 
KIA 
KIA 
두산 
두산 
삼성 
삼성 
넥센 
넥센 
SK 
SK 
NC 
NC 
롯데 
롯데 
KT 
KT 
KIA 
KIA 
SK 
SK 
LG 
LG 
KT

來源

2017-08-10 09:36:21

我做到了。謝謝！ –

接受答案？） –

我正在使用BeautifulSoup，我想要獲得img標籤的alt值

回答

相關問題