新的python和lxml,請耐心等待。現在堅持看起來是unicode問題。我試過.encode,美麗的湯的unicodedammit沒有運氣。已經搜索論壇和網頁,但我缺乏python技能未能將建議的解決方案應用於我的特定代碼。感謝任何幫助,謝謝。lxml unicode輸出問題
代碼:
import requests
import lxml.html
sourceUrl = "http://www.hkex.com.hk/eng/market/sec_tradinfo/stockcode/eisdeqty.htm"
sourceHtml = requests.get(sourceUrl)
htmlTree = lxml.html.fromstring(sourceHtml.text)
for stockCodes in htmlTree.xpath('''/html/body/printfriendly/table/tr/td/table/tr/td/table/tr/table/tr/td'''):
string = stockCodes.text
print string
錯誤:
UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 0: ordinal not in range(128)
你能提供關於錯誤的更多細節嗎?或者在'print string'之前添加一行'print type(string)'來查看發生了什麼。 – iceout 2013-04-07 14:46:04