Parse error with BeautifulSoup4 and Python 3.3
Running this code:
from bs4 import BeautifulSoup
soup = BeautifulSoup(open("my.html"))
print(soup.prettify())
produces the following error:
Traceback (most recent call last):
  File "soup.py", line 5, in <module>
    print(soup.prettify())
  File "C:\Python33\lib\encodings\cp437.py", line 19, in encode
    return codecs.charmap_encode(input,self.errors,encoding_map)[0]
UnicodeEncodeError: 'charmap' codec can't encode character '\u25ba' in position 9001: character maps to <undefined>
Then I tried:
print(soup.encode('UTF-8').prettify())
But this fails because it tries to prettify a bytes object instead of a string:
Traceback (most recent call last):
  File "soup.py", line 11, in <module>
    print(soup.encode('UTF-8').prettify())
AttributeError: 'bytes' object has no attribute 'prettify'
I'm not sure how to solve this. Any input would be much appreciated.
Try decoding from the byte string first: bytes.decode(my.html) – 2013-02-15 06:22:43
I couldn't get this to work with Beautiful Soup (AttributeError: 'str' object has no attribute ...) – Jim 2013-02-15 16:32:38
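
For what it's worth, both tracebacks seem to point at the same thing: prettify() already returns a str, and the first failure happens only when print() tries to encode that str for a cp437 console, which has no mapping for '\u25ba'. The second attempt fails because encode() is called before prettify(), so prettify() ends up being looked up on bytes. A minimal sketch of one possible workaround follows; console_safe is a hypothetical helper name, and the sample string merely stands in for the output of soup.prettify().

```python
import sys


def console_safe(text, encoding=None):
    """Return text with characters the target encoding cannot
    represent replaced by '?', so that print() will not raise."""
    # Assumption: fall back to the console encoding, then to UTF-8.
    encoding = encoding or sys.stdout.encoding or "utf-8"
    return text.encode(encoding, errors="replace").decode(encoding)


# Stand-in for the str returned by soup.prettify() (an assumption,
# used here so the sketch runs without an HTML file).
pretty = "<p>\u25ba play</p>"

# Forcing cp437 reproduces the console from the traceback.
print(console_safe(pretty, "cp437"))
```

With the real document this would probably be print(console_safe(soup.prettify())). If the unmappable characters need to be kept rather than replaced, another option is to prettify first and encode second, then write the bytes directly: sys.stdout.buffer.write(soup.prettify().encode("utf-8")). Whether they display correctly then depends on the console.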