在Python與相應的UTF-8字符替換HTML實體2.6

&lt;xml ... &gt;

，我想將其轉換爲可讀的格式：

<xml ...>

任何簡單（和快速）的方式來做到這一點在Python？

2009-04-08 Alexandru

我認爲這個問題是這樣的重複：http://stackoverflow.com/questions/57708/convert-xml-html-entities-into-unicode-string-in-python – 2009-04-08 14:37:14

[解碼HTML實體可能的重複在Python字符串？]（http://stackoverflow.com/questions/2087370/decode-html-entities-in-python-string） – csl 2016-09-22 07:07:38

>>> import HTMLParser 
>>> pars = HTMLParser.HTMLParser() 
>>> pars.unescape('&copy; &euro;') 
u'\xa9 \u20ac' 
>>> print _ 
© €

2009-04-08 14:36:29 vartec

有，做它，從後弗雷德指出鏈接的功能here。複製到這裏讓事情變得更簡單。

感謝Fred Larson鏈接到SO上的其他問題。貸記給dF發佈鏈接。

2009-04-08 18:09:06 Benson

回答