我已經安裝在Windows平臺上lxml2.2.2(即時通訊使用Python版本2.6.5)在LXML解析功能。我想這簡單的命令:誤差
from lxml.html import parse
p= parse(‘http://www.google.com’).getroot()
,但我得到以下錯誤:
Traceback (most recent call last):
File 「」, line 1, in p=parse(‘http://www.google.com’).getroot()
File 「C:\Python26\lib\site-packages\lxml-2.2.2-py2.6-win32.egg\lxml\html_init_.py」, line 661, in parse return etree.parse(filenameorurl, parser, baseurl=baseurl, **kw)
File 「lxml.etree.pyx」, line 2698, in lxml.etree.parse (src/lxml/lxml.etree.c:49590)
File 「parser.pxi」, line 1491, in lxml.etree.parseDocument (src/lxml/lxml.etree.c:71205) File 「parser.pxi」, line 1520, in lxml.etree.parseDocumentFromURL (src/lxml/lxml.etree.c:71488)
File 「parser.pxi」, line 1420, in lxml.etree.parseDocFromFile (src/lxml/lxml.etree.c:70583)
File 「parser.pxi」, line 975, in lxml.etree.BaseParser.parseDocFrom
File (src/lxml/lxml.etree.c:67736)
File 「parser.pxi」, line 539, in lxml.etree.ParserContext.handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File 「parser.pxi」, line 625, in lxml.etree.handleParseResult (src/lxml/lxml.etree.c:64741)
File 「parser.pxi」, line 563, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64056)
IOError: Error reading file ‘http://www.google.com’: failed to load external entity 「http://www.google.com」
我不知道下一步該做什麼,因爲我是python的新手。請指導我解決這個錯誤。提前致謝!! :)
非常感謝你澄清!這個偉大的工程: ) – silentNinJa 2010-06-25 09:50:14
非常歡迎! – MattH 2010-06-25 10:36:11
對不起,但它確實(或至少應該)抓取網址。必須有其他原因。也許先嚐試升級到最新的lxml版本。 – Steven 2011-06-30 18:52:23