<h3>
<a href="article.jsp?tp=&arnumber=16">
Granular computing based
<span class="snippet">data</span>
<span class="snippet">mining</span>
in the views of rough set and fuzzy set
</a>
</h3>
使用Python兩個標記之間的數據我想從它應該是在粗糙集和模糊集獲取在Python
的意見粒度基於計算的數據挖掘我嘗試使用lxml的錨標記上的值
parser = etree.HTMLParser()
tree = etree.parse(StringIO.StringIO(html), parser)
xpath1 = "//h3/a/child::text() | //h3/a/span/child::text()"
rawResponse = tree.xpath(xpath1)
print rawResponse
,並得到以下輸出
['\r\n\t\t','\r\n\t\t\t\t\t\t\t\t\tgranular computing based','data','mining','in the view of roughset and fuzzyset\r\n\t\t\t\t\t\t\]
你是否必須使用'lxml'?因爲我大概可以想出一個解決方案,用'BeautifulSoup' – TerryA
我可以使用任何東西 – Jack