Filtering data extracted with XPath from a DOM document
I am using the HTML below as input to a DOMDocument:
<li id="SalesRank">
<b>Amazon Best Sellers Rank:</b>
#20,267 Paid in Kindle Store (
<a href="http://www.amazon.com/gp/bestsellers/digital-text/ref=pd_dp_ts_kstore_1/190-9295683-0277616">See Top 100 Paid in Kindle Store</a>
)
<ul class="zg_hrsr">
<li class="zg_hrsr_item">
<span class="zg_hrsr_rank">#15</span>
<span class="zg_hrsr_ladder">
in
<a href="http://www.amazon.com/gp/bestsellers/digital-text/ref=pd_zg_hrsr_kstore_1_1">Kindle Store</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/154606011">Kindle eBooks</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/157325011">Nonfiction</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/292975011">Lifestyle & Home</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156699011">Home & Garden</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156828011">Gardening & Horticulture</a>
>
<b>
<a href="http://rads.stackoverflow.com/amzn/click/156847011">Greenhouses</a>
</b>
</span>
</li>
<li class="zg_hrsr_item">
<span class="zg_hrsr_rank">#26</span>
<span class="zg_hrsr_ladder">
in
<a href="http://www.amazon.com/gp/bestsellers/digital-text/ref=pd_zg_hrsr_kstore_2_1">Kindle Store</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/154606011">Kindle eBooks</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/157325011">Nonfiction</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/292975011">Lifestyle & Home</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156699011">Home & Garden</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156828011">Gardening & Horticulture</a>
>
<b>
<a href="http://rads.stackoverflow.com/amzn/click/156849011">House Plants</a>
</b>
</span>
</li>
</ul></li>
I am using the following XPath query and reading the textContent of the result:
$xpath_cat->query('//li[@id="SalesRank"]');
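As you can check in the output, it includes all of the data contained in the li tag with id="SalesRank", whereas I only want "#20,267 Paid in Kindle Store".
So the desired output is:
#20,267 Paid in Kindle Store
How can I modify my XPath to get the desired output?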
Update: I tried the solution provided below and used this XPath:
$xpath_cat->query('//li[@id="SalesRank"]/text()');
But now the output is:
( [0] => [1] => #20,267 Paid in Kindle Store ( [2] => )
How can I fix this?
Still failing... – 2012-02-12 21:04:12
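The three entries in that output are the three text nodes that sit directly under the li: the whitespace before the b tag, the rank text ending in "(", and the ")" after the link. One possible way to narrow this down, sketched here under assumptions, is to keep only the first non-blank text node and then trim the trailing "(" in PHP. The variables $html, $doc and $rank are names made up for this sketch, the HTML is a shortened copy of the markup above, and $xpath_cat is assumed to be a DOMXPath as in the question.

```php
<?php
// Shortened copy of the markup from the question (assumption: the real page
// has the same structure directly under li#SalesRank).
$html = <<<'HTML'
<li id="SalesRank">
<b>Amazon Best Sellers Rank:</b>
#20,267 Paid in Kindle Store (
<a href="http://www.amazon.com/gp/bestsellers/digital-text/ref=pd_dp_ts_kstore_1/190-9295683-0277616">See Top 100 Paid in Kindle Store</a>
)
<ul class="zg_hrsr">
<li class="zg_hrsr_item"><span class="zg_hrsr_rank">#15</span></li>
</ul></li>
HTML;

// Real-world HTML is rarely valid, so collect parser warnings quietly.
libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($html);
libxml_clear_errors();

$xpath_cat = new DOMXPath($doc);

// text()[normalize-space()] drops whitespace-only text nodes;
// the trailing [1] keeps the first of the remaining ones.
$nodes = $xpath_cat->query('//li[@id="SalesRank"]/text()[normalize-space()][1]');

$rank = null;
if ($nodes !== false && $nodes->length > 0) {
    // Strip surrounding whitespace, then the trailing " (" left over
    // from the "(See Top 100 ...)" link that follows the rank.
    $rank = rtrim(trim($nodes->item(0)->nodeValue), " (");
}

echo $rank, PHP_EOL; // prints: #20,267 Paid in Kindle Store
```

The trailing "(" belongs to the same text node as the rank, so XPath 1.0 alone cannot remove it from a node-set result; that is why the last cleanup step is done with rtrim here rather than in the query.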