2015-10-12 256 views
2

我想獲得「href」,但不能。如何從標籤獲得「href」

我有這樣的代碼:

<h3 class="list-item-hd"> 
    <a href="/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009">Dwight Yoakam and Jack Black to Produce New Nashville Comedy</a> 
</h3> 

我怎樣才能得到?

/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009 

回答

1

該代碼使用CSS選擇器來獲取所有錨元素和打印他們href屬性:

require 'nokogiri' 

html = <<EOT 
<html> 
    <h3 class="list-item-hd"> 
     <a href="/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009">Dwight Yoakam and Jack Black to Produce New Nashville Comedy</a> 
    </h3> 
</html> 
EOT 

doc = Nokogiri::HTML(html) 
doc.css('a').map { |link| p link['href'] } 
# => "/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009" 

您可以使用CSS或XPath選擇通過或者實現自己的目標。要了解有關CSS和XPath選擇器的更多信息,我強烈建議您查看「Official Nokogiri Documentation for Searching a XML/HTML Document.