2014-05-24 48 views

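How do I read the content of an HTML tag in Ruby? I started playing around with Ruby 4 days ago. I want to find a tag, read its content, and print it to the console.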

This is my code so far. Can anyone help?

require 'open-uri' 

print "Enter a website domain you like (without the http://): " 
website = gets.chomp 

if website.count(".") >= 2 
    first_letter = website.index(".") + 1 # we want the letter just after the first dot 
    after_first_letter = first_letter + 1 
    last_letter = (website.index(".", after_first_letter) - 1) 
    website_title = website[first_letter..last_letter] 
elsif website.count(".") == 1 
    website_title = website[0..(website.index(".") - 1)] 
else 
    website_title = nil 
end 

unless website_title.nil? 
    file = open("http://#{website}") 
    contents = file.read 
    title = contents.index("<h3>") 

    p "This is the title: #{title}" 
else
    puts "We aren't sure what you did. Try again..." 
end 

Answer


Use Nokogiri to parse the HTML. Install it by running gem install nokogiri.

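require 'nokogiri'
require 'open-uri'

# `website` is the domain read from the user in the question's code.
# URI.open is needed on Ruby 3.0+, where Kernel#open no longer fetches URLs.
html = Nokogiri::HTML(URI.open("http://#{website}"))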

html.css('h3').each do |title_node| 
    puts "Title: #{title_node.content}" 
end 
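
To see why the code in the question prints a number rather than a title: String#index returns the position of the first match, not the text inside the tag. The sketch below uses a made-up inline HTML string (an assumption, standing in for whatever file.read returns) and only the standard library. The regular expression is for illustration on simple, well-formed markup; Nokogiri, as used above, is the more robust choice for real pages.

```ruby
# Hypothetical sample page; in the question this string would come from file.read.
contents = "<html><body><h3>First heading</h3><p>text</p><h3>Second</h3></body></html>"

# String#index gives the character offset of the first "<h3>", not its content.
position = contents.index("<h3>")

# String#scan with one capture group returns an array of one-element arrays,
# so flatten turns it into a plain list of the text between <h3> and </h3>.
titles = contents.scan(%r{<h3[^>]*>(.*?)</h3>}m).flatten

puts "Position of first <h3>: #{position}"
titles.each { |title| puts "Title: #{title}" }
```

The non-greedy `(.*?)` keeps each match from running past the first closing tag, which matters when a page has more than one h3.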

Thank you very much! – Veven