2017-09-23 113 views
0

我試圖用http://www.nasdaqomxnordic.com/shares/listed-companies/nordic-large-cap來抓取它的數據。這裏是我的代碼中使用的Net :: HTTP發送GET請求:Ruby Net :: HTTP 400錯誤請求

require 'net/http' 
require 'uri' 

def get_stocks() 
    uri = URI.parse('http://www.nasdaqomxnordic.com/aktier/listed-companies/stockholm') 
    response = Net::HTTP.get_response(uri) 
    puts response 
end 

get_stocks() 

其它網站我測試過的作品好,並用200迴應:OK,但http://www.nasdaqomxnordic.com/shares/listed-companies/nordic-large-cap返回#<Net::HTTPBadRequest:0x00007ffe8f84ec30>,我不明白爲什麼。

對於更詳細的上下文response.body回報:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> 
<html xmlns="http://www.w3.org/1999/xhtml"> 
<head> 
<META HTTP-EQUIV="CONTENT-TYPE" CONTENT="TEXT/HTML; CHARSET=utf-8"/> 
<title>400 Bad Request</title></head> 
<body> 
    <H2>400 Bad Request</H2> 
    <p>The request could not be understood by the server due to malformed syntax. The client SHOULD NOT repeat the request without modifications.</p> 
    <p>This page can't be displayed.<br/>The incident ID is: 10039581164792379.</p> 
    <p>If you would like assistance, please contact the Support for additional information.<br></p> 
</body> 
</html> 

我能做些什麼,以獲得一個200:OK?

任何幫助非常感謝!提前致謝!

回答

0

我認爲你需要設置請求的User-Agent屬性。 以下代碼有效。

require 'net/http' 
require 'uri' 

def get_stocks() 
    uri = URI.parse("http://www.nasdaqomxnordic.com/shares/listed-companies/nordic-large-cap") 
    http = Net::HTTP.new(uri.host, uri.port) 
    request = Net::HTTP::Get.new(uri.request_uri) 
    user_agent = 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.86 Safari/537.36' 
    request.initialize_http_header({"User-Agent" => user_agent}) 

    response = http.request(request) 
    puts response.inspect 
end 

get_stocks() # #<Net::HTTPOK 200 OK readbody=true> 

可以使用response.body

+0

謝謝你得到響應的身體!你爲我節省了很多挫折! – Villevillekulla