2011-05-21 67 views
1

500錯誤在本頁面看看:HTTP在wget的

http://www.ptmytrade.com/product.asp?id=61363

它加載罰款(至少在這裏)。現在我想用wget來抓取它。

$ wget http://www.ptmytrade.com/product.asp?id=61363 --debug 
DEBUG output created by Wget 1.12 on linux-gnu. 

--2011-05-21 18:24:51-- http://www.ptmytrade.com/product.asp?id=61363 
Resolving www.ptmytrade.com... 205.209.150.134 
Caching www.ptmytrade.com => 205.209.150.134 
Connecting to www.ptmytrade.com|205.209.150.134|:80... connected. 
Created socket 3. 
Releasing 0x0890e260 (new refcount 1). 

---request begin--- 
GET /product.asp?id=61363 HTTP/1.0 
User-Agent: Wget/1.12 (linux-gnu) 
Accept: */* 
Host: www.ptmytrade.com 
Connection: Keep-Alive 

---request end--- 
HTTP request sent, awaiting response... 
---response begin--- 
HTTP/1.1 500 Internal Server Error 
Connection: keep-alive 
Date: Sat, 21 May 2011 16:24:56 GMT 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
Content-Length: 471822 
Content-Type: text/html 
Set-Cookie: ASPSESSIONIDSCACCAQA=FOCCMJODFHHMOKNKPAIHJCIL; path=/ 
Cache-control: private 

---response end--- 
500 Internal Server Error 

Stored cookie www.ptmytrade.com -1 (ANY)/<session> <insecure> [expiry none] ASPSESSIONIDSCACCAQA FOCCMJODFHHMOKNKPAIHJCIL 
Registered socket 3 for persistent reuse. 
Disabling further reuse of socket 3. 
Closed fd 3 
2011-05-21 18:24:57 ERROR 500: Internal Server Error. 

OK,所以我檢查了頭使用我的瀏覽器(使用Live HTTP頭附加)抓取的網頁時:

http://www.ptmytrade.com/product.asp?id=61361 

GET /product.asp?id=61361 HTTP/1.1 
Host: www.ptmytrade.com 
User-Agent: Mozilla/5.0 (X11; Linux i686; rv:2.0) Gecko/20100101 Firefox/4.0 
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8 
Accept-Language: en-us,en;q=0.5 
Accept-Encoding: gzip, deflate 
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7 
Keep-Alive: 115 
Connection: keep-alive 
Cookie: ASPSESSIONIDSCACBBRA=AMPBLLNDGMFLNPNCPEBPNNLB; ASPSESSIONIDSCACCAQA=FJNBMJODLHHJNDHPFBIEEPEM 

HTTP/1.1 500 Internal Server Error 
Date: Sat, 21 May 2011 16:20:46 GMT 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
Content-Length: 471822 
Content-Type: text/html 
Cache-Control: private 
---------------------------------------------------------- 
http://www.ptmytrade.com/images/index_117.jpg 

GET /images/index_117.jpg HTTP/1.1 
Host: www.ptmytrade.com 
User-Agent: Mozilla/5.0 (X11; Linux i686; rv:2.0) Gecko/20100101 Firefox/4.0 
Accept: image/png,image/*;q=0.8,*/*;q=0.5 
Accept-Language: en-us,en;q=0.5 
Accept-Encoding: gzip, deflate 
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7 
Keep-Alive: 115 
Connection: keep-alive 
Referer: http://www.ptmytrade.com/product.asp?id=61361 
Cookie: ASPSESSIONIDSCACBBRA=AMPBLLNDGMFLNPNCPEBPNNLB; ASPSESSIONIDSCACCAQA=FJNBMJODLHHJNDHPFBIEEPEM 

HTTP/1.1 404 Not Found 
Content-Length: 1635 
Content-Type: text/html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
Date: Sat, 21 May 2011 16:20:48 GMT 

我不知道是怎麼回事。該頁面顯示得很好,但我得到了頭部中的500錯誤代碼。

這個問題是通過使用curl(它也得到了一個500,但獲取的頁面就好了)來解決,但我很好奇這裏會發生什麼。

回答

1

這是網頁中的一個錯誤。 HTTP狀態確實看起來不正確地設置爲HTTP 500.Firefox/Firebug也證實了這一點。基本上,你正面臨一個帶有「正常」內容的HTTP 500錯誤頁面。

舉報該網站管理員。

+1

爲什麼不該」 t'wget'獲取內容無論如何..? – Campa 2013-11-06 16:56:35

+2

@Campa:因爲HTTP狀態表明有錯誤。如果您對錯誤頁面內容感興趣,請改用curl。對於將來的問題,請按「提問問題」按鈕,而不是劫持別人的問題。 – BalusC 2013-11-06 17:05:45

+0

thx。我會馬上提出問題。 – Campa 2013-11-06 17:14:08