2014-02-06

python httplib.HTTPSConnection does not connect correctly (to bugzilla.mozilla.org)?

I want to get information from Bugzilla (bugzilla.mozilla.org). When I write code like the following,

# 
import httplib 
host = 'bugzilla.mozilla.org' 

h = httplib.HTTPSConnection(host) 
h.putrequest('GET', 'https://bugzilla.mozilla.org/index.cgi') 
h.putheader('Accept', 'application/x-ms-application, image/jpeg, application/xaml+xml, image/gif, image/pjpeg, application/x-ms-xbap, application/x-shockwave-flash, */*') 
h.putheader('User-Agent', "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; WOW64; Trident/5.0; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; .NET4.0C; .NET4.0E; InfoPath.3)") 
h.putheader('Host', host) 
h.putheader('Connection', 'Keep-Alive') 
h.endheaders() 

response = h.getresponse() 
print response.read() 

the server always returns

<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN"> 
<html><head> 
<title>301 Moved Permanently</title> 
</head><body> 
<h1>Moved Permanently</h1> 
<p>The document has moved <a href="https://bugzilla.mozilla.org/index.cgi">here</a>.</p> 
</body></html> 

However, the same code works fine against other HTTPS servers. Does anyone know what I am doing wrong?

Answers


httplib does not follow redirects (HTTP status code 301); you can use urllib2 instead:

from urllib2 import Request, urlopen 

req = Request('https://bugzilla.mozilla.org/index.cgi') 
req.add_header('Accept', 'application/x-ms-application, image/jpeg, application/xaml+xml, image/gif, image/pjpeg, application/x-ms-xbap, application/x-shockwave-flash, */*') 
req.add_header('User-Agent', "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; WOW64; Trident/5.0; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; .NET4.0C; .NET4.0E; InfoPath.3)") 
response = urlopen(req) #NOTE: it doesn't check server's ssl certificate 
print(response.headers) 
content = response.read()
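
For readers on Python 3, where httplib is named http.client, redirects can also be followed by hand. The following is only a sketch and not part of the original answer: the helper names split_target and fetch, the redirect limit, and the timeout are assumptions chosen for illustration. It sends only the path as the request target, lets the library add the Host header itself, and verifies the server certificate through a default SSL context. Because the 301 in the question points back at the same URL, the sketch caps the number of hops instead of looping forever.

```python
import http.client  # Python 3 name for Python 2's httplib
import ssl
from urllib.parse import urljoin, urlsplit

# Status codes treated as redirects in this sketch.
REDIRECT_CODES = {301, 302, 303, 307, 308}


def split_target(url):
    """Split an absolute URL into (host, request target).

    Hypothetical helper: the request target is the path plus query,
    which is what goes on the request line, not the full URL.
    """
    parts = urlsplit(url)
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    return parts.netloc, target


def fetch(url, max_redirects=5, timeout=10):
    """GET an https URL, following up to max_redirects redirects."""
    context = ssl.create_default_context()  # verifies certificate and hostname
    for _ in range(max_redirects + 1):
        host, target = split_target(url)
        conn = http.client.HTTPSConnection(host, timeout=timeout, context=context)
        try:
            # request() adds the Host header automatically.
            conn.request("GET", target)
            resp = conn.getresponse()
            location = resp.getheader("Location")
            if resp.status in REDIRECT_CODES and location:
                resp.read()  # drain the redirect body before closing
                # Location may be relative, so resolve it against the current URL.
                url = urljoin(url, location)
                continue
            return resp.status, resp.read()
        finally:
            conn.close()
    raise RuntimeError("too many redirects")


if __name__ == "__main__":
    status, body = fetch("https://bugzilla.mozilla.org/index.cgi")
    print(status, len(body))
```

If the server keeps redirecting to the same address, fetch raises RuntimeError after the hop limit, which makes that situation visible rather than silently returning the 301 page.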