2014-03-04 49 views
Problem while trying to scrape a JavaScript website using HtmlUnit in Jython

I want to use the HtmlUnit module in Jython to scrape a website that contains JavaScript.

import com.gargoylesoftware.htmlunit.WebClient as WebClient 
import com.gargoylesoftware.htmlunit.BrowserVersion as BrowserVersion 

import com.gargoylesoftware.htmlunit.FailingHttpStatusCodeException 
import com.gargoylesoftware.htmlunit.html.HtmlPage 
import com.gargoylesoftware.htmlunit.ScriptException 

def main(): 
    webClient = WebClient(BrowserVersion.CHROME) # creating a new webclient object. 
    webClient.waitForBackgroundJavaScript(50000) 
    webClient.setThrowExceptionOnScriptError(False) 
    webClient.setThrowExceptionOnFailingStatusCode(False) 


    url = "http://www.w3schools.com/js/DEFAULT.asp" 
    page = webClient.getPage(url) # getting the url 
    name = page.getByXPath("/html/body/div[3]/div/div/div[2]/h1/span") 
    print "name:", name[0].asText() 

main() 

Error(s) I am facing:

Traceback (most recent call last): 
    File "C:\Eclipse\learnJython\src\HTMLunit_basic.py", line 20, in <module> 
    main() 
    File "C:\Eclipse\learnJython\src\HTMLunit_basic.py", line 11, in main 
    webClient.setThrowExceptionOnScriptError(False) 
AttributeError: 'com.gargoylesoftware.htmlunit.WebClient' object has no attribute 'setThrowExceptionOnScriptError' 

Question: even though I import these modules, Eclipse reports them as unused:

import com.gargoylesoftware.htmlunit.FailingHttpStatusCodeException 
import com.gargoylesoftware.htmlunit.ScriptException 

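I imported these modules to get around errors caused by JavaScript processing. I only want to focus on a few specific scripts and do not want HtmlUnit to load all of them. At the moment HtmlUnit loads every script, including the Facebook plugin, which is blocked by our company network.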

I am new to programming; please help me solve this problem.

Answers
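
One likely cause of the AttributeError, assuming a newer HtmlUnit jar is on the classpath: in later releases (2.11 onward, as far as I know) the setThrowExceptionOn... setters are no longer on WebClient itself but on the options object returned by webClient.getOptions(). The sketch below is not from the original post; configure_client is a hypothetical helper name, and it simply picks whichever object carries the setters so the same script runs against either API.

```python
def configure_client(web_client):
    """Turn off exceptions for script errors and failing HTTP status codes.

    Assumption: newer HtmlUnit versions expose these setters on the object
    returned by getOptions(); older versions expose them on WebClient itself.
    """
    # Prefer the options object when the client provides one.
    if hasattr(web_client, "getOptions"):
        target = web_client.getOptions()
    else:
        target = web_client

    target.setThrowExceptionOnScriptError(False)
    target.setThrowExceptionOnFailingStatusCode(False)
    # Return the configured object so the caller can set further options on it.
    return target
```

In main() this would replace the two failing calls, for example configure_client(webClient) right after the WebClient is created. Separately, waitForBackgroundJavaScript only waits for background jobs that are already scheduled, so it is probably more useful when called after getPage(url) rather than before it.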