Problem scraping a JavaScript website with HtmlUnit in Jython
I want to use the HtmlUnit library in Jython to scrape a website that contains JavaScript.
import com.gargoylesoftware.htmlunit.WebClient as WebClient
import com.gargoylesoftware.htmlunit.BrowserVersion as BrowserVersion
import com.gargoylesoftware.htmlunit.FailingHttpStatusCodeException
import com.gargoylesoftware.htmlunit.html.HtmlPage
import com.gargoylesoftware.htmlunit.ScriptException

def main():
    webClient = WebClient(BrowserVersion.CHROME)  # creating a new webclient object.
    webClient.waitForBackgroundJavaScript(50000)
    webClient.setThrowExceptionOnScriptError(False)
    webClient.setThrowExceptionOnFailingStatusCode(False)
    url = "http://www.w3schools.com/js/DEFAULT.asp"
    page = webClient.getPage(url)  # getting the url
    name = page.getByXPath("/html/body/div[3]/div/div/div[2]/h1/span")
    print "name:", name[0].asText()

main()
Error(s) I am facing:
Traceback (most recent call last):
  File "C:\Eclipse\learnJython\src\HTMLunit_basic.py", line 20, in <module>
    main()
  File "C:\Eclipse\learnJython\src\HTMLunit_basic.py", line 11, in main
    webClient.setThrowExceptionOnScriptError(False)
AttributeError: 'com.gargoylesoftware.htmlunit.WebClient' object has no attribute 'setThrowExceptionOnScriptError'
Question: even though I import these modules, Eclipse reports them as unused:
import com.gargoylesoftware.htmlunit.FailingHttpStatusCodeException
import com.gargoylesoftware.htmlunit.ScriptException
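I imported these modules to work around errors caused by JavaScript processing. I only want to focus on a few specific scripts and do not want HtmlUnit to load every script. Currently, HtmlUnit is loading all of them. The Facebook plugin is blocked by our company network.
I am new to programming, please help me solve this.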
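
To make the "load only some scripts" goal concrete, here is a minimal sketch of the kind of host-based filter I have in mind. It is plain Python with no HtmlUnit calls, so it should run under Jython as well. The names `BLOCKED_HOSTS` and `is_blocked` are hypothetical, and the host list is only an assumption based on the Facebook plugin mentioned above. One commonly described way to wire such a check into HtmlUnit is to call it from the `getResponse` override of a `WebConnectionWrapper` subclass, but I have not verified that against my installed version.

```python
# Import urlparse in a way that works on Python 2 (Jython) and Python 3.
try:
    from urllib.parse import urlparse
except ImportError:
    from urlparse import urlparse

# Hypothetical block list; these hosts are illustrative assumptions.
BLOCKED_HOSTS = ("facebook.com", "facebook.net")


def is_blocked(url, blocked_hosts=BLOCKED_HOSTS):
    """Return True if the URL's host is a blocked host or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    for blocked in blocked_hosts:
        # Match the host exactly, or as a parent domain (note the leading dot).
        if host == blocked or host.endswith("." + blocked):
            return True
    return False
```

With this, `is_blocked("http://connect.facebook.net/en_US/all.js")` is true, while the w3schools URL from my script is not blocked, so only the unwanted third-party scripts would be skipped.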