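Using PyQt4 and Beautiful Soup to loop over web pages

I'm trying to extract some data from multiple pages of a website whose content is generated with JavaScript, so I'm using PyQt4 and Beautiful Soup to loop over the pages and extract a few data fields.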

import sys
from bs4 import BeautifulSoup
from PyQt4.QtGui import QApplication
from PyQt4.QtCore import QUrl
from PyQt4.QtWebKit import QWebPage


class Client(QWebPage):

    def __init__(self, url):
        self.app = QApplication(sys.argv)
        QWebPage.__init__(self)
        self.loadFinished.connect(self.on_page_load)
        self.mainFrame().load(QUrl(url))
        self.app.exec_()

    def on_page_load(self):
        self.app.quit()


products_titles = []
urls = ['url1', 'url2', 'url3']

for url in urls:
    print "Parsing URL: " + url + '\n'
    client_response = Client(url)
    source = client_response.mainFrame().toHtml()
    soup = BeautifulSoup(source, "html.parser")
    print get_product_category(soup)

But when I run it, it crashes with this error:
QObject::connect: Cannot connect (null)::configurationAdded(QNetworkConfiguration) to QNetworkConfigurationManager::configurationAdded(QNetworkConfiguration)
QObject::connect: Cannot connect (null)::configurationRemoved(QNetworkConfiguration) to QNetworkConfigurationManager::configurationRemoved(QNetworkConfiguration)
QObject::connect: Cannot connect (null)::configurationChanged(QNetworkConfiguration) to QNetworkConfigurationManager::configurationChanged(QNetworkConfiguration)
QObject::connect: Cannot connect (null)::onlineStateChanged(bool) to QNetworkConfigurationManager::onlineStateChanged(bool)
QObject::connect: Cannot connect (null)::configurationUpdateComplete() to QNetworkConfigurationManager::updateCompleted()
[1] 14809 segmentation fault python products.py
I don't know what I'm doing wrong. If you know what is going on, please help.
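
One likely cause (an assumption; the output above does not confirm it) is that the loop builds a new Client, and with it a new QApplication, for every URL, while Qt expects a single application object per process. A minimal sketch of the alternative is to create the QApplication once and let one QWebPage load the URLs one after another. The names render_pages, load_current, on_load_finished and state are hypothetical and not part of the original code, and the PyQt4 imports are done inside the function only so that the module can be imported where PyQt4 is absent.

```python
import sys


def render_pages(urls):
    """Render every URL with one shared QApplication; return {url: html}."""
    urls = list(urls)
    if not urls:
        # Nothing to load, so never start the Qt event loop.
        return {}

    # Imported lazily so this module still imports where PyQt4 is missing.
    from PyQt4.QtCore import QUrl
    from PyQt4.QtGui import QApplication
    from PyQt4.QtWebKit import QWebPage

    # Reuse an existing application object if there is one; otherwise
    # create it exactly once for the whole run.
    app = QApplication.instance() or QApplication(sys.argv)
    page = QWebPage()
    results = {}
    # Mutable holder instead of nonlocal, which Python 2 does not have.
    state = {"index": 0}

    def load_current():
        page.mainFrame().load(QUrl(urls[state["index"]]))

    def on_load_finished(ok):
        # Store the rendered HTML (even when ok is False), then move on.
        results[urls[state["index"]]] = page.mainFrame().toHtml()
        state["index"] += 1
        if state["index"] < len(urls):
            load_current()
        else:
            # All pages are done: leave the event loop once.
            app.quit()

    page.loadFinished.connect(on_load_finished)
    load_current()
    app.exec_()
    return results
```

Each value in the returned dictionary could then be passed to BeautifulSoup(html, "html.parser") in the same way as in the loop above. Whether this removes the segmentation fault on a given setup is untested here.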
Thanks, it works great and is faster than my solution! – melhirech