Selenium WebDriver in Python

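So basically I'm trying to use Selenium WebDriver in Python to open and log in to a website called marketsworld and retrieve the market value of a stock. I have a script that opens the page and logs in. What I don't know is how to capture the market value and print it out. I used Inspect Element to find the class; the market value, which changes constantly, is displayed between the opening and closing tags: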

<p class="market_value"> 

</p>

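I tried assigning driver.find_element_by_class("market_value") to a variable and printing that variable. The printout is something like "object at 0x" followed by an address. Is there any way to return what the element actually displays?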


2015-06-25 dvitt90

Please share the URL of the web page. –

The URL is https://marketsworld.com – dvitt90

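If you have to use Selenium for navigation, for example on a JavaScript-heavy site, I'd suggest grabbing the page source and using an HTML parser to extract the data you want.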

BeautifulSoup is an excellent choice of parser. For example:

from bs4 import BeautifulSoup

html = driver.page_source
soup = BeautifulSoup(html, 'html.parser')

# Get *all* 'p' tags with the specified class attribute.
p_tags = soup.find_all('p', {'class': 'market_value'})
for p in p_tags:
    print(p.text)

This should print the text contained in every <p> tag with the class market_value. Without knowing the exact page source, though, it's hard to give specifics.
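If installing BeautifulSoup isn't an option, roughly the same extraction can be sketched with the standard library's html.parser. This is only an illustration: the MarketValueParser class, the extract_market_values helper, and the sample markup are made up for the example, and the real page's structure may differ.

```python
from html.parser import HTMLParser


class MarketValueParser(HTMLParser):
    """Collects the text inside every <p class="market_value"> element."""

    def __init__(self):
        super().__init__()
        self.values = []       # text of each matching <p>, in document order
        self._inside = False   # True while inside a matching <p>
        self._chunks = []      # text fragments of the current match

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "p" and "market_value" in classes:
            self._inside = True
            self._chunks = []

    def handle_data(self, data):
        if self._inside:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "p" and self._inside:
            self.values.append("".join(self._chunks).strip())
            self._inside = False


def extract_market_values(html):
    """Return the stripped text of each <p class="market_value"> in html."""
    parser = MarketValueParser()
    parser.feed(html)
    parser.close()
    return parser.values


# Made-up sample standing in for driver.page_source.
sample_html = '<div><p class="market_value"> 101.25 </p><p class="other">x</p></div>'
print(extract_market_values(sample_html))
```

With a live session you would pass driver.page_source instead of sample_html.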

However, if you're sure you want to stick strictly to Selenium, you can find the elements like this:

from selenium.webdriver.common.by import By

# Selenium 4 removed the find_element(s)_by_* helpers; use find_element(s) with By instead.

# Get *all* elements with the specified class attribute.
elements = driver.find_elements(By.CLASS_NAME, 'market_value')
for element in elements:
    print(element.text)

# or

# Get a *single* element with the specified class attribute.
element = driver.find_element(By.CLASS_NAME, 'market_value')
print(element.text)
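Since the question mentions that the market value keeps changing, one option is to read the element's text several times at an interval. The sketch below is a hedged illustration, not part of the original answer: poll_values and its parameters are hypothetical names, and a stand-in reader with made-up prices replaces the browser so the snippet runs on its own.

```python
import time


def poll_values(read_value, samples=3, interval=1.0):
    """Call read_value() `samples` times, sleeping `interval` seconds between calls."""
    readings = []
    for i in range(samples):
        readings.append(read_value())
        if i < samples - 1:
            time.sleep(interval)
    return readings


# Stand-in reader so the sketch runs without a browser; the prices are made up.
fake_prices = iter(["101.25", "101.30", "101.28"])
print(poll_values(lambda: next(fake_prices), samples=3, interval=0))
```

With a real driver, the reader could be something like lambda: driver.find_element(By.CLASS_NAME, 'market_value').text, with By imported from selenium.webdriver.common.by.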


2015-06-25 16:06:59 That1Guy

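Thanks for the help. I'm fairly new to using Python to extract information from web pages, and I picked Selenium as a starting point. If you have any suggestions for getting the same result, I'd happily do it a different way. – dvitt90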

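Take a look at [urllib2](https://docs.python.org/2/library/urllib2.html). It may help shed some light on when Selenium is and isn't needed for scraping. Also take a look at the BeautifulSoup documentation I linked to in my answer. It will hopefully show you how to do HTML parsing properly. – That1Guy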
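To make the "when is Selenium needed" point concrete, here is a rough sketch using urllib.request, the Python 3 successor to urllib2. It fetches a page without a browser and checks whether the class of interest appears in the raw HTML; if it doesn't, the value is probably filled in by JavaScript or only visible after logging in, and a browser-driven tool like Selenium is more likely to be required. fetch_html and appears_in_static_html are hypothetical helper names, and the User-Agent header is an assumption, not something the site is known to require.

```python
from urllib.request import Request, urlopen


def fetch_html(url, timeout=10):
    """Download a page without a browser; no JavaScript is executed."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def appears_in_static_html(html, class_name="market_value"):
    """Rough check: is the class name mentioned anywhere in the raw HTML?"""
    return class_name in html


# Not called here, to avoid a network request when the sketch is run:
# html = fetch_html("https://marketsworld.com")
static_sample = '<p class="market_value">101.25</p>'  # made-up markup
print(appears_in_static_html(static_sample))
```

The substring check is deliberately crude; once you know the element is present, a real HTML parser is the better tool for pulling out its text.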
