我刮雅虎財經網站獲取公司股票數據,我用美麗的湯提取td標籤,但我想刪除span標籤,無法做到這一點。以下是我需要提取文本的html代碼的幾行代碼。如何從td美麗的湯中刪除跨度Python 3.5
[ < td class = "Py(10px) Ta(start)"
data - reactid = "53" > < span data - reactid = "54" > 31 - Jul - 2017 < /span></td > , < td class = "Py(10px)"
data - reactid = "55" > < span data - reactid = "56" > 991.90 < /span></td > , < td class = "Py(10px)"
data - reactid = "57" > < span data - reactid = "58" > 1, 021.70 < /span></td > , < td class = "Py(10px)"
data - reactid = "59" > < span data - reactid = "60" > 986.75 < /span></td > , < td class = "Py(10px)"
data - reactid = "61" > < span data - reactid = "62" > 1, 011.20 < /span></td >
]
我下面的代碼給了我上面的內容。
INFY = url.urlopen("https://in.finance.yahoo.com/quote/INFY.NS/history?p=INFY.NS")
INFYHis = INFY.read()
INFYSoup = soup(INFYHis,'html.parser')
INFYtd=INFYSoup.findAll("td",{"class":"Py(10px)"})
我對python非常陌生,不確定如何獲取刪除或獲取我的分析文本。
那麼你想刪除它或獲取文本? –
是的,我需要得到的文本,並以數據框的形式,以便我可以使用它作爲熊貓datafrome –