這裏是HTML
內容:混淆使用BeautifulSoup讀取html表格內容?
<table cellspacing="1" cellpadding="0" class="data">
<tr class="colhead">
<th colspan="3">Expression</th>
</tr>
<tr class="colhead">
<th>Task</th>
<th>Action</th>
<th>List</th>
</tr>
<tr class="rowLight">
<td width="40%">
Task1
</td>
<td width="20%">
Assigned to
</td>
<td width="40%">
Harry
</td>
</tr>
<tr class="rowDark">
<td width="40%">
Task2
</td>
<td width="20%">
Rejected by
</td>
<td width="40%">
Lopa
</td>
</tr>
<tr class="rowLight">
<td width="40%">
Task5
</td>
<td width="20%">
Accepted By
</td>
<td width="40%">
Mathew
</td>
</tr>
現在我得爲以下值:(如下表只不過是Excel表格,我將建立,一旦達到該值。)
Task Action List
Task1 Assigned to Harry
Task2 Rejected by Lopa
Task5 Accepted By Mathew
一個世俗的人解我所知道的,如下:
from bs4 import BeautifulSoup
soup = BeautifulSoup(source_URL)
alltables = soup.findAll("table", {"border":"2", "width":"100%"})
t = [x for x in soup.findAll('td')]
[x.renderContents().strip('\n') for x in t]
但在我上面HTML
內容,結構不存在,那麼如何處理?請在這裏指導我!
任何人都可以幫助我嗎? –