2017-10-19 35 views
0

文本我想從這個html頁面部分捕字「銅獎」:如何捕捉從HTML頁面

<tr class=""> 
    <td align="left" csk="Nikpai,Rohullah"><a href="/olympics/athletes/ni/rohullah-nikpai-1.html">Rohullah Nikpai</a></td> 
    <td align="right" >25</td> 
    <td align="left" ><a href="/olympics/summer/2012/TKW/mens-featherweight.html">Men's Featherweight</a></td> 
    <td align="right" csk="3">3T </td> 
    <td align="left" class=" Bronze" csk="1"><strong>Bronze</strong></td> 
</tr> 

我嘗試不同的代碼,但我在意圖失敗。許多嘗試之一是以下幾點:

foreach($html4->find('td align="left" strong') as $tag4) { 


           echo $prova = $tag4->innertext . "\n"; 

          } 

其中html4是整個html頁我必須處理。

+2

什麼類是'$ html4'一個instanceof? – delboy1978uk

回答

0

有了下面的代碼,你可以得到的類名「銅獎」

<?php 
$html='<tr class=""> 
    <td align="left" csk="Nikpai,Rohullah"><a href="/olympics/athletes/ni/rohullah-nikpai-1.html">Rohullah Nikpai</a></td> 
    <td align="right" >25</td> 
    <td align="left" ><a href="/olympics/summer/2012/TKW/mens-featherweight.html">Mens Featherweight</a></td> 
    <td align="right" csk="3">3T </td> 
    <td align="left" class=" Bronze" csk="1"><strong>Bronze</strong></td> 
</tr>'; 


$dom = new DOMDocument(); 
@$dom->loadHTML($html); 
foreach($dom->getElementsByTagName('td') as $link) { 
     echo trim($link->getAttribute('class'),' ');    
} 
?> 

或者,如果你喜歡的節點值,而不是類名和CSK attribut總是1:

foreach($dom->getElementsByTagName('td') as $link) { 
    if ($link->getAttribute('csk')=="1"){ 
     echo $link->nodeValue; 
    } 
}