2012-04-25 68 views
1

如何使用php從xml頁面獲取內容。 內容如下:web scraping從xml中獲取內容

<entry> 
    <title>News</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

<entry> 
    <title>News2</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username2</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

<entry> 
    <title>News3</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username3</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

<entry> 
    <title>News4</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username4</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

我怎樣才能拿到冠軍的數組,博客鏈接<link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/>,筆者細節,如名稱和URI(配置文件鏈接),並總結出使用PHP?

回答

1

退房simplexml的,XPath的 http://php.net/manual/en/book.simplexml.php

$file = 'url or file name'; 
    $xml = simplexml_load_file('$file'); 
    $list= $xml->xpath("/entry"); // root/entry ... 
    print $list[0]->id; 
    #var_dump($list);