2014-04-01 84 views

This question about parsing images and text is a continuation of the thread here: Trying to Parse Only the Images from an RSS Feed

This time, I want to parse the images and certain items from an RSS feed. A sample of the RSS feed looks like this:

<channel> 
<atom:link href="http://mywebsite.com/rss" rel="self" type="application/rss+xml" /> 

<item> 
<title>Article One</title> 
<guid isPermaLink="true">http://mywebsite.com/details/e8c5106</guid> 
<link>http://mywebsite.com/geturl/e8c5106</link> 
<comments>http://mywebsite.com/details/e8c5106#comments</comments>  
<pubDate>Wed, 09 Jan 2013 02:59:45 -0500</pubDate> 
<category>Category 1</category>  
<description> 
     <![CDATA[<div> 
     <img src="http://mywebsite.com/myimages/1521197-main.jpg" width="120" border="0" /> 
     <ul><li>Poster: someone's name;</li> 
     <li>PostDate: Tue, 08 Jan 2013 21:49:35 -0500</li> 
     <li>Rating: 5</li> 
     <li>Summary:Lorem ipsum dolor </li></ul></div><div style="clear:both;">]]> 
     </description> 
</item> 
<item>.. 

Below is the code I have, in which I try to parse the images and the text:

$xml = simplexml_load_file('http://mywebsite.com/rss?t=2040&dl=1&i=1'); 

$descriptions = $xml->xpath('//item/description'); 
$mytitle= $xml->xpath('//item/title'); 

foreach ($descriptions as $description_node) { 
    // The description may not be valid XML, so use a more forgiving HTML parser mode 
    $description_dom = new DOMDocument(); 
    $description_dom->loadHTML((string)$description_node); 

    // Switch back to SimpleXML for readability 
    $description_sxml = simplexml_import_dom($description_dom); 

    // Find all images, and extract their 'src' param 
    $imgs = $description_sxml->xpath('//img'); 
    foreach($imgs as $image) { 
     echo "<img id=poster class=poster src={$image['src']}> {$mytitle}"; 
     } 
    } 

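The code above extracts the images beautifully... However, the last line of my code does not output the $mytitle value (which would be "Article One"). The title should be extracted for every item in the RSS feed.

Can anyone help me figure this one out, please?

Many thanks,

Hernan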

+0

The XPath is correct. Maybe you need to call '->nodeValue' on '$mytitle' to get the node content. – helderdarocha

+0

Actually, since you have many 'item' elements, you will need to use '->item(0)' to get the first one. – helderdarocha
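A small sketch of what `$mytitle` actually holds may help here. With SimpleXML, `xpath()` hands back a plain PHP array of `SimpleXMLElement` objects, so the usual way to read one title is to index the array and cast to string; `->nodeValue` and `->item()` belong to the DOM classes (`DOMNode`, `DOMNodeList`) rather than to SimpleXML. Interpolating the array itself, as in `{$mytitle}`, prints the word "Array" along with an array-to-string conversion notice. The inline feed and the second title below are made up for illustration.

```php
<?php
// Hypothetical two-item feed, standing in for the real RSS source.
$xml = simplexml_load_string(
    '<channel><item><title>Article One</title></item>'
    . '<item><title>Article Two</title></item></channel>'
);

// xpath() returns a PHP array of SimpleXMLElement objects.
$titles = $xml->xpath('//item/title');

var_dump(is_array($titles));   // bool(true)

// Index the array, then cast the element to a string to get its text.
$first = (string) $titles[0];
echo $first, "\n";             // Article One
echo count($titles), "\n";     // 2
```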

+0

Thanks Helderdarocha... unfortunately my knowledge is not that advanced, and I cannot follow your explanation. The problem is that I have to extract the content inside one field as well as the content in another. This repeats many times in the RSS feed, which is exactly what I want. – Hernandito

Answer

1 (accepted)

xpath() always returns an array (see http://www.php.net/manual/en/simplexmlelement.xpath.php), even if the result is only a single element. If you know you should expect exactly one element, you can simply use $mytitle[0].

You will have to iterate over each <item/> element, otherwise you cannot know which description and which title belong together. So looping over the items, and reading the title and description of each one inside the loop, should work.

By the way, I also added the quotes ("") to the attributes of your <img/> element. I guess you want that, since this looks very much like XML/HTML.

Source: https://stackoverflow.com/q/22838573 – dirkk, 2014-04-03 13:00:59

+0

Thanks Dirkk... I think we are getting close... The RSS feed has multiple items that I want to scrape. Each item has an embedded "title" and the image that my code above handles. So in my foreach, I want to scrape and echo the corresponding image along with the corresponding "title" for each item in the feed. Your code returned the same title for all of the items in the feed. – Hernandito

+0

@Hernandito I updated my answer. You have to adjust the logic of your program a bit, otherwise the descriptions and titles will never be related to each other. You should iterate over each "item" and then look up the elements you need. – dirkk

+0

Dirkk.... it works like a charm! I spent 2 days of trial and error trying to solve this. Thank you so much!!! – Hernandito
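The per-item iteration described in the answer can be sketched roughly as follows. This is one possible reading of that advice, not the answerer's exact code. It assumes an inline two-item sample in place of the real feed URL; the second item, its image URL, and the `$output` array are hypothetical additions for illustration. The `id` attribute from the question is left out because the tag is emitted once per item, and an `id` should be unique within a page.

```php
<?php
// Hypothetical inline sample standing in for the real feed.
$rss = <<<'XML'
<rss version="2.0"><channel>
<item>
<title>Article One</title>
<description><![CDATA[<div><img src="http://mywebsite.com/myimages/1521197-main.jpg" width="120" border="0" /></div>]]></description>
</item>
<item>
<title>Article Two</title>
<description><![CDATA[<div><img src="http://mywebsite.com/myimages/two.jpg" /></div>]]></description>
</item>
</channel></rss>
XML;

$xml = simplexml_load_string($rss);
$output = [];

// Collect HTML parser warnings internally instead of printing them.
libxml_use_internal_errors(true);

// Walk the feed one <item> at a time so title and image stay paired.
foreach ($xml->xpath('//item') as $item) {
    $title = (string) $item->title;
    $html  = (string) $item->description;
    if (trim($html) === '') {
        continue; // nothing to parse for this item
    }

    // The description is HTML, so parse it with the forgiving HTML loader.
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    $description = simplexml_import_dom($dom);

    foreach ($description->xpath('//img') as $img) {
        $src = htmlspecialchars((string) $img['src'], ENT_QUOTES);
        $output[] = "<img class=\"poster\" src=\"{$src}\"> "
            . htmlspecialchars($title);
    }
}

echo implode("\n", $output), "\n";
```

Because the title is read from the same `$item` as the description, each image is echoed next to the title of the item it came from, rather than next to one title shared by the whole feed.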