0
我在R中使用xml2包來提取具有相同類名稱的某些節點。我試圖提取出現在xml中'角色'和'公司'標籤下面的開始和結束日期(都有類名'日期')。但還有其他的日期標籤與我不需要的培訓相關。另外,格式因xml而異。有什麼功能可以幫助我選擇每個角色標籤後面的日期標籤嗎?下面是XML片段:如何使用R在XML中選擇一些具有相同名稱的特定節點
<span class="work-hist-mark" id="57" inprof="n">CAREER HISTORY:</span>
No Company Position Years *
<span class="company" id="58" inprof="y">Nasioncom</span>
<span class="role" id="59_1" inprof="y">Helpdesk</span>
1st level
<span class="date" id="60_1" inprof="y">Jan 1999</span>
-
<span class="date" id="60_2" inprof="y">June 2000</span>
*
<span class="role" id="61_1_1" inprof="y">Komputer Sistem System Engineer</span>
<span class="date" id="61_2_1" inprof="y">June 2000</span>
-
<span class="date" id="61_2_2" inprof="y">Oct 2003</span>
*
<span class="role" id="62_1_1" inprof="y">Servicesoft Network Engineer</span>
<span class="date" id="62_2_1" inprof="y">Oct 2003</span>
-
<span class="date" id="62_2_2" inprof="y">June 2006</span>
*
<span class="company" id="63_1" inprof="y">EDS</span>
<span class="role" id="63_2_1" inprof="y">Infrastructure Associate</span>
<span class="date" id="63_3_1" inprof="y">July</span>
-
<span class="date" id="63_3_2" inprof="y">Nov 2006</span>
*
<span class="company" id="64_1" inprof="y">Atos Origin</span>
<span class="role" id="64_2_1" inprof="y">Technical Specialist</span>
<span class="date" id="64_3_1" inprof="y">Nov 2006</span>
-
<span class="date" id="64_3_2" inprof="y">Nov 2008</span>
*
<span class="company" id="65" inprof="y">Hewlett Packard</span>
<span class="role" id="66_1" inprof="y">Wintel Server Specialist</span>
Level 3
<span class="date" id="67_1" inprof="y">Nov 2008</span>
to
<span class="date" id="67_2" inprof="y">present</span>
TRAINING ATTENDED:
<span class="date" id="68" inprof="y">2001</span>
<span class="sofwr" id="69" inprof="y">HP</span>
&
<span class="sofwr" id="70" inprof="y">Compaq Proliant server</span>
series
<span class="date" id="71_1_1" inprof="y">2003</span>
/
<span class="date" id="71_1_2" inprof="y">05</span>
<span class="role" id="71_2_1" inprof="y">Sophos Antivirus Technical Consultant</span>
<span class="company" id="71_3" inprof="y">Mail Monitor SMTP</span>
<span class="location" id="71_4" inprof="y">Pure</span>
Message for
<span class="sofwr" id="72" inprof="y">Exchange</span>
or
<span class="sofwr" id="73" inprof="y">UNIX</span>
(antivirus + antispam) SAV Integrated (http web scanning) Remote Update (design for mobile user) Sophos in multiple platforms (open source eg:
<span class="sofwr" id="74" inprof="y">UNIX</span>
,
<span class="sofwr" id="75" inprof="y">Linux</span>
,
<span class="sofwr" id="76" inprof="y">Mac9 &10</span>
,
<span class="sofwr" id="77" inprof="y">FreeBSD</span>
)
<span class="company" id="78" inprof="n">Small Business Enterprise</span>
<span class="date" id="79" inprof="y">2005</span>
Watchguard X500/ X2500 Add-on:
<span class="company" id="80" inprof="y">GatewayAV, Weblocker & Spam</span>
screen
<span class="date" id="81" inprof="n">2007</span>
<span class="sofwr" id="82" inprof="y">Microsoft Windows Vista</span>
Install, configuring and managing
<span class="sofwr" id="83" inprof="y">Windows Vista</span>