2016-04-02 128 views
2

我使用ElementTree的與Python解析XML文件來尋找subchildElementTree的find()方法總是返回無

的內容,這是我試圖解析XML文件:

<?xml version='1.0' encoding='UTF-8'?> 
<nvd xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://nvd.nist.gov/feeds/cve/1.2" nvd_xml_version="1.2" pub_date="2016-02-10" xsi:schemaLocation="http://nvd.nist.gov/feeds/cve/1.2 http://nvd.nist.gov/schema/nvdcve_1.2.1.xsd"> 
    <entry type="CVE" name="CVE-1999-0001" seq="1999-0001" published="1999-12-30" modified="2010-12-16" severity="Medium" CVSS_version="2.0" CVSS_score="5.0" CVSS_base_score="5.0" CVSS_impact_subscore="2.9" CVSS_exploit_subscore="10.0" CVSS_vector="(AV:N/AC:L/Au:N/C:N/I:N/A:P)"> 
    <desc> 
     <descript source="cve">ip_input.c in BSD-derived TCP/IP implementations allows remote attackers to cause a denial of service (crash or hang) via crafted packets.</descript> 
    </desc> 
    <loss_types> 
     <avail/> 
    </loss_types> 
    <range> 
     <network/> 
    </range> 
    <refs> 
     <ref source="OSVDB" url="http://www.osvdb.org/5707">5707</ref> 
     <ref source="CONFIRM" url="http://www.openbsd.org/errata23.html#tcpfix">http://www.openbsd.org/errata23.html#tcpfix</ref> 
    </refs> 

這是我的代碼:

import xml.etree.ElementTree as ET 

if __name__ == '__main__': 
    tree = ET.parse('nvdcve-modified.xml') 
    root = tree.getroot() 

    print root.find('entry') 
    print root[0].find('desc') 

輸出爲無兩行

回答

2

您的XML具有默認命名空間d定義在根元素級別:

xmlns="http://nvd.nist.gov/feeds/cve/1.2" 

沒有前綴的後代元素隱式地繼承祖先的默認命名空間。爲了找到空間元素,你可以映射的前綴命名空間URI和使用前綴,像這樣:

ns = {'d': 'http://nvd.nist.gov/feeds/cve/1.2'} 
root.find('d:entry', ns) 

或直接使用空間URI:

root.find('{http://nvd.nist.gov/feeds/cve/1.2}entry') 
+0

感謝這個作品! –