Python：從XML文檔中獲取某些值的有效腳本？

我有下面的XML文檔Python：從XML文檔中獲取某些值的有效腳本？

<data> 
    <point address="com.example.www" time="Jul 30, 2013 10:02:56 PM" protocol="http" type="2" body="404 Not Found" name="Example Site" /> 
    <point address="com.example.test" time="Jul 29, 2013 07:45:03 AM" protocol="https" type="2" body="This is a test" name="Test.example" /> 
    ....... 
</data>

我用下面的Python代碼：

import libxml2 

def ReadValue(pn, dt): 
    return [attr.content for attr in input_file.xpathEval("/data/point[@protocol='%s']/@%s" % (pn, dt))] 

protocol = ["http", "https"] 
data_type = ["body", "type", "time", "name"] 

for i in protocol: 
    for j in data_type: 
     print ReadValue(i, j) 

exit()

我懷疑ReadValue是瓶頸時，我分析了20萬組的標籤。它運行的非常慢，即使我在運行時無法按Ctrl-C腳本。有沒有比使用上面提到的代碼更好的實現？

謝謝

來源

2013-07-31 comepradz

但它可能是另一種低效率。

你正在爲每個協議和每個data_type循環，然後爲每個組合運行一個xpath，每個組合需要花費很高的代價，我懷疑代價高昂的搜索，因爲我沒有索引。

在讀取每個元素並收集數據時，您最好做一遍XML並讀取協議和data_type

來源

2013-07-31 22:54:31 Mark

Python：從XML文檔中獲取某些值的有效腳本？

回答

相關問題