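Parsing XBRL with Python and regular expressions to find TextBlocks

I am using Python and ElementTree to access a list of .xml files scraped from EDGAR. I have read and re-read the ElementTree page on python.org, but I still do not understand how to drill down into the data. How can I use ElementTree to reach the first TextBlock in each of the .xml files listed below?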
import re
from urllib.request import urlopen
import requests
import xml.etree.ElementTree as ET

full_xml = [
    'https://www.sec.gov/Archives/edgar/data/1593001/000121390017010242/ngtf-20170630.xml',
    'https://www.sec.gov/Archives/edgar/data/13573/000143774917016692/bwla-20170702.xml',
    'https://www.sec.gov/Archives/edgar/data/1652871/000165287117000030/none-20170630.xml',
    'https://www.sec.gov/Archives/edgar/data/1434674/000154972717000042/chnd-20170630_cal.xml',
    'https://www.sec.gov/Archives/edgar/data/1083922/000130841117000030/arao-20170331.xml',
]
for url in full_xml:
    # Download each filing and parse the response into an ElementTree
    with urlopen(url) as response:
        tree = ET.parse(response)
    root = tree.getroot()
    print(root)
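For reference, here is a minimal sketch of one way to reach the first TextBlock once a root element is available. It assumes that TextBlock facts can be recognised by a local tag name ending in "TextBlock". The helper names `local_name` and `first_text_block` and the inline `SAMPLE` document are made up for illustration and do not come from any of the filings above. It uses no regular expressions; it walks the tree with `iter()` and strips the `{namespace}` prefix that ElementTree puts in front of each tag.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily shortened instance document used only for this sketch.
SAMPLE = """<xbrl xmlns="http://www.xbrl.org/2003/instance"
      xmlns:us-gaap="http://fasb.org/us-gaap/2017-01-31">
  <us-gaap:Cash contextRef="c1" unitRef="usd" decimals="0">100</us-gaap:Cash>
  <us-gaap:BasisOfAccountingPolicyPolicyTextBlock contextRef="c1">&lt;p&gt;Basis of presentation&lt;/p&gt;</us-gaap:BasisOfAccountingPolicyPolicyTextBlock>
  <us-gaap:InventoryPolicyTextBlock contextRef="c1">&lt;p&gt;Inventory&lt;/p&gt;</us-gaap:InventoryPolicyTextBlock>
</xbrl>"""


def local_name(tag):
    """Return the tag without ElementTree's "{namespace-uri}" prefix."""
    return tag.rsplit("}", 1)[-1]


def first_text_block(root):
    """Return the first element, in document order, whose local name ends in "TextBlock"."""
    for elem in root.iter():
        # Skip nodes whose tag is not a string (comments, processing instructions).
        if isinstance(elem.tag, str) and local_name(elem.tag).endswith("TextBlock"):
            return elem
    return None


root = ET.fromstring(SAMPLE)
block = first_text_block(root)
if block is not None:
    print(local_name(block.tag))
    print(block.text)  # the escaped HTML payload, already unescaped by the parser
```

In the loop above, the same function could be called as `first_text_block(tree.getroot())`. A file that is not an instance document, such as a `_cal.xml` calculation linkbase, would be expected to return `None`.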
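Ghislain, thank you for the in-depth response. Do you happen to have a favorite XBRL processor? Or could you recommend an easily accessible open-source one?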