1
我想就如何解析這個文件Gene ontology (.obo)解析文件到父/子格式的JSON文件
我正在努力創造D3可視化,並需要建立一個「樹一些幫助/諮詢「文件,以JSON格式 -
{
"name": "flare",
"description": "flare",
"children": [
{
"name": "analytic",
"description": "analytics",
"children": [
{
"name": "cluster",
"description": "cluster",
"children": [
{"name": "Agglomer", "description": "AgglomerativeCluster", "size": 3938},
{"name": "Communit", "description": "CommunityStructure", "size": 3812},
{"name": "Hierarch", "description": "HierarchicalCluster", "size": 6714},
{"name": "MergeEdg", "description": "MergeEdge", "size": 743}
]
}, etc..
這種格式似乎很容易在Python中的字典進行復制,每個條目一個3個字段:名稱,描述和兒童[]。
我的問題實際上是如何提取數據。上面鏈接的文件具有「對象」結構爲:
[Term]
id: GO:0000001
name: mitochondrion inheritance
namespace: biological_process
def: "The distribution of mitochondria, including the mitochondrial genome, into daughter cells after mitosis or meiosis, mediated by interactions between mitochondria and the cytoskeleton." [GOC:mcc, PMID:10873824, PMID:11389764]
synonym: "mitochondrial inheritance" EXACT []
is_a: GO:0048308 ! organelle inheritance
is_a: GO:0048311 ! mitochondrion distribution
我在哪裏需要id,is_a和name字段。我試圖用python來解析這個,但我似乎找不到找到每個對象的方法。
任何想法?