2013-02-12 192 views
4

我一直在努力奮鬥幾個小時,以解決以下問題,但未取得成功。將字典列表轉換爲唯一的字典列表

我有一個數據結構,看起來像這樣:

[ { 'ROOT': [ 
      { 'firstElem': 'gc-3/1/0'}, 
      { 'SecondElem': '5.0.0.1'}, 
      { 'ThirdElem': '127.3.15.1'}, 
      { 'index': 16}, 
      { 'function': 'session'}, 
      { 'hw': '0.0.0.0'}, 
      { 'sw': '1.50.1.3'}, 
      { 'resources': [ { 'cpu-info': [ { 'cpu-peak-load': 1}, 
                { 'cpu-avg-load': 1}]}, 
           { 'memory-total': 1}, 
           { 'memory-used': 2}]}, 
      ]}, 
    { 'ROOT': [ 
      { 'firstElem': 'gc-4/1/0'}, 
      { 'SecondElem': '5.0.0.2'}, 
      { 'ThirdElem': '127.3.4.1'}, 
      { 'index': 5}, 
      { 'function': 'stand'}, 
      { 'hw': '0.0.0.0'}, 
      { 'sw': '1.50.1.3'}, 
      { 'resources': [ { 'cpu-info': [ { 'cpu-peak-load': 1}, 
                { 'cpu-avg-load': 1}]}, 
           { 'memory-total': 1}, 
           { 'memory-used': 2}]}, 
      ]} 
] 

我想遍歷這個數據結構,並使用相同的名稱將所有的字典元素,並創建一個列表來代替。 這是很難解釋,我已經創造了什麼,我想找一個示例結構:

{ 
    "ROOT": [ 
     { 
      "firstElem": "gc-3/1/0", 
      "SecondElem": "5.0.0.1", 
      "ThirdElem": "128.0.2.19", 
      "index": "13", 
      "function": "session", 
      "hw": "1.11.0.0 ", 
      "sw": "1.50.0.228 ", 
      "resources": { 
       "cpu-info": { 
        "cpu-peak-load": "1", 
        "cpu-avg-load": "1", 
       }, 
       "memory-total": "1", 
       "memory-used": "2", 
      }, 
     }, 
     { 
      "firstElem": "gc-4/1/0", 
      "SecondElem": "5.0.0.1", 
      "ThirdElem": "128.0.2.19", 
      "index": "13", 
      "function": "session", 
      "hw": "1.11.0.0 ", 
      "sw": "1.50.0.228 ", 
      "resources": { 
       "cpu-info": { 
        "cpu-peak-load": "8", 
        "cpu-avg-load": "1", 
       }, 
       "memory-total": "1", 
       "memory-used": "2", 
      }, 
     } 
    ], 
} 

我堅持與原始數據結構和不能改變它。任何幫助表示讚賞。 上面提供的結構只是一個例子,因爲數據是動態接收的,我不會知道標籤名稱。所以請不要提供使用特定標籤名稱的解決方案。

回答

2

這裏有一個辦法:

>>> from collections import defaultdict 
>>> def combine(item): 
    # Easy return if not a list: element itself 
    if type(item) != type([]): 
     return item 
    # else call recursion 
    first_ret = [(i.items()[0][0], combine(i.items()[0][1])) for i in item] 

    # Here we group by same keys if any ('ROOT', for instance) 
    count_keys = defaultdict(list) 
    for couple in first_ret: 
     count_keys[couple[0]].append(couple[1]) 
    return dict((k, v if len(v) > 1 else v[0]) for k, v in count_keys.iteritems()) 

我只好組ROOT節點,但它似乎是工作:

>>> pprint(combine(l)) 
{'ROOT': [{'SecondElem': '5.0.0.1', 
      'ThirdElem': '127.3.15.1', 
      'firstElem': 'gc-3/1/0', 
      'function': 'session', 
      'hw': '0.0.0.0', 
      'index': 16, 
      'resources': {'cpu-info': {'cpu-avg-load': 1, 
             'cpu-peak-load': 1}, 
         'memory-total': 1, 
         'memory-used': 2}, 
      'sw': '1.50.1.3'}, 
      {'SecondElem': '5.0.0.2', 
      'ThirdElem': '127.3.4.1', 
      'firstElem': 'gc-4/1/0', 
      'function': 'stand', 
      'hw': '0.0.0.0', 
      'index': 5, 
      'resources': {'cpu-info': {'cpu-avg-load': 1, 
             'cpu-peak-load': 1}, 
         'memory-total': 1, 
         'memory-used': 2}, 
      'sw': '1.50.1.3'}]} 
>>> 
2

讓我們試試這個:

r = {} 

def lst2dct(lst): 
    return (lst if not isinstance(lst, list) else 
     {k: lst2dct(v) for e in lst for k, v in e.items()}) 

for e in source: 
    key, val = e.items()[0] 
    r.setdefault(key, []).append(lst2dct(val)) 
+0

用真實代碼測試,它不是工作,它似乎只採取一個根元素,第二個不知何故被丟棄! – theAlse 2013-02-13 09:28:21

+0

@theAlse:在某處發佈實際數據(例如gist.github.com) – georg 2013-02-13 09:46:37

1

它有點哈克,但你可以嘗試:

data = [ { 'ROOT': [ 
      { 'firstElem': 'gc-3/1/0'}, 
      { 'SecondElem': '5.0.0.1'}, 
      { 'ThirdElem': '127.3.15.1'}, 
      { 'index': 16}, 
      { 'function': 'session'}, 
      { 'hw': '0.0.0.0'}, 
      { 'sw': '1.50.1.3'}, 
      { 'resources': [ { 'cpu-info': [ { 'cpu-peak-load': 1}, 
                { 'cpu-avg-load': 1}]}, 
           { 'memory-total': 1}, 
           { 'memory-used': 2}]}, 
      ]}, 
    { 'ROOT': [ 
      { 'firstElem': 'gc-4/1/0'}, 
      { 'SecondElem': '5.0.0.2'}, 
      { 'ThirdElem': '127.3.4.1'}, 
      { 'index': 5}, 
      { 'function': 'stand'}, 
      { 'hw': '0.0.0.0'}, 
      { 'sw': '1.50.1.3'}, 
      { 'resources': [ { 'cpu-info': [ { 'cpu-peak-load': 1}, 
                { 'cpu-avg-load': 1}]}, 
           { 'memory-total': 1}, 
           { 'memory-used': 2}]}, 
      ]} 
] 

root_list = [ 

     ] 

final_data = { 
      'ROOT' : root_list 
      } 

for dict in data: 
    if dict['ROOT'] not in final_data['ROOT']: 
     final_data['ROOT'].append(dict['ROOT']) 
+0

請參閱已編輯的問題,我不能使用特定的標籤名稱,這僅僅是一個示例。 – theAlse 2013-02-12 17:26:06

0

我只是做了一些在這裏和那裏的東西的解釋,並拿出了這一點:

>>> i = [...] # your data 
>>> rdict = {} 
>>> for di in i: 
for root in di: 
    if root not in rdict: 
     rdict[root] = [di[root]] 
    else: 
     rdict[root] += [di[root]] 

rdict字典是你所需要的類型,應該多升工作派。


這個功能版本:

def common_dict(list_of_dicts): 
    i = list_of_dicts # less typing 
    rdict = {} 
    for di in i: 
     for root in di: 
      if root not in rdict: 
       rdict[root] = [di[root]] 
      else: 
       rdict[root] += [di[root]] 
return rdict