Convert part of a string to a variable name in Python

I have a text file that contains something like this:

loadbalancer { 
upstream application1 { 
server 127.0.0.1:8082; 
server 127.0.0.1:8083; 
server 127.0.0.1:8084; 
} 
upstream application2 { 
server 127.0.0.1:8092; 
server 127.0.0.1:8093; 
server 127.0.0.1:8094; 
} 
}

Does anyone know how I can extract variables like the following:

appList=["application1","application2"] 
ServerOfapp1=["127.0.0.1:8082","127.0.0.1:8083","127.0.0.1:8084"] 
ServerOfapp2=["127.0.0.1:8092","127.0.0.1:8093","127.0.0.1:8094"]

... and so on.

Source

2016-03-01 Lord ST

What have you tried so far?

I think you may want a dictionary of lists instead. servers['application1'] = ["127.0.0.1:8082", "127.0.0.1:8083", "127.0.0.1:8084"] would also let you use the keys in appList to look up the group of servers you want.

@IronFist, I tried to handle it with regular expressions, but I could not find any pattern that works.
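The dictionary-of-lists idea from the comments can be sketched as follows. The data is copied from the desired output in the question; the name servers comes from the comment, and the rest is only an illustration of why a dictionary is usually easier to work with than generated variable names such as ServerOfapp1.

```python
# Sample data taken from the question's desired output.
servers = {
    "application1": ["127.0.0.1:8082", "127.0.0.1:8083", "127.0.0.1:8084"],
    "application2": ["127.0.0.1:8092", "127.0.0.1:8093", "127.0.0.1:8094"],
}

# The list of application names is simply the dictionary's keys.
appList = sorted(servers)

# Look up each server group by name instead of by a generated variable name.
for app in appList:
    print(app, servers[app])
```

Adding a third application then only means adding a key, with no new variable to create.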

Here is the basic approach:

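import re

# each of your objects here
objText = "xyz xcyz 244.233.233.2:123"
# Python patterns take no /.../g delimiters; findall already returns every match
listOfAll = re.findall(r"\b(?:(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?)\.){3}(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?):[0-9]{1,5}", objText)

for eachMatch in listOfAll:
    # the format string needs a %s placeholder for the match
    print("Here's one! %s" % eachMatch)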

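Obviously this is a bit rough around the edges, but it runs a full regex search over whatever string it is given. A better solution might be to pass each object to it directly, but right now I'm not sure what kind of raw input you will have. I'll try to improve the regex, though.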
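As a rough illustration of what such a search returns on input like the question's, here is a minimal sketch. The variable config_text and the simplified pattern ADDRESS are assumptions made for this example; the pattern does not check that each octet is at most 255. Note that the result is one flat list, so the addresses are not yet grouped by application.

```python
import re

# Hypothetical sample, shortened from the question's config file.
config_text = """
upstream application1 {
server 127.0.0.1:8082;
server 127.0.0.1:8083;
}
upstream application2 {
server 127.0.0.1:8092;
}
"""

# Simplified host:port pattern: four dot-separated numbers, a colon, a port.
ADDRESS = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}:\d{1,5}\b")

# findall returns the whole match because the only group is non-capturing.
all_addresses = ADDRESS.findall(config_text)
print(all_addresses)
```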

Source

2016-03-01 15:41:18 Hektor

If the lines you want always start with upstream and server, this should work:

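app_dic = {}
with open('file.txt', 'r') as f:
    for line in f:
        if line.startswith('upstream'):
            # second word on the line is the application name
            app_i = line.split()[1]
            server_of_app_i = []
            # keep reading from the same file iterator until the block ends
            for line in f:
                if not line.startswith('server'):
                    break
                # [:-1] drops the trailing ';'
                server_of_app_i.append(line.split()[1][:-1])
            app_dic[app_i] = server_of_app_i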

app_dic should then be a dictionary of lists:

{'application1': ['127.0.0.1:8082', '127.0.0.1:8083', '127.0.0.1:8084'], 
'application2': ['127.0.0.1:8092', '127.0.0.1:8093', '127.0.0.1:8094']}

Edit

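If the input file does not contain any newlines, then as long as the file is not too large you can read it into a list of words and iterate over that: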

app_dic = {}
with open('file.txt', 'r') as f:
    txt_iter = iter(f.read().split())  # iterator over the list of words
for word in txt_iter:
    if word == 'upstream':
        app_i = next(txt_iter)
        server_of_app_i = []
        # consume words from the same iterator until the closing brace
        for word in txt_iter:
            if word == 'server':
                # [:-1] drops the trailing ';'
                server_of_app_i.append(next(txt_iter)[:-1])
            elif word == '}':
                break
        app_dic[app_i] = server_of_app_i

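This is uglier, because you have to look for the closing curly brace in order to break out of the loop. If the format gets any more complicated, regular expressions should be used.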
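One way a regular-expression version might look is sketched below. It is a two-pass approach using only the standard re module: first find each upstream block, then find the server entries inside it. The function name parse_upstreams and the sample string are made up for this example, and the sketch assumes that upstream blocks never contain nested braces.

```python
import re

def parse_upstreams(text):
    """Return {upstream name: [server addresses]} from nginx-style text."""
    # One match per "upstream NAME { ... }" block; [^}]* stops at the closing brace.
    block = re.compile(r"upstream\s+(\w+)\s*\{([^}]*)\}")
    # Inside a block, each entry looks like "server ADDRESS;".
    server = re.compile(r"server\s+([^;\s]+)\s*;")
    return {name: server.findall(body) for name, body in block.findall(text)}

# Hypothetical single-line input, the case the word-by-word loop was written for.
one_line = ("loadbalancer { upstream application1 { server 127.0.0.1:8082; "
            "server 127.0.0.1:8083; } upstream application2 { server 127.0.0.1:8092; } }")
parsed = parse_upstreams(one_line)
print(parsed)
```

Because \s matches newlines as well as spaces, the same function should handle the multi-line layout from the question without changes.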

Source

2016-03-01 15:43:29

You have ';' included in your final output!

@MT your code works for a config file in which every line ends with "\n", but when I tried it on my own config file it did not work...

@LordST If everything is on a single line, my example needs to be modified a bit...

If you are able to use the newer regex module by Matthew Barnett, you can use the following solution; see an additional demo on regex101.com:

import regex as re

rx = re.compile(r"""
    (?:(?P<application>application\d)\s\{\n|    # "application" + digit + { + newline
    (?!\A)\G\n)                                 # assert that the next match starts here
    server\s                                    # match "server"
    (?P<server>[\d.:]+);                        # followed by digits, . and :
    """, re.VERBOSE)

string = """
loadbalancer {
upstream application1 {
server 127.0.0.1:8082;
server 127.0.0.1:8083;
server 127.0.0.1:8084;
}
upstream application2 {
server 127.0.0.1:8092;
server 127.0.0.1:8093;
server 127.0.0.1:8094;
}
}
"""

result = {}
current = None
for match in rx.finditer(string):
    # a match that captured "application" starts a new group of servers
    if match.group('application'):
        current = match.group('application')
        result[current] = list()
    if current:
        result[current].append(match.group('server'))

print(result)
# {'application1': ['127.0.0.1:8082', '127.0.0.1:8083', '127.0.0.1:8084'], 'application2': ['127.0.0.1:8092', '127.0.0.1:8093', '127.0.0.1:8094']}
# (key order may differ on Python versions before 3.7)

This makes use of the \G anchor, named capture groups and some programming logic.

Source

2016-03-01 16:14:08 Jan

I believe this can also be solved with re (here s holds the config text):

>>> import re
>>> from collections import defaultdict
>>>
>>> APP = r'\b(?P<APP>application\d+)\b'
>>> IP = r'server\s+(?P<IP>[\d\.:]+);'
>>>
>>> pat = re.compile('|'.join([APP, IP]))
>>>
>>> scan = pat.scanner(s)
>>> d = defaultdict(list)
>>>
>>> for m in iter(scan.search, None):
...     group = m.lastgroup
...     if group == 'APP':
...         keygroup = m.group(group)
...         continue
...     else:
...         d[keygroup].append(m.group(group))
...
>>> d
defaultdict(<class 'list'>, {'application1': ['127.0.0.1:8082', '127.0.0.1:8083', '127.0.0.1:8084'], 'application2': ['127.0.0.1:8092', '127.0.0.1:8093', '127.0.0.1:8094']})

Or similarly with the re.finditer method, without pat.scanner:

>>> d = defaultdict(list)  # start from an empty dictionary again
>>> for m in re.finditer(pat, s):
...     group = m.lastgroup
...     if group == 'APP':
...         keygroup = m.group(group)
...         continue
...     else:
...         d[keygroup].append(m.group(group))
...
>>> d
defaultdict(<class 'list'>, {'application1': ['127.0.0.1:8082', '127.0.0.1:8083', '127.0.0.1:8084'], 'application2': ['127.0.0.1:8092', '127.0.0.1:8093', '127.0.0.1:8094']})
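Both loops depend on m.lastgroup, which gives the name of the last capturing group that took part in the match. Since each side of the alternation has exactly one named group, it tells you which alternative matched. A small sketch, using a made-up one-line sample and a pattern equivalent to the joined one above:

```python
import re

# Same two alternatives as in the answer, written as one pattern.
pat = re.compile(r"\b(?P<APP>application\d+)\b|server\s+(?P<IP>[\d\.:]+);")

# Hypothetical one-line sample input.
sample = "upstream application1 { server 127.0.0.1:8082; }"

# For each match, record which named group matched and what it captured.
kinds = [(m.lastgroup, m.group(m.lastgroup)) for m in pat.finditer(sample)]
print(kinds)
```

An 'APP' entry marks the start of a new key, and every following 'IP' entry belongs to that key, which is the bookkeeping that keygroup does in the loops above.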

Source

2016-03-03 06:54:33
