2015-11-30 55 views
0

目前正在爲專有文件格式進行EDI轉換。輸入文件可能有數百到數千條記錄。每條線都是一條記錄。Python專有文件轉換循環

INFILE:(忽略開頭的空白行,它只是存在於視覺表現)

"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 

我需要一個循環,可以通過走行線,並從一列在原始文件複製數據,並粘貼轉換爲一個新文件,該文件將數據保存在不同的列中。有點像沒有列標題的vlookup。這裏是列號之間的關係。

InColumn-OutColumn 
1-1, 2-2, 3-3, 4-4, 5-5, 6-6, 7-7, 8-8, 12-9, 14-10, 25-11, 68-24 

在此先感謝,我似乎無法繞過這一個。

編輯:這裏要求的是我無法工作的破解代碼。

KEY = [1,2,3,4,5,6,7,8,12,14,25,68] 
Body850 = open(TEMP, 'r+') 

for line in Body850: 
    for x in line.split(','): 
     END += KEY[x] 
    print line 
+0

看起來像csv。 python有一個csv模塊來處理這種類型的文件 – njzk2

+1

請提供一些你嘗試過的代碼,但是不起作用,所以我們可以幫你修復它。 –

+0

好吧,我編輯了原文 – user2890953

回答

1

如前所述,csv模塊可以將帶引號的逗號的照顧。創建一個CSV閱讀器和一個作家,然後唯一的竅門是過濾列。

import csv 

# todo: for testing... 
open('testfile.csv', 'w').write('''"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13" 
"1","2","3","4","5","6","7","can also contain text, and commas","9","10","11","12","13"''') 

# the columns wanted, in order they should appear in output (zero based) 
# ...example doesn't go to 68 cols, so abreviated 
column_order = (0,1,2,3,4,5,6,7,11) 

with open('testfile.csv') as csvin: 
    reader = csv.reader(csvin) 
    with open('testfile_new.csv', 'w') as csvout: 
     writer = csv.writer(csvout) 
     for row in reader: 
      writer.writerow([row[i] for i in column_order]) 

# todo: for testing... 
print(open('testfile_new.csv').read()) 
0

如果文本中不包含引號,你可以這樣做:

for line in Body850: 
    for x in line.split('","'): 
     if x in KEY: 
      END += KEY[x] 
     else 
      pass # do something here in case x is not in KEY 
    print line