我有類似閱讀在Python 3.0同時兩個文本文件,並提取所需要的字符串
file_1數據的兩個文本文件:
data1 data_1 1
data2 data_2 2
data3 data_2 2
data2 data_4 1
data3 data_3 1 and so on....
等
file_2:
data1
data2
data1
data3
data2
我想得到一個輸出爲
data1:
> data1 data_1 1
> data1 data_3 2
data2:
> data2 data_2 2
> data2 data_4 1
data3:
> data3 data_3 1
等等...
我從目前的代碼會得到什麼:
data1:
> data1 data_1 1
data2:
> data2 data_2 2
data3:
> data3 data_2 2
> data2 data_4 1
> data3 data_3 1
代碼:預期
first_occurance = {}
with open("folder_1/file_1", "r") as file_1:
with open("folder_1/file_2", "r") as file_2:
for line_1,line_2 in zip(file_1, file_2):
only_command = line_1.split()[0]
if only_command in line_2:
if only_command not in first_occurance:
print ("\n " + only_command + " :\n")
print (" > " + line_1.strip())
else:
print (" > " + line_1.strip())
first_occurance[only_command] = only_command
但是,這並不工作,因爲數據是未根據標題格式化,例如對應於data2
的行也顯示在data3
中。針對此問題的任何指導,將是很有益....
你能描述會發生什麼嗎? – user3467349
我編輯了這個問題..希望它現在更清楚了...... – user89
還不完全。那麼你期望'data3'會發生什麼?是否應該打印在data2塊之下? – fnl