通過正則表達式,您可以抓住所有這些元素。
import re
# input text
text="""Larry E Z (29s) Larry E Z: Hi. Thank you for contacting TE, how may I help you? (42s) Cesar Mendez : Hello Larry (57s) Larry E Z: Good Afternoon Cesar (1m 3s) Cesar Mendez : on drawing 2098753 (1m 12s) Cesar Mendez : what does note 9 mena? (1m 18s) Cesar Mendez : mean (2m 4s) Cesar Mendez : "sealing capability must be evaluated on an application basis (2m 6s) Larry E Z: The 9 does not have a specific meaning. (2m 32s) Cesar Mendez : this sentence................."sealing capability must be evaluated on an application basis" (3m 3s) Larry E Z: You are meaning Note 9 (3m 14s) Cesar Mendez : ... yes (3m 17s) Larry E Z: Ok (3m 25s)"""
# regex for Larry
patt1 = re.compile('Larry E Z:(.*?)\(')
larry = patt1.findall(text) # get all Larry's part as list
print(larry)
# regex for Cesar
patt2 = re.compile('Cesar Mendez :(.*?)\(')
cesar = patt2.findall(text) # get all Cesar's
print(cesar)
輸出:
[」您好。感謝您與TE聯繫,我可以如何幫助您? ','下午好Cesar','9沒有特定的含義。 ','你的意思是注意9','確定']
['你好拉里','在繪圖2098753','注9是什麼? ','是指',''密封能力必須在申請基礎上評估','這句話.................'密封能力必須在申請基礎上評估「 ','...是']
這裏是否有換行符?你是否嘗試過簡單地在冒號上分開? –
歡迎來到SO。不幸的是,這不是一個教程服務,請花時間閱讀[問]和它包含的鏈接Python文檔非常好,並且有很多好的在線regex教程,還有一些很好的python flavouredonline正則表達式測試器可以幫助你構建你的模式 - 比如regex101.com – wwii