2015-12-15 58 views
2

我正在嘗試編寫一個程序,它將採用一個HTML文件並輸出每一行。我做錯了什麼,因爲我的代碼輸出每個字母。我怎樣才能將所有的HTML行列入列表?Python - 如何逐行閱讀HTML

這是迄今爲止代碼:

f = open("/home/tony/Downloads/page1/test.html", "r") 
htmltext = f.read() 
f.close() 

for t in htmltext: 
    print t + "\n" 

回答

1

f.read()將試圖讀取並直到EOF滿足產生每個字符。你想要的是f.readlines()方法:

with open("/home/tony/Downloads/page1/test.html", "r") as f: 
    for line in f.readlines(): 
     print(line) # The newline is included in line