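Python for loop efficiency
I have this block of code. It works, but it takes about 8 seconds to execute. I know the slow part is the second for loop, because it has a loop inside it. However, I believe I need both loops, since I need to cross-reference the tracks list.
Does anyone know of a way to make this function execute faster? I can't seem to see another way of writing it.
FYI: the csv file I am using has 5570 rows, which is another reason the function takes a while.
Thanks in advance!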
import csv

# Track and Album are defined elsewhere in musiclib.py (not shown here).

def load_library(filename) :
    # Python 2: csv.reader expects the file opened in binary mode.
    library = open(filename, 'rb')
    reader = csv.reader(library, delimiter = '|')
    tracks = set([])
    albums = set([])
    albums1 = set([])
    #albums1 is the set of albums which have already been added to the albums list.
    # First pass: build one Track per row.
    for row in reader :
        artist, track, album, genre, year = row
        track = Track(artist, track)
        track.set_album(album)
        tracks.add(track)
    # Second pass: re-read the file and build the albums.
    library = open(filename, 'rb')
    reader = csv.reader(library, delimiter = '|')
    for row in reader :
        artist, track, album, genre, year = row
        a = Album(artist, album)
        # Inner loop: scans every track for every row of the file.
        for i in tracks :
            if str(i.album) == str(a.title) :
                a.add_track(i.title)
        if album not in albums1 :
            albums.add(a)
            albums1.add(album)
    return tracks, albums
After using cProfile:
cProfile.run('load_library()')

         224565 function calls in 9.776 seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    0.002    0.002    9.776    9.776 <string>:1(<module>)
     5570    0.001    0.000    0.001    0.000 musiclib.py:18(set_album)
    11140    0.007    0.000    0.007    0.000 musiclib.py:23(__init__)
    92784    0.028    0.000    0.037    0.000 musiclib.py:31(add_track)
     5570    0.004    0.000    0.009    0.000 musiclib.py:6(__init__)
        1    9.723    9.723    9.775    9.775 musiclib.py:71(load_library)
        2    0.000    0.000    0.000    0.000 {_csv.reader}
    16710    0.002    0.000    0.002    0.000 {method 'add' of 'set' objects}
    92784    0.009    0.000    0.009    0.000 {method 'append' of 'list' objects}
        1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}
        2    0.000    0.000    0.000    0.000 {open}
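Almost all of the time (9.723 of 9.776 seconds) is tottime inside load_library itself, which is consistent with the inner loop comparing every row against every track (roughly 5570 x 5570 string comparisons). One possible way to avoid that, as a rough sketch only: keep a dictionary from album title to Album and fill it during a single pass over the file. The Track and Album classes below are hypothetical stand-ins, since the real ones in musiclib.py are not shown, and load_library_fast is an illustrative name. The sketch takes an iterable of text lines instead of a filename so it can run on in-memory data, and it assumes every row should contribute its track title to its album.

```python
import csv
import io


class Track(object):
    """Hypothetical stand-in for the Track class in musiclib.py."""

    def __init__(self, artist, title):
        self.artist = artist
        self.title = title
        self.album = None

    def set_album(self, album):
        self.album = album


class Album(object):
    """Hypothetical stand-in for the Album class in musiclib.py."""

    def __init__(self, artist, title):
        self.artist = artist
        self.title = title
        self.tracks = []

    def add_track(self, track_title):
        self.tracks.append(track_title)


def load_library_fast(lines):
    """Build tracks and albums in one pass over pipe-delimited rows.

    `lines` is any iterable of text lines, for example an open file.
    """
    tracks = set()
    albums_by_title = {}  # album title -> Album; replaces the scan over tracks
    for artist, title, album, genre, year in csv.reader(lines, delimiter='|'):
        track = Track(artist, title)
        track.set_album(album)
        tracks.add(track)
        # A dictionary lookup takes constant time on average, so each row
        # finds its album without looping over all tracks.
        if album not in albums_by_title:
            albums_by_title[album] = Album(artist, album)
        albums_by_title[album].add_track(title)
    return tracks, set(albums_by_title.values())


# Small in-memory sample in the same artist|track|album|genre|year layout.
sample = io.StringIO(
    u"A|Song 1|Album X|Rock|2000\n"
    u"A|Song 2|Album X|Rock|2000\n"
    u"B|Song 3|Album Y|Pop|2010\n"
)
tracks, albums = load_library_fast(sample)
for a in sorted(albums, key=lambda alb: alb.title):
    print(a.title, a.tracks)
```

With this shape the work grows with the number of rows rather than with its square. As in the original function, the artist stored on each Album is the one from the first row that mentions that album title.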
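Well, it is faster to use 'readline(line_num)' than to use a 'for' loop to go through every line. – AHuman 2014-10-05 17:44:17
Try profiling it with 'cProfile' to see which parts are slow. – rlms 2014-10-05 17:52:00
@AHuman Thanks, but how would I use 'readline(line_num)' to read each line? – 2014-10-05 17:52:10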