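Getting DRAM_Reads and DRAM_Writes in the CUDA Profiler's command-line mode
I am trying to use the CUDA Profiler from the command line. I am interested in DRAM_Reads and DRAM_Writes, and I list the following counters in my CUDA_PROFILE_CONFIG file: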
fb_subp0_read_sectors
fb_subp0_write_sectors
fb0_subp0_read_sectors
fb0_subp0_write_sectors
fb1_subp0_read_sectors
fb1_subp0_write_sectors
However, in the cuda_profile file I see the following warnings:
NV_Warning: Ignoring the invalid profiler config option: fb0_subp0_read_sectors
NV_Warning: Ignoring the invalid profiler config option: fb0_subp0_write_sectors
NV_Warning: Ignoring the invalid profiler config option: fb1_subp0_read_sectors
NV_Warning: Ignoring the invalid profiler config option: fb1_subp0_write_sectors
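The values I get from the fb_subp0_read_sectors and fb_subp0_write_sectors counters do not match the numbers I get from the NVIDIA Visual Profiler, possibly because I am not passing the correct counters in the config file. The GPU is a Tesla M2050 and I am using CUDA 4.1. How can I get DRAM_Reads and DRAM_Writes from the command line?
Edit: After some reading, I think a given GPU may expose either the fb0/1 ... counters or the fb ... counters. However, even when I have: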
fb_subp0_read_sectors
fb_subp0_write_sectors
fb_subp1_read_sectors
fb_subp1_write_sectors
I get these warnings:
NV_Warning: Counter 'fb_subp1_read_sectors' is not compatible with other selected counters and it cannot be profiled in this run.
NV_Warning: Counter 'fb_subp1_write_sectors' is not compatible with other selected counters and it cannot be profiled in this run.
Thanks,
Sayan
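The second warning suggests that the subp0 and subp1 counters cannot be collected in the same run, so one option is to profile twice with different config files and combine the logs afterwards. The following is a minimal Python sketch of that combining step, not a confirmed recipe. It assumes the log uses the `key=[ value ]` line format, that DRAM reads and writes can be approximated by summing the subp0 and subp1 sector counts, and that one sector is 32 bytes; none of these is confirmed in the post. `sum_sector_counters` and `sectors_to_bytes` are hypothetical helper names.

```python
import re
from collections import defaultdict

# Assumed log line shape (key=[ value ] pairs), for example:
#   method=[ myKernel ] gputime=[ 5.3 ] fb_subp0_read_sectors=[ 30 ]
PAIR = re.compile(r"(\w+)=\[\s*([^\]]*?)\s*\]")

# Assumption: one framebuffer sector corresponds to 32 bytes.
SECTOR_BYTES = 32


def sum_sector_counters(log_texts):
    """Sum fb_subp*_read/write_sectors per kernel name across several logs.

    Every launch of a kernel, in every log, is added to the same total.
    """
    totals = defaultdict(lambda: {"read": 0, "write": 0})
    for text in log_texts:
        for line in text.splitlines():
            fields = dict(PAIR.findall(line))
            kernel = fields.get("method")
            if kernel is None:
                continue  # header or comment line without a method field
            for name, value in fields.items():
                m = re.fullmatch(r"fb_subp\d+_(read|write)_sectors", name)
                if m:
                    totals[kernel][m.group(1)] += int(value)
    return {kernel: dict(counts) for kernel, counts in totals.items()}


def sectors_to_bytes(sectors):
    """Convert a sector count to bytes under the 32-byte assumption."""
    return sectors * SECTOR_BYTES
```

Usage would be to read the log from the run that collected the subp0 counters and the log from the run that collected the subp1 counters, then pass both strings to `sum_sector_counters`. Because the totals are keyed by kernel name only, the two runs need to launch the same kernels the same number of times for the sums to be meaningful.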
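Did you get the expected number of accesses as the output of this counter? I ask because when I used this counter to get the number of read accesses, it returned strange results: I was expecting 1920 (32-byte) accesses, but it returned only 30. – warunapww 2014-01-13 23:44:48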