2011-04-21

Problem with the BCBio GFF parser

I am trying to parse a GFF file with the BCBio GFF parser and I get the following error. Can anyone help me solve this problem?

Traceback (most recent call last):
  File "gff_parse.py", line 6, in <module>
    for rec in GFF.parse(in_handle):
  File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 709, in parse
  File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 299, in parse_in_parts
  File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 320, in parse_simple
  File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 603, in _gff_process
  File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 634, in _lines_to_out_info
  File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 183, in _gff_line_map
ValueError: invalid literal for int() with base 10: 'New Start'

Here is my code:

from BCBio import GFF

in_file = "infile.gff"
# The with block closes the file even if parsing raises an error.
with open(in_file) as in_handle:
    for rec in GFF.parse(in_handle):
        print(rec)

Thanks,
Tulika

Answer

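How did you generate the GFF file? It appears to contain at least one invalid line. The fourth column should hold an integer giving the start coordinate of the feature; the error message indicates that it contains the value 'New Start' instead.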

The GFF3 specification page has some examples of valid GFF, and the online validator can help debug formatting problems like this one.
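
To find the offending line before handing the file to the parser, a quick scan of the coordinate columns may help. The sketch below is not part of BCBio; `find_bad_coordinates` is a hypothetical helper name, and it assumes a tab-delimited file in which columns 4 and 5 hold the start and end coordinates, comment and directive lines begin with `#`, and anything after a `##FASTA` directive is sequence data rather than features.

```python
def find_bad_coordinates(lines):
    """Return (line_number, column_name, value) for each non-integer start/end field.

    Hypothetical helper, not part of BCBio. `lines` is any iterable of
    text lines, such as an open file handle.
    """
    problems = []
    for line_number, line in enumerate(lines, start=1):
        line = line.rstrip("\r\n")
        # Everything after the ##FASTA directive is sequence, not features.
        if line.startswith("##FASTA"):
            break
        # Skip blank lines and comment/directive lines.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 5:
            # Too few columns to contain start and end coordinates.
            problems.append((line_number, "columns", line))
            continue
        for name, value in (("start", fields[3]), ("end", fields[4])):
            try:
                int(value)
            except ValueError:
                problems.append((line_number, name, value))
    return problems
```

Passing the open file handle for `infile.gff` to this function should return the line number and column of every coordinate field that `int()` cannot convert, which is the conversion that fails in the traceback.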