2016-12-29 88 views
0

我在Windows上使用10 x64 pytesseract錯誤打開數據文件,蟒蛇是3.5.2 x64,正方體是4.0,代碼如下:pytesseract在Windows 10:

# -*- coding: utf-8 -*- 

try: 
    import Image 
except ImportError: 
    from PIL import Image 
import pytesseract 


print(pytesseract.image_to_string(Image.open('d:/testimages/name.gif'), lang='chi_sim')) 

錯誤:

Traceback (most recent call last): 
    File "D:/test.py", line 10, in <module> 
    print(pytesseract.image_to_string(Image.open('d:/testimages/name.gif'), lang='chi_sim')) 
    File "C:\Users\dell\AppData\Local\Programs\Python\Python35\lib\site-packages\pytesseract\pytesseract.py", line 165, in image_to_string 
    raise TesseractError(status, errors) 
pytesseract.pytesseract.TesseractError: (1, 'Error opening data file \\Program Files (x86)\\Tesseract-OCR\\tessdata/chi_sim.traineddata') 

C:\Program Files (x86)\Tesseract-OCR\tessdata,像這樣:

enter image description here

爲什麼?

+0

是否對其他語言的工作? – sayan

+0

相關:http://stackoverflow.com/questions/24672531/annoying-python-tesseract-error-error-opening-data-file-tessdata-eng-trainedda – sayan

回答

0

設置TESSDATA_PREFIX環境變量C:\Program Files (x86)\Tesseract-OCR\

0

如果你有一個像tessdata錯誤:「錯誤打開數據文件...」

tessdata_dir_config = '--tessdata-dir "<replace_with_your_tessdata_dir_path>"' 
# Example config: '--tessdata-dir "C:\\Program Files (x86)\\Tesseract-OCR\\tessdata"' 
# It's important to add double quotes around the dir path. 

pytesseract.image_to_string(image, lang='chi_sim', config=tessdata_dir_config) 
+0

tessdata_dir_config ='--tessdata-dir「C:\\ Tesseract-OCR \\ tessdata「' – dubinglin

+0

你是否知道你可以編輯你的答案? – Yunnosch