4
爲什麼返回此錯誤?tesseract(v3.03)輸出爲PDF
[email protected] ~/ocr_test # tesseract -l dan pdf.png out pdf
Tesseract Open Source OCR Engine v3.03 with Leptonica
Error opening data file /usr/local/share/tessdata/osd.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory.
Failed loading language 'osd'
Tesseract couldn't load any languages!
Warning: Auto orientation and script detection requested, but osd language failed to load
語言列表
[email protected] ~/ocr_test # tesseract --list-langs
List of available languages (3):
eng
dan
dan-frak
輸出爲txt
這工作得很好,並輸出文本out.txt
tesseract -l dan pdf.png out
輸出PDF
這將創建out.pdf
也retuns提到的錯誤,並在PDF中搜索文本沒有意義
tesseract -l dan pdf.png out pdf
存儲庫已移至https://github.com/tesseract-ocr/tessdata – Joe
如何安裝? – happybuddha