2013-07-23

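PDFminer in Python

I downloaded pdfminer, and the command-line tool works perfectly, but I want to be able to convert several PDF documents at once, so I want to use pdfminer as a library. I found this on Stack Overflow, but I can't get it to work.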

from pdfminer.pdfinterp import PDFResourceManager, process_pdf 
from pdfminer.converter import TextConverter 
from pdfminer.layout import LAParams 
from cStringIO import StringIO 



def convert_pdf(path): 

    rsrcmgr = PDFResourceManager() 
    retstr = StringIO() 
    codec = 'utf-8' 
    laparams = LAParams() 
    device = TextConverter(rsrcmgr, retstr, codec=codec, laparams=laparams) 

    fp = file(path, 'rb') 
    process_pdf(rsrcmgr, device, fp) 
    fp.close() 
    device.close() 

    str = retstr.getvalue() 
    retstr.close() 
    print str 


convert_pdf("/Users/gorkemyurtseven/Desktop/casino.pdf") 

When I run it, I get:

Traceback (most recent call last): 
    File "pdfminer.py", line 1, in <module> 
    from pdfminer.pdfinterp import PDFResourceManager, process_pdf 
    File "/Users/gorkemyurtseven/Desktop/pdfminer.py", line 1, in <module> 
    from pdfminer.pdfinterp import PDFResourceManager, process_pdf 
ImportError: No module named pdfinterp 

Does 'import pdfminer' produce a similar 'ImportError'? pdfminer may not be on your Python path. If so, you may need to run 'python setup.py install' from the pdfminer directory.

Answers


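It looks like you named your script pdfminer.py, the same name as the module, and the import breaks because of it. When the script runs 'from pdfminer.pdfinterp import ...', Python finds your own file first (the traceback shows it loading /Users/gorkemyurtseven/Desktop/pdfminer.py) and fails because that file has no pdfinterp submodule. Rename the script to something else, and delete any leftover pdfminer.pyc next to it.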
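To confirm this kind of name clash, you can ask Python which file it would load for a given module name without actually importing it. The sketch below is only an illustration written for Python 3: the helper names describe_module_origin and is_shadowed_by_local_file are made up for this example and are not part of pdfminer.

```python
import importlib.util
import os
import sys


def describe_module_origin(name):
    """Return the file Python would load for `name`, or None if it is not found."""
    try:
        spec = importlib.util.find_spec(name)
    except (ImportError, ValueError):
        return None
    if spec is None:
        return None
    return spec.origin


def is_shadowed_by_local_file(name, directory):
    """Return True if `directory` holds a `name`.py that could shadow a package."""
    return os.path.isfile(os.path.join(directory, name + ".py"))


if __name__ == "__main__":
    # sys.path[0] is normally the directory of the running script;
    # fall back to the current directory when it is empty.
    script_dir = sys.path[0] or os.getcwd()
    print("pdfminer resolves to:", describe_module_origin("pdfminer"))
    print("local pdfminer.py present:",
          is_shadowed_by_local_file("pdfminer", script_dir))
```

If the reported path points into your own working directory rather than into site-packages, the local file is the one being imported.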

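Another possible cause is that the pdfminer module is not installed correctly, or that the installed copy is not the right version for your Python distribution.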
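One way to check the installation is to run a short report with the same interpreter that runs your script, since a package installed for one Python is invisible to another. This is a rough sketch for Python 3, not a definitive diagnostic; import_report is a hypothetical helper name, and it only uses the standard library.

```python
import importlib
import sys


def import_report(name):
    """Try to import `name` and describe what this interpreter sees."""
    report = {
        "module": name,
        "python": "%d.%d" % sys.version_info[:2],
        "executable": sys.executable,
    }
    try:
        module = importlib.import_module(name)
    except ImportError as exc:
        # Covers both a missing package and a missing submodule.
        report["ok"] = False
        report["detail"] = str(exc)
    else:
        report["ok"] = True
        report["detail"] = getattr(module, "__file__", None)
    return report


if __name__ == "__main__":
    for key, value in sorted(import_report("pdfminer.pdfinterp").items()):
        print("%s: %s" % (key, value))
```

If 'ok' is False, compare the 'executable' line with the Python you used to install pdfminer; when they differ, reinstall the package with the interpreter shown.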