2013-03-19 76 views
3

有沒有人知道如何解決TreeTagger這個文件讀取錯誤,這是一個常用的自然語言處理工具,用於POS標記,引理和塊句子?TreeTagger安裝成功但無法打開.par文件

[email protected]:~/treetagger$ echo 'Hello world!' | cmd/tree-tagger-english 
     reading parameters ... 

ERROR: Can't open for reading: /home/alvas/treetagger/lib/english.par 
aborted. 

我沒有遇到任何可能的安裝問題上http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/installation-hints.txt的暗示。 我已經按照網頁上的說明和它的正確安裝(http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/#Linux):

[email protected]:~$ mkdir treetagger 
[email protected]:~$ cd treetagger 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/tree-tagger-linux-3.2.tar.gz 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/tagger-scripts.tar.gz 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/install-tagger.sh 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/dutch-par-linux-3.2-utf8.bin.gz 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/german-par-linux-3.2-utf8.bin.gz 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/italian-par-linux-3.2-utf8.bin.gz 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/spanish-par-linux-3.2-utf8.bin.gz 
[email protected]:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/french-par-linux-3.2-utf8.bin.gz 

[email protected]:~/treetagger$ sh install-tagger.sh 

Linux version of TreeTagger installed. 
Tagging scripts installed. 
German parameter file (Linux, UTF8) installed. 
German chunker parameter file (Linux) installed. 
French parameter file (Linux, UTF8) installed. 
French chunker parameter file (Linux, UTF8) installed. 
Italian parameter file (Linux, UTF8) installed. 
Spanish parameter file (Linux, UTF8) installed. 
Dutch parameter file (Linux, UTF8) installed. 
Path variables modified in tagging scripts. 

You might want to add /home/alvas/treetagger/cmd and /home/alvas/treetagger/bin to the PATH variable so that you do not need to specify the full path to run the tagging scripts. 

但是當我嘗試測試軟件,我得到這些錯誤:

[email protected]:~/treetagger$ echo 'Hello world!' | cmd/tree-tagger-english 
    reading parameters ... 

ERROR: Can't open for reading: /home/alvas/treetagger/lib/english.par 
aborted. 
[email protected]:~/treetagger$ echo 'Das ist ein Test.' | cmd/tagger-chunker-german 

ERROR: Can't open for reading: /home/alvas/treetagger/lib/german-chunker.par 
aborted. 

ERROR: Can't open for reading: /home/alvas/treetagger/lib/german.par 
aborted. 
    reading parameters ... 

ERROR: Can't open for reading: /home/alvas/treetagger/lib/german.par 
aborted. 

回答

4

我想有有兩個問題:首先,腳本的名字應該有「-utf8」,例如cmd/tagger-chunker-german-utf8,因爲您下載了UTF-8數據。其次,標記和分塊需要每個數據文件。參見主頁上有「PC參數文件」和「PC的Chunker參數文件」部分 - 從兩個部分下載文件,然後重新執行install-tagger.sh

0

你寫CMD /樹惡搞英語,但我認爲正確的道路(其中有參數文件)是:

LIB /樹惡搞英語

相關問題