2016-11-05 44 views
4

通過命令行使用CoreNLP時TokensRegex規則顏色註釋器(stanford-corenlp-full-2016-10-31/tokensregex/color.rules.txt)加載成功,但對於java.lang.IllegalArgumentException: Unknown annotator: color Web服務器加載失敗。如何在Stanford CoreNLP服務器上使用自定義的TokensRegex規則註釋器?

設置

# custom.properties 
annotators=tokenize,ssplit,pos,lemma,ner,regexner,color 
customAnnotatorClass.color = edu.stanford.nlp.pipeline.TokensRegexAnnotator 
color.rules = tokensregex/color.rules.txt 

命令行

$ java -cp "*" -Xmx2g edu.stanford.nlp.pipeline.StanfordCoreNLP -props custom.properties -file ./tokensregex/color.input.txt -outputFormat text 
[main] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Registering annotator color with class edu.stanford.nlp.pipeline.TokensRegexAnnotator 
... 
[main] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator color 
[main] INFO edu.stanford.nlp.ling.tokensregex.CoreMapExpressionExtractor - Reading TokensRegex rules from tokensregex/color.rules.txt 
[main] INFO edu.stanford.nlp.ling.tokensregex.CoreMapExpressionExtractor - Read 7 rules 

# color.input.txt.output 
Sentence #1 (9 tokens): 
Both blue and light blue are nice colors. 
[Text=Both CharacterOffsetBegin=0 CharacterOffsetEnd=4 PartOfSpeech=CC Lemma=both NamedEntityTag=O] 
[Text=blue CharacterOffsetBegin=5 CharacterOffsetEnd=9 PartOfSpeech=JJ Lemma=blue NamedEntityTag=COLOR NormalizedNamedEntityTag=#0000FF] 
... 

服務器

  1. java -mx2g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -c custom.properties
  2. wget --post-data 'Both blue and light blue are nice colors.' 'localhost:9000/?properties={"annotators":"tokenize,ssplit,pos,lemma,ner,regexner,color","outputFormat":"json"}' -O -

    HTTP request sent, awaiting response... 500 Internal Server Error 
        2016-11-05 14:41:27 ERROR 500: Internal Server Error. 
    
    java.lang.IllegalArgumentException: Unknown annotator: color 
        at edu.stanford.nlp.pipeline.StanfordCoreNLP.ensurePrerequisiteAnnotators(StanfordCoreNLP.java:304) 
        at edu.stanford.nlp.pipeline.StanfordCoreNLPServer$CoreNLPHandler.getProperties(StanfordCoreNLPServer.java:713) 
        at edu.stanford.nlp.pipeline.StanfordCoreNLPServer$CoreNLPHandler.handle(StanfordCoreNLPServer.java:540) 
        at com.sun.net.httpserver.Filter$Chain.doFilter(Filter.java:79) 
        at sun.net.httpserver.AuthFilter.doFilter(AuthFilter.java:83) 
        at com.sun.net.httpserver.Filter$Chain.doFilter(Filter.java:82) 
        at sun.net.httpserver.ServerImpl$Exchange$LinkHandler.handle(ServerImpl.java:675) 
        at com.sun.net.httpserver.Filter$Chain.doFilter(Filter.java:79) 
        at sun.net.httpserver.ServerImpl$Exchange.run(ServerImpl.java:647) 
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) 
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) 
        at java.lang.Thread.run(Thread.java:745) 
    

解決方案

在請求中包含自定義標註屬性:wget --post-data 'Both blue and light blue are nice colors.' 'localhost:9000/?properties={"color.rules":"tokensregex/color.rules.txt","customAnnotatorClass.color":"edu.stanford.nlp.pipeline.TokensRegexAnnotator","annotators":"tokenize,ssplit,pos,lemma,ner,regexner,color","enforceRequirements":"false","outputFormat":"json"}' -O -

+0

做了所有''ner','regexner'和'color'爲你工作? –

回答

4

添加

"enforceRequirements":"false" 

到你的請求,並應該停止這個錯誤!

+1

謝謝!這給了我一個新的錯誤'java.lang.IllegalArgumentException:沒有註釋器命名爲color'.However,一些搜索後,我發現CoreNLP服務器不加載[傳遞給它的屬性文件](https://github.com/stanfordnlp/CoreNLP /問題/ 165)。我必須在請求中包含顏色註釋器屬性。 –

相關問題