Stanford CoreNLP: splitting sentences from text

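Be careful: NLP parsing has many subtleties that a simple strategy such as BreakIterator may not handle correctly. For example, would it properly handle a sentence like "The bread costs $4.99" or "What's the matter?" asked the mother. If you are fine with a naive solution, BreakIterator will do well. If you want to handle such cases more robustly, the Stanford NLP library is a good idea.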
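To make the naive option concrete, here is a minimal sketch of sentence splitting with the JDK's java.text.BreakIterator. The class name NaiveSentenceSplitter and its split method are hypothetical helpers invented for this illustration, and the choice of Locale.US is an assumption about the input language.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper class for illustration; not part of any library.
class NaiveSentenceSplitter {

    // Splits text at the sentence boundaries reported by java.text.BreakIterator.
    static List<String> split(String text) {
        List<String> sentences = new ArrayList<>();
        // Locale.US is an assumption; pick the locale that matches your text.
        BreakIterator iterator = BreakIterator.getSentenceInstance(Locale.US);
        iterator.setText(text);
        int start = iterator.first();
        for (int end = iterator.next(); end != BreakIterator.DONE;
                start = end, end = iterator.next()) {
            // Each [start, end) span is one sentence, including trailing whitespace.
            String sentence = text.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
        }
        return sentences;
    }

    public static void main(String[] args) {
        for (String s : split("Hello world. This is a test.")) {
            System.out.println(s);
        }
    }
}
```

This works on plain, well-punctuated text. For inputs with quoted speech or abbreviations it is worth running your own samples through it and comparing against a CoreNLP pipeline before deciding which one to use.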

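Have you looked at the documentation on the main Stanford NLP page? About halfway down, it gives an example of exactly what you are looking for. That example not only splits the text into sentences but also tokenizes it.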


2012-09-10 18:33:37

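    import java.util.List;
    import java.util.Properties;
    import edu.stanford.nlp.ling.CoreAnnotations;
    import edu.stanford.nlp.pipeline.StanfordCoreNLP;
    import edu.stanford.nlp.util.CoreMap;

    Properties properties = new Properties();
    // tokenize and ssplit are enough for sentence splitting; parse is optional and slower.
    properties.setProperty("annotators", "tokenize, ssplit, parse");
    StanfordCoreNLP pipeline = new StanfordCoreNLP(properties);
    // SENTENCES is a String constant that contains the sentences to split.
    List<CoreMap> sentences = pipeline.process(SENTENCES)
        .get(CoreAnnotations.SentencesAnnotation.class);
    // Print each detected sentence on its own line.
    for (CoreMap sentence : sentences) {
        System.out.println(sentence.toString());
    }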


2016-04-05 20:30:08
