2013-08-05

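Removing plurals using the Stanford POS tagger

I want to use the Stanford tagger to replace plural nouns with their singular forms (for example, "girls" becomes "girl").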

private static final String vbnTag = "VBN"; 
private static final String vbdTag = "VBD"; 
private static final String jjTag = "JJ"; 
private static final String edSuff = "ed"; 
private static final String enSuff = "en"; 
private static final String oneSt = "1"; 
private static final String naWord = "NA"; 

private static final Pattern stopper = Pattern.compile("(?i:and|or|but|,|;|-|--)"); 
private static final Pattern vbnWord = Pattern.compile("(?i:have|has|having|had|is|am|are|was|were|be|being|been|'ve|'s|s|'d|'re|'m|gotten|got|gets|get|getting)"); // cf. list in EnglishPTBTreebankCorrector 

Am I doing this right?

Answer

I think you can do this with the help of the lemma annotator provided in Stanford CoreNLP.
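
To illustrate the idea, here is a minimal sketch in plain Java. It does not call CoreNLP. It assumes the tagger's output is in the `word_TAG` form (for example `girls_NNS`) and replaces only tokens tagged `NNS` or `NNPS`. The class `PluralToSingular` and the method `naiveSingular` are hypothetical names made up for this example, and `naiveSingular` is a crude stand-in that handles only regular plurals. In a real pipeline the lemma would more likely come from CoreNLP's `lemma` annotator (annotators `tokenize,ssplit,pos,lemma`), read from each token via `CoreAnnotations.LemmaAnnotation`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.UnaryOperator;

class PluralToSingular {

    // Penn Treebank tags for plural common nouns and plural proper nouns.
    private static final String NNS_TAG = "NNS";
    private static final String NNPS_TAG = "NNPS";

    /**
     * Takes text in word_TAG form and returns the plain words, with every
     * plural noun passed through the supplied lemmatizer.
     */
    static String singularize(String tagged, UnaryOperator<String> lemmatizer) {
        List<String> out = new ArrayList<>();
        for (String token : tagged.trim().split("\\s+")) {
            int sep = token.lastIndexOf('_');
            if (sep <= 0) {
                out.add(token); // no tag attached, keep the token unchanged
                continue;
            }
            String word = token.substring(0, sep);
            String tag = token.substring(sep + 1);
            boolean plural = tag.equals(NNS_TAG) || tag.equals(NNPS_TAG);
            out.add(plural ? lemmatizer.apply(word) : word);
        }
        return String.join(" ", out);
    }

    /**
     * Hypothetical stand-in for a real lemmatizer: strips regular plural
     * endings only. Irregular plurals such as "children" are left as they are.
     */
    static String naiveSingular(String word) {
        String lower = word.toLowerCase();
        int n = word.length();
        if (lower.endsWith("ies") && n > 3) {
            return word.substring(0, n - 3) + "y";
        }
        if (lower.endsWith("xes") || lower.endsWith("ches") || lower.endsWith("shes")) {
            return word.substring(0, n - 2);
        }
        if (lower.endsWith("s") && !lower.endsWith("ss")) {
            return word.substring(0, n - 1);
        }
        return word;
    }

    public static void main(String[] args) {
        String tagged = "The_DT girls_NNS read_VBP stories_NNS";
        System.out.println(singularize(tagged, PluralToSingular::naiveSingular));
    }
}
```

Because the lemmatizer is passed in as a function, the suffix rule can be swapped for a proper lemmatizer without changing the tag check. The tag check is what stops words like "has" or "is" from losing their final "s".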

Thanks, RajuPenumatsa. I'll try it.