我有以下兩個DTM-S:[R DocumentTermMatrix控制列表不工作,自動忽略未知參數
dtm <- DocumentTermMatrix(t)
dtmImproved <- DocumentTermMatrix(t,
control=list(minWordLength = 4, minDocFreq=5))
當我實現這一點,我看到兩個相等的DTM-S,如果我打開dtmImproved
,有帶有3個符號的詞。爲什麼minWordLength
參數不起作用?謝謝!
> dtm
A document-term matrix (591 documents, 10533 terms)
Non-/sparse entries: 43058/6181945
Sparsity : 99%
Maximal term length: 135
Weighting : term frequency (tf)
> dtmImproved
A document-term matrix (591 documents, 10533 terms)
Non-/sparse entries: 43058/6181945
Sparsity : 99%
Maximal term length: 135
Weighting : term frequency (tf)
此外,當我添加任何東西到「列表(......)」什麼也沒發生,版本'tm'是否使用的是沒有任何警告或某物其他 –