With the tm package, I want to read the contents of a CSV file as a DataframeSource, but whenever I try to create a corpus it fails with **argument "x" is missing, with no default**
My code is:
corpus1 <- Corpus(object=ds,
readerControl=list(reader=readTabular(mapping=m),language="en"))
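I want to do some stemming in R, but it only seems to work on a single document. My end goal is a term-document matrix that shows the frequency of each term in each document. Here is an example:
require(RWeka)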
require(tm)
require(Snowball)
worder1<- c("I am taking","these are the samples",
"He speaks differently","This is dis