2017-07-26

New error with the tm package and dtm/wordclouds

I am using R (3.2.5) with the following packages loaded: 'SnowballC', 'tm', 'NLP', 'RWeka', 'RTextTools', 'wordcloud', 'fpc'.

carmenCorpus <- Corpus(VectorSource(feedback$Description)) 
carmenCorpus <- tm_map(carmenCorpus, PlainTextDocument) 
carmenCorpus <- tm_map(carmenCorpus, removePunctuation) 
carmenCorpus <- tm_map(carmenCorpus, removeWords, stopwords('english')) 
carmenCorpus <- tm_map(carmenCorpus, stemDocument) 

When I try to create the wordcloud, I get the following error. The error is new: the same code ran without problems a few months ago.

wordcloud(carmenCorpus, max.words = 100, random.order = FALSE) 

# Error in simple_triplet_matrix(i, j, v, nrow = length(terms), ncol = length(corpus), : 
# 'i, j' invalid 

Any advice on this problem would be appreciated.

Answer

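wordcloud can't just take a corpus and magically produce a word cloud from it.

You have to do the hard work yourself: convert the corpus to a TermDocumentMatrix, then sum up the word frequencies: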

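library(tm)         # provides TermDocumentMatrix
library(wordcloud)  # provides wordcloud

# convert to TDM (R's logical constant is TRUE, not True)
tdm <- TermDocumentMatrix(carmenCorpus, control = list(stemming = TRUE))

# calculate word frequencies: one row per term, summed across all documents
freqs <- sort(rowSums(as.matrix(tdm)), decreasing = TRUE)

# plot wordcloud from the terms and their frequencies
wordcloud(names(freqs), freqs,
    max.words = 100,
    random.order = FALSE
    # any other params you want to pass into wordcloud
    )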
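
To see what the frequency step computes without needing tm installed, here is a minimal base-R sketch. It assumes three made-up documents standing in for feedback$Description. The names docs, tokens, terms and counts are hypothetical, and the whitespace tokenizer is a simplification, not what tm does internally. The point is only that rowSums over a term-by-document count matrix yields one total per term, which is the pair of vectors passed to wordcloud above.

```r
# Toy documents standing in for feedback$Description (made-up data).
docs <- c("the app crashes on login",
          "login page is slow",
          "app is slow and crashes")

# Naive tokenizer: lower-case, then split on whitespace.
tokens <- strsplit(tolower(docs), "\\s+")

# Vocabulary: every distinct term, sorted alphabetically.
terms <- sort(unique(unlist(tokens)))

# Term-by-document count matrix, the same orientation as as.matrix(tdm):
# rows are terms, columns are documents.
counts <- sapply(tokens, function(tok) {
  as.vector(table(factor(tok, levels = terms)))
})
dimnames(counts) <- list(terms, paste0("doc", seq_along(docs)))

# Summing each row gives the total frequency of each term.
freqs <- sort(rowSums(counts), decreasing = TRUE)

print(counts)
print(freqs)
```

names(freqs) and freqs built this way have the same shape as the two vectors in the answer, so they could be passed to wordcloud in the same manner.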