相关疑难解决方法(0)

Twitter数据分析 - 术语文档矩阵中的错误

试图对twitter数据进行一些分析.下载推文并使用下面的推文文本创建语料库

# Creating a Corpus
wim_corpus = Corpus(VectorSource(wimbledon_text)) 
Run Code Online (Sandbox Code Playgroud)

在尝试创建如下的TermDocumentMatrix时,我收到错误和警告.

tdm = TermDocumentMatrix(wim_corpus, 
                       control = list(removePunctuation = TRUE, 
                                      stopwords =  TRUE, 
                                      removeNumbers = TRUE, tolower = TRUE)) 

Error in simple_triplet_matrix(i = i, j = j, v = as.numeric(v), nrow = length(allTerms),    : 'i, j, v' different lengths


In addition: Warning messages:
1: In parallel::mclapply(x, termFreq, control) :
 all scheduled cores encountered errors in user code
2: In is.na(x) : is.na() applied to non-(list or vector) of type 'NULL'
3: In TermDocumentMatrix.VCorpus(corpus) …
Run Code Online (Sandbox Code Playgroud)

r

8
推荐指数
3
解决办法
2万
查看次数

标签 统计

r ×1