当我将以下代码运行到倒数第二行时,我收到了警告消息:
在mclapply(content(x),FUN,...)中:所有计划的核心在用户代码中遇到错误
当我跑完最后一行时,我得到了
"UseMethod(\"words \")中的错误:\n没有适用于"字"的方法应用于类\"character \"\n"attr(,"class")"try-error"attr(, "条件")
以下链接是一个可重现的示例,我们可以将其复制/粘贴到R并运行.
https://github.com/weijia2013/mclapply-issue/blob/master/codes
我刚开始学习R,我将非常感谢你的帮助.
library(devtools)
install_github("twitteR", username="geoffjentry")
library(twitteR)
setup_twitter_oauth("API Key", "API Secret")
rdmTweets <- userTimeline('rdatamining', n=200)
(nDocs <- length(rdmTweets))
rdmTweets[11:15]
for (i in 11:15) {cat(paste("[[", i, "]] ", sep="")) + writeLines(strwrap(rdmTweets[[i]]$getText(), width=73))}
df <- do.call("rbind", lapply(rdmTweets, as.data.frame))
dim(df)
library(tm)
library(SnowballC)
library(RWeka)
library(rJava)
library(RWekajars)
myCorpus <- Corpus(VectorSource(df$text))
myCorpus <- tm_map(myCorpus, tolower)
myCorpus <- tm_map(myCorpus, removePunctuation)
myCorpus <- tm_map(myCorpus, removeNumbers)
removeURL <- function(x) gsub("http[[:alnum:]]*", "", x)
myCorpus <- tm_map(myCorpus, removeURL)
myStopwords <- c(stopwords("english"), "available", "via")
myStopwords …Run Code Online (Sandbox Code Playgroud)