在R 2.1.0之后的某个地方,我开始注意到启动时显示的R版本号旁边的短语.这被证明是一个发布昵称,一个添加到R.Version变量的未记录的组件,可以访问R.Version$nickname.
R-announce的档案显示Peter Dalgaard多次宣布新版本的昵称(例如,这里),但我没有找到任何其他信息.尽管是查询对象的列表元素R.Version(),但它不包含在该函数的帮助文件中.
有关于此功能的任何文档吗?理想情况下,我想要一个用于确定版本昵称的系统声明,但任何权威参考都会很棒.
我被要求重新创建以下风格的情节.(请忽略这是否是一种良好的可视化类型的问题,并将此视为在数字表中添加彩色元素.)
大部分都很简单,但我还没有找到一个让中心空洞的好方法.为了节省时间,我可能会采用添加隐形虚拟数据的方法(如果没有其他人这样做,我会发布该方法,但它似乎不如修改主题的方法更优化).是否有基于主题的解决方案或非ggplot2 R解决方案?
library(ggplot2)
# make sample dataframe
Category <- c("Electronics", "Appliances", "Books", "Music", "Clothing",
"Cars", "Food/Beverages", "Personal Hygiene",
"Personal Health/OTC", "Hair Care")
Percent <- c(81, 77, 70, 69, 69, 68, 62, 62, 61, 60)
internetImportance<-data.frame(Category,Percent)
# append number to category name
internetImportance$Category <-
paste0(internetImportance$Category," - ",internetImportance$Percent,"%")
# set factor so it will plot in descending order
internetImportance$Category <-
factor(internetImportance$Category,
levels=rev(internetImportance$Category))
# plot
ggplot(internetImportance, aes(x = Category, y = Percent,
fill = Category)) +
geom_bar(width = 0.9, stat="identity") …Run Code Online (Sandbox Code Playgroud) 当我ctree()从party包中绘制决策树结果时,字体太大而且框太大了.它们与其他节点重叠.
有没有办法自定义输出,plot()以便框和字体更小?
我想为我的多维结果实现平行坐标.有没有人在matlab或R中实现它的实现?此外,有没有关于用于生成平行坐标的最佳工具的建议?
我有一个带有布尔变量事件的数据框d,指示某个事件是否在给定日期发生.我想创建一个新变量,指示距离最近的事件有多少观察(天).
d=structure(list(date = structure(c(-365, -364, -363, -362, -361,
-360, -359, -358, -357, -356, -355, -354, -353, -352, -351, -350,
-349, -348, -347, -346), class = "Date"), event = c(TRUE,
FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE,
FALSE, FALSE, FALSE, FALSE, TRUE, FALSE, FALSE, FALSE, FALSE,
FALSE)), .Names = c("date", "event"), row.names = c(NA, 20L
), class = "data.frame")
Run Code Online (Sandbox Code Playgroud)
有没有这样做的功能?
当使用带有黑色背景的ggplot2主题时,是否可以控制颜色指南以外的指南的图例颜色,以便不用黑色绘制内容?如果是这样,怎么样?
library(ggplot2) # needs to be 0.9.3 for this theme
data(iris) # included with ggplot2
theme_black<- function (base_size = 16, base_family = ""){
theme_minimal() %+replace%
theme(
line = element_line(colour = "white", size = 0.5, linetype = 1,
lineend = "butt"),
rect = element_rect(fill = "white",
colour = "white", size = 0.5, linetype = 1),
text = element_text(family = base_family,
face = "plain", colour = "white", size = base_size,
angle = 0, lineheight = 0.9, hjust = 0, vjust = 0), …Run Code Online (Sandbox Code Playgroud) 我目前正在处理MMST包中的葡萄酒数据.我已将整个数据集拆分为训练和测试,并构建如下代码的树:
library("rpart")
library("gbm")
library("randomForest")
library("MMST")
data(wine)
aux <- c(1:178)
train_indis <- sample(aux, 142, replace = FALSE)
test_indis <- setdiff(aux, train_indis)
train <- wine[train_indis,]
test <- wine[test_indis,] #### divide the dataset into trainning and testing
model.control <- rpart.control(minsplit = 5, xval = 10, cp = 0)
fit_wine <- rpart(class ~ MalicAcid + Ash + AlcAsh + Mg + Phenols + Proa + Color + Hue + OD + Proline, data = train, method = "class", control = model.control)
windows()
plot(fit_wine,branch = …Run Code Online (Sandbox Code Playgroud) 我有一个结构相同的csv文件目录.我正在尝试将所有这些加载到单个data.frame中.目前我使用lapply()与read.csv()获得data.frames的名单,我期待一个优雅的方式来此列表转换为避免外在的循环一个data.frame.
我的结果lapply(list.of.file.names,read.csv)可以近似为这个结构:
list.of.dfs <- list(data.frame(A=sample(seq(from = 1, to = 10), size = 5),
B=sample(seq(from = 1, to = 10), size = 5)),
data.frame(A=sample(seq(from = 1, to = 10), size = 5),
B=sample(seq(from = 1, to = 10), size = 5)),
data.frame(A=sample(seq(from = 1, to = 10), size = 5),
B=sample(seq(from = 1, to = 10), size = 5))
)
Run Code Online (Sandbox Code Playgroud)
适用于任意长度列表的以下行的优雅版本是什么:
one.data.frame <- rbind(list.of.dfs[[1]],list.of.dfs[[2]],list.of.dfs[[3]])
Run Code Online (Sandbox Code Playgroud)
我可以使用for循环执行此操作,但是有基于矢量的解决方案吗?