我正在研究主成分分析 (PCA)。我发现ggfortify效果很好,但想做一些手动调整。
然后尝试绘制 PCA 结果如下:
evec <- read.table(textConnection("
PC1 PC2 PC3
-0.5708394 -0.6158420 -0.5430295
-0.6210178 -0.1087985 0.7762086
-0.5371026 0.7803214 -0.3203424"
), header = TRUE, row.names = c("M1", "M2", "M3"))
res.ct <- read.table(textConnection("
PC1 PC2 PC3
-1.762697 -1.3404825 -0.3098503
-2.349978 -0.0531175 0.6890453
-1.074205 1.5606429 -0.6406848
2.887080 -0.7272039 -0.3687029
2.299799 0.5601610 0.6301927"
), header = TRUE, row.names = c("A", "B", "C", "D", "E"))
require(ggplot2)
require(dplyr)
gpobj <-
res.ct %>%
ggplot(mapping = aes(x=PC1, y=PC2)) +
geom_point(color="grey30") +
annotate(geom="text", x=res.ct$PC1*1.07, y=res.ct$PC2*1.07,
label=rownames(res.ct))
for (i in 1:nrow(evec))
{
PCx <- evec[i,1]
PCy <- evec[i,2]
axisname <- rownames(evec)[[i]]
gpobj <- gpobj +
geom_segment(
data = evec[i,],
aes(
x = 0, y = 0,
xend = PC1, yend = PC2
# xend = PCx, yend = PCy #not work as intended
),
arrow = arrow(length = unit(4, "mm")),
color = "red"
) +
annotate(
geom = "text",
x = PCx * 1.15, y = PCy * 1.15,
label = axisname,
color = "red"
)
}
gpobj
Run Code Online (Sandbox Code Playgroud)
该代码运行良好,但是当我尝试使用注释行xend = PCx, yend = PCy而不是 时xend = PC1, yend = PC2,它不能按我的预期正常工作,它没有显示所有箭头。
xend = PC1, yend = PC2 效果很好:

xend = PCx, yend = PCy 才不是:

问题:geom_segment()当起点和终点由环境变量指定而不是由来自 的变量名称引用时,为什么不保留之前的箭头data =?
在您使用的代码中,当PCx/PCy在美学映射内部指定时aes(...)(而不是将它们硬编码为外部的固定美学值aes(...),如对annotate图层所做的那样),仅在绘制/打印 ggplot 对象时才评估实际值gpobj。
PCx这意味着/的值PCy是在for 循环之外计算的。此时,它们对应于它们所采用的最后一个值,i = 3这就是为什么只有一个箭头段(实际上是三个箭头彼此重叠)可见。搬到xend = PCx, yend = PCy外面aes(...)应该会达到您想要的外观。
不过,我确实想知道为什么你首先选择使用 for 循环。像下面这样的东西难道不会达到同样的目的吗?
# convert row names to explicit columns
res.ct <- tibble::rownames_to_column(res.ct)
evec <- tibble::rownames_to_column(evec)
# plot
res.ct %>%
ggplot(mapping = aes(x=PC1, y=PC2)) +
geom_point(color="grey30") +
geom_text(aes(x = PC1 * 1.07, y = PC2 * 1.07,
label = rowname)) +
geom_segment(data = evec,
aes(x = 0, y = 0, xend = PC1, yend = PC2, group = rowname),
arrow = arrow(length = unit(4, "mm")),
color = "red") +
geom_text(data = evec,
aes(x = PC1 * 1.15, y = PC2 * 1.15, label = rowname),
colour = "red")
Run Code Online (Sandbox Code Playgroud)