Aas*_*hta 1 r dataframe dplyr data-cleaning
Input data frames
DF 1 (sample - nrow = 10)
Col A | Col B | Col C
a 1 2
a 3 4
b 5 6
c 9 10
DF 2 (sample - nrow = 20)
Col A | Col B | Col E
a 1 22
a 31 41
a 3 63
b 5 6
b 11 13
c 9 20
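I want to create a third data frame containing every additional row found in data frame 2, that is, each row whose Col A and Col B combination does not occur in data frame 1.
Output file (nrow = 20 - 10 = 10)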
Col A | Col B | Col E
a 31 41
b 11 13
library(dplyr)
# Keep the rows of df2 that have no match in df1 on the (ColA, ColB) key
anti_join(df2, df1, by = c("ColA", "ColB"))
# ColA ColB ColE
# 1 a 31 41
# 2 b 11 13
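If dplyr is not available, roughly the same filter can be sketched in base R. This is only an illustration, not part of the original answer: the names `key1`, `key2` and `only_in_df2` are made up here, the sample data is rebuilt with `data.frame()` so the block runs on its own, and the `"|"` separator assumes that character never appears in the key columns.

```r
# Sample data from the answer, rebuilt so this block is self-contained.
df1 <- data.frame(ColA = c("a", "a", "b", "c"),
                  ColB = c(1L, 3L, 5L, 9L),
                  ColC = c(2L, 4L, 6L, 10L))
df2 <- data.frame(ColA = c("a", "a", "a", "b", "b", "c"),
                  ColB = c(1L, 31L, 3L, 5L, 11L, 9L),
                  ColE = c(22L, 41L, 63L, 6L, 13L, 20L))

# Build one composite key per row from the two matching columns.
# Assumption: "|" does not occur in ColA or ColB, so keys cannot collide.
key1 <- paste(df1$ColA, df1$ColB, sep = "|")
key2 <- paste(df2$ColA, df2$ColB, sep = "|")

# Keep the rows of df2 whose key never occurs in df1.
only_in_df2 <- df2[!(key2 %in% key1), ]
rownames(only_in_df2) <- NULL  # renumber rows from 1
print(only_in_df2)
```

On the sample data this keeps the rows with ColB equal to 31 and 11, matching the `anti_join()` result. The pasted key is the weak point of this sketch, which is one reason `anti_join()` is the cleaner choice when dplyr is at hand.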
Data:
df1 <- structure(list(ColA = c("a", "a", "b", "c"), ColB = c(1L, 3L,
5L, 9L), ColC = c(2L, 4L, 6L, 10L)), class = "data.frame", row.names = c(NA,
-4L))
df2 <- structure(list(ColA = c("a", "a", "a", "b", "b", "c"), ColB = c(1L,
31L, 3L, 5L, 11L, 9L), ColE = c(22L, 41L, 63L, 6L, 13L, 20L)), class = "data.frame", row.names = c(NA,
-6L))