如何在R中使用dplyr :: inner_join多个tbls或data.frames

Zhi*_*Jia 14 r inner-join dplyr

在R,哪能inner_jointblsdata.frame小号有效?

例如:

devtools::install_github("rstudio/EDAWR")
library(EDAWR)
library(dplyr)
data(songs)
data(artists)
test <- songs
colnames(test) <- c("song2", "name")
inner_join(songs, artists,by="name") %>% inner_join(test,by="name")
Run Code Online (Sandbox Code Playgroud)

有数百个testdata.frames,我想加入.

jba*_*ums 22

您可以在列表中收集数据框并使用Reduce:

L <- list(songs, artists, test)
Reduce(inner_join, L)

#   name  plays                song               song2
# 1 John guitar Across the Universe Across the Universe
# 2 John guitar       Come Together Across the Universe
# 3 John guitar Across the Universe       Come Together
# 4 John guitar       Come Together       Come Together
# 5 Paul   bass      Hello, Goodbye      Hello, Goodbye
Run Code Online (Sandbox Code Playgroud)

您可以使用L <- mget(ls())(使用可选的patternarg ls)将所有内容放入列表中.


正如@akrun在评论中提到的,plyr另一种选择是:

library(plyr)
join_all(L, type='inner')
Run Code Online (Sandbox Code Playgroud)

  • @jazzurro你可以`Reduce(函数(x,y)inner_join(x,y,by = c('foo'='bar')),L)`,但我认为这需要`by`列对于元素1是`foo`,对于所有后续元素,它是`bar`. (2认同)