Dav*_*vis 3 split r strsplit splitstackshape
我正在处理一组非常原始的数据,需要对其进行整形才能使用它.我试图根据分隔符拆分选定的列'|'
d <- data.frame(id = c(022,565,893,415),
name = c('c|e','m|q','w','w|s|e'),
score = c('e','k|e','e|k|e', 'e|o'))
Run Code Online (Sandbox Code Playgroud)
是否可以将数据帧拆分为一个,以便最终看起来像这样.
df <- data.frame(id = c(22,22,565,565,565,565,893,893,893,415,415,415,415,415,415),
name = c('c','e','m','m','q','q','w','w','w','w','w','s','s','e','e'),
score = c('e','e','k','e','k','e','e','k','e','e','o','e','o','e','o'))
Run Code Online (Sandbox Code Playgroud)
到目前为止,我已经尝试了各种不同的字符串拆分功能,但没有太多运气:(
有人可以帮忙吗?
这是一个简单的基础R方法,分两步:
1)拆分列:
x <- lapply(d[-1], strsplit, "|", fixed = TRUE)
Run Code Online (Sandbox Code Playgroud)
2)扩展和组合:
d2 <- setNames(do.call(rbind, Map(expand.grid, d$id, x$name, x$score)), names(d))
Run Code Online (Sandbox Code Playgroud)
结果是:
# id name score
#1 22 c e
#2 22 e e
#3 565 m k
#4 565 q k
#5 565 m e
#6 565 q e
#7 893 w e
#8 893 w k
#9 893 w e
#10 415 w e
#11 415 s e
#12 415 e e
#13 415 w o
#14 415 s o
#15 415 e o
Run Code Online (Sandbox Code Playgroud)