Har*_*pta 6 r dataframe data.table
我有一个这样的数据集
temp <- structure(list(col_1 = c("", "P9603", "", "", "11040",
"80053"), col_2 = c("84484", "80061", "", "80061", "A0428", "85025"
), col_3 = c("V2632", "82310", "", "", "", "86357"), col_4 = c("J1170",
"84305", "62311", "80061", "", ""), col_5 = c("", "86708", "J0690",
"", "", "")), .Names = c("col_1", "col_2", "col_3", "col_4",
"col_5"), class = c("data.table", "data.frame"))
col_1 col_2 col_3 col_4 col_5
1: 84484 V2632 J1170
2: P9603 80061 82310 84305 86708
3: 62311 J0690
4: 80061 80061
5: 11040 A0428
6: 80053 85025 86357
Run Code Online (Sandbox Code Playgroud)
是否有可能像这样移动列
col_1 col_2 col_3 col_4 col_5
1: 84484 V2632 J1170 #LEFT SHIFT 1
2: P9603 80061 82310 84305 86708 #NO CHANGE
3: 62311 J0690 #LEFT SHIFT 3
4: 80061 80061 #LEFT SHIFT 1 FOR FIRST ITEM,
#LEFT SHIFT 2 FOR 2ND ITEM
5: 11040 A0428 #NO CHANGE
6: 80053 85025 86357 #NO CHANGE
Run Code Online (Sandbox Code Playgroud)
如果左侧的值为空,我将向左移动列
这是一个使用 的选项data.table。按行顺序分组, data.table( )unlist的子集,通过逻辑向量( ),转换为去掉 grp 列后用原来的列名设置名称.SDorderun==''list
setnames(temp[, {un <- unlist(.SD); as.list(un[order(un=='')])},
.(grp = 1:nrow(temp))][, grp := NULL], names(temp))[]
# col_1 col_2 col_3 col_4 col_5
#1: 84484 V2632 J1170
#2: P9603 80061 82310 84305 86708
#3: 62311 J0690
#4: 80061 80061
#5: 11040 A0428
#6: 80053 85025 86357
Run Code Online (Sandbox Code Playgroud)
或者另一种选择是melt在创建序列列后采用长格式,然后dcast采用宽格式
dcast(melt(temp[, n := seq_len(.N)], id.var = 'n')[order(n, value == ''),
.(value, variable = names(temp)[1:5]), n], n ~ variable)[, n := NULL][]
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
1186 次 |
| 最近记录: |