我正在处理一系列TRUE和FALSE序列,这些序列按组进行组织并延伸一段时间(例如几天).下面,我花时间重新创建一个示例并将其data.frame
转换为data.table
使用该data.table
包.
> dput(df)
structure(list(day = c(1L, 2L, 3L, 4L, 5L, 6L, 7L, 1L, 2L, 3L,
4L, 5L, 6L, 7L), group = structure(c(1L, 1L, 1L, 1L, 1L, 1L,
1L, 2L, 2L, 2L, 2L, 2L, 2L, 2L), .Label = c("a", "b"), class = "factor"),
condition = structure(c(1L, 1L, 2L, 2L, 2L, 1L, 1L, 1L, 2L,
2L, 1L, 1L, 2L, 2L), .Label = c("FALSE", "TRUE"), class = "factor")), .Names = c("day",
"group", "condition"), row.names = c(NA, -14L), …
Run Code Online (Sandbox Code Playgroud) 我有两列的数据框。我希望“ id”列是唯一的,并且非重复id的值应设置为相同的值,对于重复id的值应为NA。
library(data.table)
DT <- data.table(id = c(1,2,3,3,4,5,5), value = c(17,13,8,NA,9,NA,11))
DT
id value
1: 1 17
2: 2 13
3: 3 8
4: 3 NA
5: 4 9
6: 5 NA
7: 5 11
Run Code Online (Sandbox Code Playgroud)
预期产量
id value
1: 1 17
2: 2 13
3: 3 NA
4: 4 9
5: 5 NA
Run Code Online (Sandbox Code Playgroud) 我有一系列的观察结果描述了动物是否以及何时被发现在特定区域.以下样本表确定每天看到某种动物(status == 1
)与否(status == 0
).
id date status
1 1 2014-06-20 1
2 1 2014-06-21 1
3 1 2014-06-22 1
4 1 2014-06-23 1
5 1 2014-06-24 0
6 2 2014-06-20 1
7 2 2014-06-21 1
8 2 2014-06-22 0
9 2 2014-06-23 1
10 2 2014-06-24 1
11 3 2014-06-20 1
12 3 2014-06-21 1
13 3 2014-06-22 0
14 3 2014-06-23 1
15 3 2014-06-24 0
16 4 2014-06-20 1
17 4 2014-06-21 0 …
Run Code Online (Sandbox Code Playgroud)