Increment by group

ton*_*nyk 6 r dplyr

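I am trying to increment a column within each group. If a row has no value but the row before it in the same group does, it should take that previous value plus one; otherwise it is left as it is.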

So, for example, df should become dfb.

df <- data.frame(group = c("A", "A", "B", "B", "B", "C", "C", "C", "D", "D"), 
                 num = c(1, NA, NA, 8, NA, 5, NA, NA, 10, NA))
dfb <- data.frame(group = c("A", "A", "B", "B", "B", "C", "C", "C", "D", "D"),
                 num = c(1, 2, NA, 8, 9, 5, 6, 7, 10, 11))
> df

   group num
1      A   1
2      A  NA
3      B  NA
4      B   8
5      B  NA
6      C   5
7      C  NA
8      C  NA
9      D  10
10     D  NA

> dfb
   group num
1      A   1
2      A   2
3      B  NA
4      B   8
5      B   9
6      C   5
7      C   6
8      C   7
9      D  10
10     D  11

My best attempt is this, but it doesn't work:

dfc <- df %>%
   mutate(num = ifelse(is.na(num),lag(num) + 1, num))

I deleted my previous question because it was poorly defined. Thanks for your help!

akr*_*run 6

We can do this with dplyr:

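library(dplyr)

df %>% 
 # start a new run at every non-NA value, and never let a run cross groups
 group_by(grp1 = cumsum(!is.na(num)), group) %>%
 # count up from the first value of each run; single-row runs are left as they are
 mutate(num = if(n() > 1) num[1L] + row_number() - 1 else num) %>% 
 ungroup() %>%
 select(-grp1)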
# A tibble: 10 × 2
#    group   num
#   <fctr> <dbl>
#1       A     1
#2       A     2
#3       B    NA
#4       B     8
#5       B     9
#6       C     5
#7       C     6
#8       C     7
#9       D    10
#10      D    11
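To see why grouping on `cumsum(!is.na(num))` together with `group` works, it may help to look at the intermediate key on its own. The following is a minimal base R sketch, not part of the original answer: it swaps `group_by()`/`mutate()` for `ave()`, and the names `run_id` and `filled` are made up for illustration.

```r
# Same data as in the question
df <- data.frame(group = c("A", "A", "B", "B", "B", "C", "C", "C", "D", "D"),
                 num = c(1, NA, NA, 8, NA, 5, NA, NA, 10, NA))

# Run id: goes up by one at every non-NA value, so each NA
# shares an id with the last non-NA value before it
run_id <- cumsum(!is.na(df$num))
run_id
# 1 1 1 2 2 3 3 3 4 4

# Splitting on run_id AND group keeps row 3 (group B, NA) apart from
# the run that started in group A, so it has no value to count from.
# Within each block, count up from the block's first value.
filled <- ave(df$num, run_id, df$group,
              FUN = function(x) x[1L] + seq_along(x) - 1)
filled
# 1 2 NA 8 9 5 6 7 10 11
```

Row 3 stays `NA` because its block contains only that one missing value, so `x[1L]` is `NA` and the sum is `NA` as well.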

Or with data.table:

library(data.table)
setDT(df)[, num := if(.N >1) num[1L] + seq_len(.N)-1
            else num,.(grp1=cumsum(!is.na(num)), group)]