根据r中的多个条件创建新列

jon*_*det 0 if-statement r case

我有一个数据框,我想根据数据框中的其他现有列(“th”和“br”)创建一个新列(“path”)。如果“th”中的值为“NA”,我想根据列“ce”和“br”创建新列。

可以使用以下代码创建可重现的数据样本:

df <- structure(list(
  th = c(3, 1, NA, 2, 2, 0, 3, 3, 0, 2, 3, 2, 1, NA, 3, 4, 3, 3, 1, 3), 
  br = c(1, 2, 4, 1, 2, 2, 1, 2, 2, 5, 4, 1, 1, 2, 1, 5, 2, 1, 1, 1), 
  ce = c(2, 3, 2, 0, 1, 0, 2, 1, 1, 1, 1, 0, 0, 1, 2, 0, 0, 1, 1, 2)), 
  row.names = c(NA, 20L), class = "data.frame")
Run Code Online (Sandbox Code Playgroud)

我用 if 和 else 语句尝试了以下代码:

df <- df %>% if(!is.na(th)) {
  mutate(path = case_when(
    (th %in% c(0:2) & br %in% c(0:3) ~ "Low"), 
    (th %in% c(3:5) & br %in% c(0:3) ~ "Intermediate"),
    (th %in% c(3:5) & br %in% c(4:6) ~ "High")))
} else {
  mutate(path = case_when(
    (ce %in% c(0:1) & br %in% c(0:3) ~ "Low"), 
    (ce %in% c(1:3) & br %in% c(3) ~ "Intermediate"),
    (ce %in% c(2:3) & br %in% c(4:6) ~ "High")))
}
Run Code Online (Sandbox Code Playgroud)

这会导致错误“Error in if (.) !is.na(th) else { : the condition has length > 1”

代码有什么问题吗?对于如何改进代码或替代解决方案有什么建议吗?

小智 5

尝试这个:

df %>%
  mutate(path = ifelse(!is.na(th),
                       case_when(
                         (th %in% c(0:2) & br %in% c(0:3) ~ "Low"), 
                         (th %in% c(3:5) & br %in% c(0:3) ~ "Intermediate"),
                         (th %in% c(3:5) & br %in% c(4:6) ~ "High")),
                       case_when(
                         (ce %in% c(0:1) & br %in% c(0:3) ~ "Low"), 
                         (ce %in% c(1:3) & br %in% c(3) ~ "Intermediate"),
                         (ce %in% c(2:3) & br %in% c(4:6) ~ "High"))))
Run Code Online (Sandbox Code Playgroud)