我觉得应该有一种有效的方法来dplyr使用case_whenand 来改变新列contains,但是无法让它工作.
我理解使用case_when内部mutate是"有点实验性"(如本文所述),但对任何建议都会感激不尽.
不起作用:
library(tidyverse)
set.seed(1234)
x <- c("Black", "Blue", "Green", "Red")
df <- data.frame(a = 1:20,
b = sample(x,20, replace=TRUE))
df <- df %>%
mutate(group = case_when(.$b(contains("Bl")) ~ "Group1",
case_when(.$b(contains("re", ignore.case=TRUE)) ~ "Group2")
)
Run Code Online (Sandbox Code Playgroud)
akr*_*run 15
我们可以用 grep
df %>%
mutate(group = case_when(grepl("Bl", b) ~ "Group1",
grepl("re", b, ignore.case = TRUE) ~"Group2"))
# a b group
#1 1 Black Group1
#2 2 Green Group2
#3 3 Green Group2
#4 4 Green Group2
#5 5 Red Group2
#6 6 Green Group2
#7 7 Black Group1
#8 8 Black Group1
#9 9 Green Group2
#10 10 Green Group2
#11 1 Green Group2
#12 2 Green Group2
#13 3 Blue Group1
#14 4 Red Group2
#15 5 Blue Group1
#16 6 Red Group2
#17 7 Blue Group1
#18 8 Blue Group1
#19 9 Black Group1
#20 10 Black Group1
Run Code Online (Sandbox Code Playgroud)
想要添加一些使用函数的示例,str_detect该paste0函数也可以使连接公共组变得轻而易举。假设您正在与gapminder 或其他国家/地区的df 合作。
interest <- c("Austria", "Belgium", "Bulgaria", "Croatia", "Cyprus",
"Czech Republic", "Denmark", "Estonia", "Finland",
"France", "Germany", "Greece", "Hungary", "Ireland",
"Italy", "Latvia", "Lithuania", "Luxembourg","Malta",
"The Netherlands", "Poland","Portugal", "Romania",
"Slovakia", "Slovenia","Spain", "Sweden","United Kingdom")
EU <- paste0(countrycode::countryname(
sourcevar = interest, destination = "iso2c"),
sep = "|", collapse = "")
df%<>%mutate(Region=case_when(
str_detect(Country, "AT|BE|BG|HR|CY|CZ|DK|EE|FI|FR|DE|GR|HU|IE|
IT|LV|LT|LU|MT|NL|PL|PT|RO|SK|SI|ES|SE|GB|UK|G8")~ "EU",
TRUE ~ "Not EU")) ```
You'll need to load `library(magittr)` to get `%<>%` the compound pipe to work, it's basically an abbreviation of `df<-funs(df)`
Run Code Online (Sandbox Code Playgroud)