这是数据集:
company <- c("Coca-Cola Inc.", "DF, CocaCola",
"COCA-COLA", "PepsiCo Inc.", "Beverages Distribution")
brand <- c("Coca-Cola Zero","N/A", "Coca-Cola", "Pepsi", "soft drink")
vol <- c("2456","1653", "19", "2766", "167")
data <-data.frame(company, brand, vol)
data
Run Code Online (Sandbox Code Playgroud)
结果如下:
company brand vol
1 Coca-Cola Inc. Coca-Cola Zero 2456
2 DF, CocaCola N/A 1653
3 COCA-COLA CocaCola 19
4 PepsiCo Inc. Pepsi 2766
5 Beverages Distribution soft drink 167
Run Code Online (Sandbox Code Playgroud)
比方说,这是按品牌进口量.
任务是对数据帧进行SUBSET,以仅查看与可口可乐相关的观察结果,而不是任何其他品牌.
我们需要将COMPANY和BRAND变量与标准列表(键)进行部分匹配:
company_key <- c("coca-", "cocacola", "coca cola", "beverages distribution")
brand_key <- c("coca-", "cocacola", "coca …Run Code Online (Sandbox Code Playgroud)