Two-way contingency table with frequencies and percentages

MYa*_*208 4 r contingency dplyr tidyverse janitor

I have the following two-way contingency table, which shows cell percentages with the frequencies in parentheses.

gender       blue blue-gray       brown      dark      hazel    yellow
 female 33.33% (3) 0.00% (0) 55.56%  (5) 0.00% (0) 11.11% (1) 0.00% (0)
   male 34.62% (9) 3.85% (1) 46.15% (12) 3.85% (1)  3.85% (1) 7.69% (2)

This is the R code I used:

library(dplyr)

library(janitor)

starwars %>%
  filter(species == "Human") %>% 
  tabyl(gender, eye_color) %>%
  adorn_percentages("row") %>%
  adorn_pct_formatting(digits = 2) %>%
  adorn_ns()

However, I would like the same kind of table with the cell frequencies first and the percentages in parentheses. Any help would be appreciated.

akr*_*run 6

We can change the position argument of adorn_ns from "rear" (the default) to "front":

library(tidyverse)
library(janitor)  # provides tabyl() and the adorn_*() functions
starwars %>%
  filter(species == "Human") %>%
  tabyl(gender, eye_color) %>%
  adorn_percentages("row") %>%
  adorn_pct_formatting(digits = 2) %>%
  adorn_ns(position = "front")
# gender       blue blue-gray       brown      dark      hazel    yellow
# female 3 (33.33%) 0 (0.00%)  5 (55.56%) 0 (0.00%) 1 (11.11%) 0 (0.00%)
#   male 9 (34.62%) 1 (3.85%) 12 (46.15%) 1 (3.85%) 1  (3.85%) 2 (7.69%)
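For comparison, the same "n (pct%)" layout can be sketched in base R without janitor. This is only an illustration, not part of the answer above: the counts are typed in by hand from the table in the question, and the names counts, pct, and cells are made up for this example.

```r
# Counts copied by hand from the table in the question (assumption:
# they match the Human rows of starwars in your dplyr version).
counts <- matrix(
  c(3, 0,  5, 0, 1, 0,
    9, 1, 12, 1, 1, 2),
  nrow = 2, byrow = TRUE,
  dimnames = list(
    gender    = c("female", "male"),
    eye_color = c("blue", "blue-gray", "brown", "dark", "hazel", "yellow")
  )
)

# Row percentages: margin = 1 divides each cell by its row total.
pct <- prop.table(counts, margin = 1) * 100

# Build "n (pct%)" strings, then restore the matrix shape and labels.
cells <- sprintf("%d (%.2f%%)", as.integer(counts), as.vector(pct))
dim(cells) <- dim(counts)
dimnames(cells) <- dimnames(counts)

print(cells, quote = FALSE)
```

With real data, counts would more likely come from table() on the two columns; adorn_ns(position = "front") remains the shorter route when janitor is already in use.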

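Alternatively, if the object has already been created, another option is to post-process it with mutate_at: change the format of every column except the first by capturing the percentage and the count as two groups, then swap their positions by reversing the backreferences and putting the parentheses around the percentage.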

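library(tidyverse)
library(janitor)  # provides tabyl() and the adorn_*() functions
starwars %>%
  filter(species == "Human") %>% 
  tabyl(gender, eye_color) %>%
  adorn_percentages("row") %>%
  adorn_pct_formatting(digits = 2) %>%
  adorn_ns() %>% 
  # swap "pct% (n)" into "n (pct%)" in every column except the first
  mutate_at(-1, list(~ str_replace(., "^([0-9.%]+)\\s+\\((\\d+)\\)", "\\2 (\\1)")))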
# gender       blue blue-gray       brown      dark      hazel    yellow
#1 female 3 (33.33%) 0 (0.00%)  5 (55.56%) 0 (0.00%) 1 (11.11%) 0 (0.00%)
#2   male 9 (34.62%) 1 (3.85%) 12 (46.15%) 1 (3.85%)  1 (3.85%) 2 (7.69%)
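To see what the regular expression does on its own, here is a minimal sketch using base R's sub() on a few cell strings taken from the question's table. The vector names x and swapped are made up for this example, and the trailing $ anchor is an addition that the answer's pattern does not have.

```r
# Sample cells in the original "pct% (n)" layout; note the double
# space that adorn_ns() uses to align one-digit counts.
x <- c("33.33% (3)", "55.56%  (5)", "46.15% (12)", "0.00% (0)")

# Group 1 captures the percentage, group 2 the count inside the
# parentheses; the replacement emits them in reverse order.
swapped <- sub("^([0-9.%]+)\\s+\\((\\d+)\\)$", "\\2 (\\1)", x, perl = TRUE)

print(swapped)
```

In dplyr 1.0.0 and later, mutate_at is superseded by across(), so the last step of the pipeline could presumably also be written as mutate(across(-1, ~ str_replace(.x, pattern, "\\2 (\\1)"))), where pattern stands for the same regular expression.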