带有names_pattern和成对列的pivot_longer

ebo*_*ath 3 pivot-table r data-manipulation dataframe tidyr

我试图弄清楚如何在下面的示例中使用pivot_longerfrom 。tidyr这就是原始表dat_plot的结构:

   year organizational_based action_based ideological_based share_org_based share_ideo_based share_act_based
  <dbl>                <dbl>        <dbl>             <dbl>           <dbl>            <dbl>           <dbl>
1  1956                    1            0                 0               2               95              95
2  2000                    0            0                 0              92               87              91
Run Code Online (Sandbox Code Playgroud)

也在这里:

dat_plot <- structure(list(year = c(1956, 2000), organizational_based = c(1, 
0), action_based = c(0, 0), ideological_based = c(0, 0), share_org_based = c(2, 
92), share_ideo_based = c(95, 87), share_act_based = c(95, 91
)), row.names = c(NA, -2L), class = c("tbl_df", "tbl", "data.frame"
))
Run Code Online (Sandbox Code Playgroud)

我想通过以下方式将其转换为长格式:

  year          based based_value      share share_value
1 1956 organizational           1  org_based           2
2 1956         action           0 ideo_based          95
3 1956    ideological           0  act_based          95
4 2000 organizational           0  org_based          92
5 2000         action           0 ideo_based          87
6 2000    ideological           0  act_based          91
Run Code Online (Sandbox Code Playgroud)

或者,与dput

solution <- structure(list(year = c(1956, 1956, 1956, 2000, 2000, 2000), 
    based = c("organizational", "action", "ideological", "organizational", 
    "action", "ideological"), based_value = c(1, 0, 0, 0, 0, 
    0), share = c("org_based", "ideo_based", "act_based", "org_based", 
    "ideo_based", "act_based"), share_value = c(2, 95, 95, 92, 
    87, 91)), class = "data.frame", row.names = c(NA, -6L))
Run Code Online (Sandbox Code Playgroud)

我想我必须与 合作names_pattern,我尝试过的是这样的,但如果你尝试你会发现,这不是我想要的:

pivot_longer(data=dat_plot, cols=c("share_org_based", "share_ideo_based", "share_act_based",
                    "organizational_based", "action_based", "ideological_based"),
             names_pattern = c("(share_[A-Za-z]+)([A-Za-z]+_based)"),
             names_to = c("share", ".value"),
             values_to = "value")
Run Code Online (Sandbox Code Playgroud)

我很感激任何有关如何names_pattern工作或我缺少什么的线索。

Maë*_*aël 6

您可以使用两个pivot_longer

dat_plot %>% 
  pivot_longer(cols = starts_with("share"), names_to = "share", names_prefix = "share_", values_to = "share_value") %>%
  pivot_longer(cols = ends_with("based"), names_to = "based", names_pattern = "(.*)_based", values_to = "based_value") %>% 
  filter(substr(share, 1, 3) == substr(based, 1, 3))
Run Code Online (Sandbox Code Playgroud)

输出

dat_plot %>% 
  pivot_longer(cols = starts_with("share"), names_to = "share", names_prefix = "share_", values_to = "share_value") %>%
  pivot_longer(cols = ends_with("based"), names_to = "based", names_pattern = "(.*)_based", values_to = "based_value") %>% 
  filter(substr(share, 1, 3) == substr(based, 1, 3))
Run Code Online (Sandbox Code Playgroud)