Splitting positive values into multiple rows, across multiple columns

Question

Maë*_*aël 4 r data-manipulation dataframe dplyr

Suppose I have a dataset like this:

    dat <- tibble(id = 1:4,
                  col1 = c(0, 1, 1, 0),
                  col2 = c(1, 0, 1, 0),
                  col3 = c(1, 1, 0, 1))

    > dat
    # A tibble: 4 × 4
         id  col1  col2  col3
      <int> <dbl> <dbl> <dbl>
    1     1     0     1     1
    2     2     1     0     1
    3     3     1     1     0
    4     4     0     0     1

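For each unique id, I want to split multiple 1s into separate rows, so that every row contains exactly one 1. The expected output is: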

    # A tibble: 7 × 4
         id  col1  col2  col3
      <dbl> <dbl> <dbl> <dbl>
    1     1     0     1     0
    2     1     0     0     1
    3     2     1     0     0
    4     2     0     0     1
    5     3     1     0     0
    6     3     0     1     0
    7     4     0     0     1

For the first id (id = 1), col2 and col3 are both 1, so I want a separate row for each of them. It is a bit like one-hot encoding applied across rows.

Answer 1

Hoe*_*elR 7

With help from Ritchie Sacramento and RobertoT:

    library(tidyverse)

    dat <- tibble(id = 1:4,
                  col1 = c(0, 1, 1, 0),
                  col2 = c(1, 0, 1, 0),
                  col3 = c(1, 1, 0, 1))

    dat %>%
      pivot_longer(-id) %>%              # one row per (id, column) pair
      filter(value != 0) %>%             # keep only the non-zero cells
      mutate(rows = 1:nrow(.)) %>%       # temporary key: one output row per kept cell
      pivot_wider(values_fill = 0,       # back to wide; missing cells become 0
                  names_sort = TRUE) %>%
      select(-rows)                      # drop the temporary key

    # A tibble: 7 × 4
         id  col1  col2  col3
      <int> <dbl> <dbl> <dbl>
    1     1     0     1     0
    2     1     0     0     1
    3     2     1     0     0
    4     2     0     0     1
    5     3     1     0     0
    6     3     0     1     0
    7     4     0     0     1
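To see why the pipeline works, it can help to stop after the `filter()` step and look at the intermediate long table. The following is only a sketch that reuses `dat` from the question; the name `long` is illustrative and does not appear in the original answer.

```r
library(dplyr)
library(tidyr)
library(tibble)

dat <- tibble(id = 1:4,
              col1 = c(0, 1, 1, 0),
              col2 = c(1, 0, 1, 0),
              col3 = c(1, 1, 0, 1))

# One row per (id, column) pair, keeping only the non-zero cells.
# `long` is an illustrative name, not part of the original answer.
long <- dat %>%
  pivot_longer(-id) %>%
  filter(value != 0)

# Each surviving row becomes exactly one row of the final result,
# so the row count equals the number of 1s in the original data.
nrow(long)
```

Giving each of these rows its own key before `pivot_wider()` is what prevents the 1s that share an id from being merged back into a single row.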

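I'd suggest not modifying the `id` column, so that the id key is not lost. You can simply create a temporary column with `mutate(rows = 1:nrow(.))` and drop it afterwards with `select(-rows)`. (2 upvotes)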
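The temporary-column idea can also be wrapped in a small function. This is a hedged variation, not the answerer's code: `split_ones` is a hypothetical name, `row_number()` is swapped in for `1:nrow(.)`, and the `names_from`/`values_from` arguments are spelled out rather than left at their defaults.

```r
library(dplyr)
library(tidyr)
library(tibble)

# Hypothetical helper (not from the original answer): returns one
# output row per non-zero cell, leaving the `id` column untouched.
split_ones <- function(df) {
  df %>%
    pivot_longer(-id, names_to = "name", values_to = "value") %>%
    filter(value != 0) %>%
    mutate(rows = row_number()) %>%      # temporary key, one per kept cell
    pivot_wider(names_from = name, values_from = value,
                values_fill = 0, names_sort = TRUE) %>%
    select(-rows)                        # drop the temporary key again
}

dat <- tibble(id = 1:4,
              col1 = c(0, 1, 1, 0),
              col2 = c(1, 0, 1, 0),
              col3 = c(1, 1, 0, 1))

res <- split_ones(dat)
res
```

Two limitations follow from the `filter()` step: an id whose columns are all 0 produces no rows at all, and a column that is 0 for every id does not reappear after `pivot_wider()`. Neither case occurs in the example data.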

Archived:	3 years, 6 months ago
Views:	195
Last recorded:	3 years, 6 months ago