将列粘贴在一起,不使 NA 成为字符

sal*_*all 3 r na

我有一个相当大的数据集,其中有多个缺失值和重复值。我的第一个目标是创建一个新列 ( Name),它由三个现有列组成,例如FirstNameMiddleInitialLastName

我努力了:

owners4$Name <- paste(owners4$FirstName, owners4$MiddleInitial, owners4$LastName)
Run Code Online (Sandbox Code Playgroud)

但这会导致NAs 被粘贴为字符而不仅仅是 NA。之后,我将删除NA新列中包含 的每一行。

有人确定我可以实现这一目标吗?

zx8*_*754 5

使用 na.omit 粘贴列,请参见示例:

# reproducible example
owners4 <- data.frame(FirstName = c("Aa", "Bb", NA),
                      MiddleInitial = c("T", "U", NA),
                      LastName = c(NA, "Yyy", NA))

owners4$Name <- apply(owners4[, c("FirstName", "MiddleInitial", "LastName")], 1,
                      function(i){ paste(na.omit(i), collapse = " ") })

owners4
#   FirstName MiddleInitial LastName     Name
# 1        Aa             T     <NA>     Aa T
# 2        Bb             U      Yyy Bb U Yyy
# 3      <NA>          <NA>     <NA>         
Run Code Online (Sandbox Code Playgroud)

现在过滤掉名称为空的行

result <- owners4[ owners4$Name != "", ]
result
#   FirstName MiddleInitial LastName     Name
# 1        Aa             T     <NA>     Aa T
# 2        Bb             U      Yyy Bb U Yyy
Run Code Online (Sandbox Code Playgroud)