如何从R中的数据框中删除带有inf的行

library(dplyr)

# sample data
df <- data_frame(a = c(1, 2, 3, NA), b = c(5, Inf, 8, 8), c = c(9, 10, Inf, 11), d = c('a', 'b', 'c', 'd'))

# across all columns:
df %>% 
  filter_all(all_vars(!is.infinite(.)))

# note that is.finite() does not work with NA or strings:
df %>% 
  filter_all(all_vars(is.finite(.)))

# checking only numeric columns:
df %>% 
  filter_if(~is.numeric(.), all_vars(!is.infinite(.)))

# checking only select columns, in this case a through c:
df %>% 
  filter_at(vars(a:c), all_vars(!is.infinite(.)))

Run Code Online (Sandbox Code Playgroud)

Answer 3

akr*_*run 10

对象的is.finite作用,vector而不是data.frame对象的作品.因此,我们可以遍历data.frame使用lapply并获得"有限"值.

lapply(df, function(x) x[is.finite(x)])

Run Code Online (Sandbox Code Playgroud)

如果数量Inf,-Inf为每列值是不同的,上面的代码将具有list与具有不相等的元件length.因此,最好将其留作list.如果我们想要一个data.frame,它应该有相同的长度.

如果我们要删除包含任何NA或Inf/-Inf值的行

df[Reduce(`&`, lapply(df, function(x) !is.na(x)  & is.finite(x))),]

Run Code Online (Sandbox Code Playgroud)

或@nicola的紧凑选项

df[Reduce(`&`, lapply(df, is.finite)),]

Run Code Online (Sandbox Code Playgroud)

如果我们准备使用包,那么紧凑的选项就是 NaRV.omit

library(IDPmisc)
NaRV.omit(df)

Run Code Online (Sandbox Code Playgroud)

数据

set.seed(24)
df <- as.data.frame(matrix(sample(c(1:5, NA, -Inf, Inf), 
                      20*5, replace=TRUE), ncol=5))

Run Code Online (Sandbox Code Playgroud)

Answer 4

use*_*230 5

我花了一段时间才为dplyr 1.0.0解决这个问题，所以我想我会使用新版本的 @sbha 解决方案c_across，因为filter_all,filter_if已被弃用。

library(dplyr)
df <- tibble(a = c(1, 2, 3, NA), b = c(5, Inf, 8, 8), c = c(9, 10, Inf, 11), d = c('a', 'b', 'c', 'd'))
#       a     b     c d    
#   <dbl> <dbl> <dbl> <chr>
# 1     1     5     9 a    
# 2     2   Inf    10 b    
# 3     3     8   Inf c    
# 4    NA     8    11 d 

df %>% 
  rowwise %>% 
  filter(!all(is.infinite(c_across(where(is.numeric)))))
# # A tibble: 4 x 4
# # Rowwise: 
#       a     b     c d    
#   <dbl> <dbl> <dbl> <chr>
# 1     1     5     9 a    
# 2     2   Inf    10 b    
# 3     3     8   Inf c    
# 4    NA     8    11 d 

df %>% 
  rowwise %>% 
  filter(!any(is.infinite(c_across(where(is.numeric)))))
# # A tibble: 2 x 4
# # Rowwise: 
#       a     b     c d    
#   <dbl> <dbl> <dbl> <chr>
# 1     1     5     9 a    
# 2    NA     8    11 d 

df %>% 
  rowwise %>% 
  filter(!any(is.infinite(c_across(a:c))))

# # A tibble: 2 x 4
# # Rowwise: 
#       a     b     c d    
#   <dbl> <dbl> <dbl> <chr>
# 1     1     5     9 a    
# 2    NA     8    11 d

Run Code Online (Sandbox Code Playgroud)

说实话，我认为@sbha 的答案更简单！

归档时间：	9 年，10 月前
查看次数：	39474 次
最近记录：	6 年，11 月前