Lou*_*lou 11 r vector neighbours difference
我的数据:
data <- c(1,5,11,15,24,31,32,65)
Run Code Online (Sandbox Code Playgroud)
有两个邻居:31和32.我希望删除它们并仅保留平均值(例如31.5),这样数据将是:
data <- c(1,5,11,15,24,31.5,65)
Run Code Online (Sandbox Code Playgroud)
这似乎很简单,但我希望自动完成,有时候会有包含更多邻居的向量.例如 :
data_2 <- c(1,5,11,15,24,31,32,65,99,100,101,140)
Run Code Online (Sandbox Code Playgroud)
这是另一个创建id via的想法cumsum(c(TRUE, diff(a) > 1)),其中1显示了间隙阈值,即
#our group variable
grp <- cumsum(c(TRUE, diff(a) > 1))
#keep only groups with length 1 (i.e. with no neighbor)
i1 <- a[!!!ave(a, grp, FUN = function(i) length(i) > 1)]
#Find the mean of the groups with more than 1 rows,
i2 <- unname(tapply(a, grp, function(i)mean(i[length(i) > 1])))
#Concatenate the above 2 (eliminating NAs from i2) to get final result
c(i1, i2[!is.na(i2)])
#[1] 1.0 5.0 11.0 15.0 24.0 65.0 31.5
Run Code Online (Sandbox Code Playgroud)
您也可以将其包装在一个函数中.我把间隙留作参数让你可以调整,
get_vec <- function(x, gap) {
grp <- cumsum(c(TRUE, diff(x) > gap))
i1 <- x[!!!ave(x, grp, FUN = function(i) length(i) > 1)]
i2 <- unname(tapply(x, grp, function(i) mean(i[length(i) > 1])))
return(c(i1, i2[!is.na(i2)]))
}
get_vec(a, 1)
#[1] 1.0 5.0 11.0 15.0 24.0 65.0 31.5
get_vec(a_2, 1)
#[1] 1.0 5.0 11.0 15.0 24.0 65.0 140.0 31.5 100.0
Run Code Online (Sandbox Code Playgroud)
数据:
a <- c(1,5,11,15,24,31,32,65)
a_2 <- c(1, 5, 11, 15, 24, 31, 32, 65, 99, 100, 101, 140)
Run Code Online (Sandbox Code Playgroud)