向量内的平均邻居

Lou*_*lou 11 r vector neighbours difference

我的数据:

data <- c(1,5,11,15,24,31,32,65)
Run Code Online (Sandbox Code Playgroud)

有两个邻居:31和32.我希望删除它们并仅保留平均值(例如31.5),这样数据将是:

data <- c(1,5,11,15,24,31.5,65)
Run Code Online (Sandbox Code Playgroud)

这似乎很简单,但我希望自动完成,有时候会有包含更多邻居的向量.例如 :

data_2 <- c(1,5,11,15,24,31,32,65,99,100,101,140)
Run Code Online (Sandbox Code Playgroud)

Sot*_*tos 7

这是另一个创建id via的想法cumsum(c(TRUE, diff(a) > 1)),其中1显示了间隙阈值,即

#our group variable
grp <- cumsum(c(TRUE, diff(a) > 1))

#keep only groups with length 1 (i.e. with no neighbor)
i1 <- a[!!!ave(a, grp, FUN = function(i) length(i) > 1)] 

#Find the mean of the groups with more than 1 rows,
i2 <- unname(tapply(a, grp, function(i)mean(i[length(i) > 1])))

#Concatenate the above 2 (eliminating NAs from i2) to get final result
c(i1, i2[!is.na(i2)])
#[1]  1.0  5.0 11.0 15.0 24.0 65.0 31.5
Run Code Online (Sandbox Code Playgroud)

您也可以将其包装在一个函数中.我把间隙留作参数让你可以调整,

get_vec <- function(x, gap) {
    grp <- cumsum(c(TRUE, diff(x) > gap))
    i1 <- x[!!!ave(x, grp, FUN = function(i) length(i) > 1)]
    i2 <- unname(tapply(x, grp, function(i) mean(i[length(i) > 1])))
    return(c(i1, i2[!is.na(i2)]))
}

get_vec(a, 1)
#[1]  1.0  5.0 11.0 15.0 24.0 65.0 31.5

get_vec(a_2, 1)
#[1]   1.0   5.0  11.0  15.0  24.0  65.0 140.0  31.5 100.0
Run Code Online (Sandbox Code Playgroud)

数据:

a <- c(1,5,11,15,24,31,32,65)
a_2 <- c(1, 5, 11, 15, 24, 31, 32, 65, 99, 100, 101, 140)
Run Code Online (Sandbox Code Playgroud)