将数据帧转换为类"dist"的对象,而不实际计算R中的距离

Eli*_*eth 9 r distance

我有一个带距离的数据框

df<-data.frame(site.x=c("A","A","A","B","B","C"),   
site.y=c("B","C","D","C","D","D"),Distance=c(67,57,64,60,67,60))
Run Code Online (Sandbox Code Playgroud)

我需要将其转换为类"dist"的对象,但我不需要计算距离,因此我可以使用dist()函数.有什么建议?

Rei*_*son 13

没有什么可以阻止你自己创建dist对象.它只是一个带有属性的距离矢量,用于设置标签,大小等.

使用你的df,这是怎么回事

dij2 <- with(df, Distance)
nams <- with(df, unique(c(as.character(site.x), as.character(site.y))))
attributes(dij2) <- with(df, list(Size = length(nams),
                                  Labels = nams,
                                  Diag = FALSE,
                                  Upper = FALSE,
                                  method = "user"))
class(dij2) <- "dist"
Run Code Online (Sandbox Code Playgroud)

或者您可以structure()直接执行此操作:

dij3 <- with(df, structure(Distance,
                           Size = length(nams),
                           Labels = nams,
                           Diag = FALSE,
                           Upper = FALSE,
                           method = "user",
                           class = "dist"))
Run Code Online (Sandbox Code Playgroud)

这些给出:

> df
  site.x site.y Distance
1      A      B       67
2      A      C       57
3      A      D       64
4      B      C       60
5      B      D       67
6      C      D       60
> dij2
   A  B  C
B 67      
C 57 60   
D 64 67 60
> dij3
   A  B  C
B 67      
C 57 60   
D 64 67 60
Run Code Online (Sandbox Code Playgroud)

注意:上面没有检查数据的顺序是否正确.确保您df按照正确的顺序输入数据,就像在示例中一样; 即排序site.x,然后site.y运行我展示代码之前.


joh*_*nes 3

不久前我也遇到了类似的问题,并这样解决:

n <- max(table(df$site.x)) + 1  # +1,  so we have diagonal of 
res <- lapply(with(df, split(Distance, df$site.x)), function(x) c(rep(NA, n - length(x)), x))
res <- do.call("rbind", res)
res <- rbind(res, rep(NA, n))
res <- as.dist(t(res))
Run Code Online (Sandbox Code Playgroud)