在j参数中data.table,是否有语法允许我在同一j语句中引用先前创建的变量?我正在考虑像Lisp的let*构造.
library(data.table)
set.seed(22)
DT <- data.table(a = rep(1:5, each = 10),
b = sample(c(0,1), 50, rep = TRUE))
DT[ ,
list(attempts = .N,
successes = sum(b),
rate = successes / attempts),
by = a]
Run Code Online (Sandbox Code Playgroud)
这导致了
# Error in `[.data.table`(DT, , list(attempts = .N, successes = sum(b), :
# object 'successes' not found
Run Code Online (Sandbox Code Playgroud)
我理解为什么,但有不同的方法来实现这一点j吗?
这样就可以了:
DT[ , {
list(attempts = attempts <- .N,
successes = successes <- sum(b),
rate = successes/attempts)
}, by = a]
# a attempts successes rate
# 1: 1 10 5 0.5
# 2: 2 10 6 0.6
# 3: 3 10 3 0.3
# 4: 4 10 5 0.5
# 5: 5 10 5 0.5
Run Code Online (Sandbox Code Playgroud)
FWIW,这个密切相关的data.table功能请求可以使你的问题中使用+/-语法.从链接页面引用:
摘要:
:=(和`:=`(...))的迭代RHS ,以及多个:=内部j = {...}语法详细说明
例如
DT[, `:=`( m1 = mean(a), m2 = sd(a), s = m1/m2 ), by = group]其中s可以使用先前的lhs名称(使用'迭代'一词尝试传达它).
试试这个:
DT[,
{successes = sum(b);
attempts = .N;
list(attempts = attempts,
successes = successes,
rate = successes / attempts)
},
by = a]
Run Code Online (Sandbox Code Playgroud)
或者
DT[,
list(attempts = .N,
successes = sum(b)),
by = a][, rate := successes / attempts]
Run Code Online (Sandbox Code Playgroud)