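I would like to understand the meaning of the values (results) returned by the h2o.predict() function in the H2O R package. I noticed that in some cases, when the predict column is 1, the value in the p1 column is lower than the value in the p0 column. My interpretation of the p0 and p1 columns is that they give the probability of each event, so I expected that when predict = 1 the probability p1 would be higher than the probability of the opposite event (p0). That is not always the case, as the following example with the prostate dataset shows.
Here is an executable example: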
library(h2o)
h2o.init(max_mem_size = "12g", nthreads = -1)
prostate.hex <- h2o.importFile("https://h2o-public-test-data.s3.amazonaws.com/smalldata/prostate/prostate.csv")
prostate.hex$CAPSULE <- as.factor(prostate.hex$CAPSULE)
prostate.hex$RACE <- as.factor(prostate.hex$RACE)
prostate.hex$DCAPS <- as.factor(prostate.hex$DCAPS)
prostate.hex$DPROS <- as.factor(prostate.hex$DPROS)
# splitFrame ratios must sum to less than 1; the remaining 0.10 forms the third split
prostate.hex.split <- h2o.splitFrame(data = prostate.hex,
  ratios = c(0.70, 0.20), seed = 1234)
train.hex <- prostate.hex.split[[1]]
validate.hex <- prostate.hex.split[[2]]
test.hex <- prostate.hex.split[[3]]
fit <- h2o.glm(y = "CAPSULE", x = …
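
The expectation described in the question (predict = 1 should imply p1 > p0) can be checked row by row. The following is a minimal base R sketch, not output from the model above: the data frame `pred` and its values are made up for illustration, and only the column names predict, p0 and p1 are taken from the question.

```r
# Hypothetical prediction rows with the columns named in the question
# (predict, p0, p1). The numbers are invented for illustration only.
pred <- data.frame(
  predict = c(0, 1, 1, 0),
  p0      = c(0.80, 0.30, 0.60, 0.55),
  p1      = c(0.20, 0.70, 0.40, 0.45)
)

# Label implied by the "pick the more probable class" reading,
# which is the same as comparing p1 to a cutoff of 0.5.
pred$expected <- as.integer(pred$p1 > pred$p0)

# Rows where the reported label disagrees with that reading.
mismatch <- pred[pred$predict != pred$expected, ]
print(mismatch)
```

To run the same check on real predictions, one option is to convert the frame returned by h2o.predict() with as.data.frame() and apply the same comparison; note that the predict column may then be a factor rather than a number.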