在sklearn中计算f统计量

Question

在sklearn中计算f统计量

Vic*_*i B 4 python linear-regression scikit-learn

我一直在热切地搜索，但找不到答案。

如何使用 sklearn 计算 f 统计量？考虑到以下公式，我真的必须手动计算它吗：

（其中是观测值的数量，是变量的数量）。

而且...如果我手动计算，如何获得相关的 p 值？

Answer 1

jro*_*ros 6

我希望这有帮助！要查找 f 统计量，请尝试：

import sklearn
from sklearn.linear_model import LinearRegression

X, y = df[['x1','x2']], df[['y']]

model=LinearRegression().fit(X, y)

Rsq = model.score

fstat = (Rsq/(1-Rsq))*((N-K-1)/K) #you should find N and K yourself

Run Code Online (Sandbox Code Playgroud)

要查找 p 值，您可以使用 python 包 symbulate

import symbulate as sm

dfN = 5 -1 #degrees of freedom in the numerator of F-statistic
dfD = 2 -1 #degrees of freedom in the denominator of F-statistic

pVal = 1-sm.F(dfN,dfD).cdf(fstat)

Run Code Online (Sandbox Code Playgroud)

归档时间：	6 年前
查看次数：	3119 次
最近记录：	2 年，3 月前