Aka*_*all 18 python statistics scipy
我正在寻找Python中的测试:
> survivors <- matrix(c(1781,1443,135,47), ncol=2)
> colnames(survivors) <- c('survived','died')
> rownames(survivors) <- c('no seat belt','seat belt')
> survivors
survived died
no seat belt 1781 135
seat belt 1443 47
> prop.test(survivors)
2-sample test for equality of proportions with continuity correction
data: survivors
X-squared = 24.3328, df = 1, p-value = 8.105e-07
alternative hypothesis: two.sided
95 percent confidence interval:
-0.05400606 -0.02382527
sample estimates:
prop 1 prop 2
0.9295407 0.9684564
Run Code Online (Sandbox Code Playgroud)
我最感兴趣的是p-value计算.
这里的例子就是这里的例子
Aka*_*all 20
我想我明白了:
In [11]: from scipy import stats
In [12]: import numpy as np
In [13]: survivors = np.array([[1781,135], [1443, 47]])
In [14]: stats.chi2_contingency(survivors)
Out[14]:
(24.332761232771361, # x-squared
8.1048817984512269e-07, # p-value
1,
array([[ 1813.61832061, 102.38167939],
[ 1410.38167939, 79.61832061]]))
Run Code Online (Sandbox Code Playgroud)
添加@Akavall 的答案:如果您没有明确指定“失败”计数(示例中的死亡人数),R 可以prop.test让您仅指定试验总数,例如prop.test(c(1781, 1443), c(1781+135, 1443+47))会给您与列联表相同的结果建造的。
Scipychi2_contingency明确要求提供失败计数和完整的列联表。如果您没有明确地获得失败计数,而只是想检查两个样本的成功占总数的比例是否相等,您可以使用 scipy 的函数来破解
survivors = np.array([[1781, total1 - 1781], [1443, total2 - 47]])
chi2_contingency(survivors)
# Result:
(24.332761232771361, 8.1048817984512269e-07, 1,
array([[ 1813.61832061, 102.38167939],
[ 1410.38167939, 79.61832061]]))
Run Code Online (Sandbox Code Playgroud)
我花了一些时间才弄清楚这一点。希望它能帮助某人。
| 归档时间: |
|
| 查看次数: |
10689 次 |
| 最近记录: |