StatsModels "shapes not aligned" error

Rup*_*ert 5 python statsmodels

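I get an error when I try to run a multivariable linear regression in Statsmodels. When I hard-code just one X column in the xData variable, everything works fine.

Can anyone give me some advice on what I am missing here? I would greatly appreciate it.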

Error:

ValueError: shapes (747,2) and (747,2) not aligned: 2 (dim 1) != 747 (dim 0)
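
For context, the wording of this message matches what NumPy reports when a matrix product is attempted between two arrays whose inner dimensions differ. The sketch below only illustrates that rule with two placeholder arrays of the shapes named in the error. It does not claim to show where inside statsmodels the product happens, and the names `a`, `b`, `message` and `aligned` are made up for illustration.

```python
import numpy as np

# Two placeholder arrays with the shapes named in the error message.
a = np.zeros((747, 2))
b = np.zeros((747, 2))

try:
    # (747, 2) . (747, 2): the inner dimensions are 2 and 747, so this fails.
    np.dot(a, b)
    message = None
except ValueError as exc:
    message = str(exc)

print(message)

# Transposing the first operand makes the inner dimensions agree (747 and 747).
aligned = np.dot(a.T, b)
print(aligned.shape)
```

A two-column array on both sides of the product is therefore enough to trigger the message, which suggests checking how many columns each argument passed to the model really has.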

Code:

import pandas as pd
import statsmodels.api as sm
import itertools

data = pd.read_csv("deaconFoodData.csv")

for i in range(2,10,1):
    xCombinations = itertools.combinations(["Food Exp","HH Size","HH Inc","Highest Ed Head","Age Head","Shopping Time","Kid <6","Kid 6-18","Eating Healthy"], i)
    print(str(i) + " variables")
    for combination in xCombinations:

        comb = list(combination)

        print(comb)
        xData = data[["Food Exp", "HH Size"]] # data[comb]
        yData = data["Shopping LH"]
        yData = sm.add_constant(yData, prepend=False)
        print(yData)
        # Fit and summarize OLS model
        mod = sm.OLS(xData, yData)
        res = mod.fit()
        print(res.rsquared)

GitHub link: https://github.com/deacons2016/DeaconFood