import statsmodels.formula.api as sm
import numpy as np
import pandas
url = "http://vincentarelbundock.github.com/Rdatasets/csv/HistData/Guerry.csv"
df = pandas.read_csv(url)
df = df[['Lottery', 'Literacy', 'Wealth', 'Region']].dropna()
print df.head()
mod = sm.ols(formula='Lottery ~ Literacy + Wealth + Region', data=df)
res = mod.fit()
print res.summary()
After printing the table, this returns the error below. The statsmodels example does not seem to work.
---------------------------------------------------------------------------
TypeError Traceback (most recent call last)
<ipython-input-4-f69caff21ed0> in <module>()
6 df = df[['Lottery', 'Literacy', 'Wealth', 'Region']].dropna()
7 print df.head()
----> 8 mod = sm.ols(formula='Lottery ~ Literacy + Wealth + Region', data=df)
9 res = mod.fit()
10 print res.summary()
TypeError: from_formula() takes at least 3 arguments (2 given)
This does not seem like acceptable behavior. What exactly am I doing wrong?
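I think this is because formula and data are not keyword arguments. Try removing the keyword names from the call to ols and passing both positionally. – user333700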
Works for me. Can you type `import statsmodels.api as sm; print sm.version.version` and `import patsy; print patsy.__version__` at a prompt and let me know what you get? – TomAugspurger
`import statsmodels.api as sm; print sm.version.version` gives 0.5.0.dev-1bbd4ca – tipanverella
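To illustrate the first comment's point, here is a minimal sketch of how a keyword name that does not match the method's parameter name can produce this kind of TypeError. `ToyModel` and its parameter name `df` are hypothetical stand-ins, not the statsmodels source, and the rows are made-up values rather than the Guerry data. The assumption, not confirmed in this thread, is that the development build in question did not name its second parameter `data`.

```python
# Hypothetical stand-in for a model class; this is NOT statsmodels code.
# Assumption: the data parameter is named `df`, not `data`.
class ToyModel:
    def __init__(self, formula, frame):
        self.formula = formula
        self.frame = frame

    @classmethod
    def from_formula(cls, formula, df, **kwargs):
        # A `data=` keyword would be swallowed by **kwargs, leaving `df` unfilled.
        return cls(formula, df)


rows = [{"Lottery": 1, "Literacy": 2}]  # made-up placeholder values

# Keyword call: `data` does not match the parameter name `df`, so it fails.
try:
    ToyModel.from_formula(formula="Lottery ~ Literacy", data=rows)
    keyword_call_failed = False
except TypeError as exc:
    keyword_call_failed = True
    print("keyword call failed:", exc)

# Positional call: binds by position, whatever the parameter is called.
model = ToyModel.from_formula("Lottery ~ Literacy", rows)
print(model.formula)
```

If that assumption holds, the equivalent change in the question's code would be `sm.ols('Lottery ~ Literacy + Wealth + Region', df)`, as the comment suggests.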