rit*_*uri 9 python pandas quandl
I am stuck on the following lines:
import quandl,math
import pandas as pd
import numpy as np
from sklearn import preprocessing, model_selection, svm  # cross_validation was renamed to model_selection
from sklearn.linear_model import LinearRegression
df = quandl.get('WIKI/GOOGL')
df = df[['Adj. Open','Adj. High','Adj. Low','Adj. Close','Adj. Volume']]
df['HL_PCT'] = (df["Adj. High"] - df['Adj. Close'])/df['Adj. Close'] * 100
df['PCT_CHANGE'] = (df["Adj. Close"] - df['Adj. Open'])/df['Adj. Open'] * 100
df = df[['Adj. Close','HL_PCT','PCT_CHANGE','Adj. Open']]
forecast_col = 'Adj. Close'
df.fillna(-99999,inplace = True)
forecast_out = int(math.ceil(.1*len(df)))
df['label'] = df[forecast_col].shift(-forecast_out)
print(df.head())
I can't understand what df[forecast_col].shift(-forecast_out) means.
Please explain what this command does.
小智 16
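The shift function of pandas.DataFrame shifts the index by the desired number of periods, with an optional time frequency. When no frequency is passed, as in your code, the values move relative to the index and the vacated positions are filled with NaN. For more information, see the pandas documentation for DataFrame.shift.
Here is a small example of column values being shifted: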
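To make the optional frequency concrete, here is a minimal sketch comparing a plain shift with one that passes freq. The three-day series and the names idx, s, by_rows and by_time are made up for illustration and are not part of the question.

```python
import pandas as pd

# Hypothetical three-day series, used only for illustration.
idx = pd.date_range("2000-01-01", periods=3, freq="D")
s = pd.Series([10.0, 20.0, 30.0], index=idx)

# No freq: values move down one row, the index stays put,
# and the first row is filled with NaN.
by_rows = s.shift(1)

# With freq: the index labels move forward one day,
# while the values keep their original order.
by_time = s.shift(1, freq="D")

print(by_rows)
print(by_time)
```

The question's code passes no freq, so only the first behaviour applies there.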
import pandas as pd
import numpy as np
df = pd.DataFrame({"date": ["2000-01-03", "2000-01-03", "2000-03-05", "2000-01-03", "2000-03-05",
"2000-03-05", "2000-07-03", "2000-01-03", "2000-07-03", "2000-07-03"],
"variable": ["A", "A", "A", "B", "B", "B", "C", "C", "C", "D"],
"no": [1, 2.2, 3.5, 1.5, 1.5, 1.2, 1.3, 1.1, 2, 3],
"value": [0.469112, -0.282863, -1.509059, -1.135632, 1.212112, -0.173215,
0.119209, -1.044236, -0.861849, None]})
Below are the column values before shifting:
df['value']
Output:
0 0.469112
1 -0.282863
2 -1.509059
3 -1.135632
4 1.212112
5 -0.173215
6 0.119209
7 -1.044236
8 -0.861849
9 NaN
Name: value, dtype: float64
The shift function moves the values by the given number of periods.
For example, shift with a positive integer moves the row values down:
df['value'].shift(1)
Output:
0 NaN
1 0.469112
2 -0.282863
3 -1.509059
4 -1.135632
5 1.212112
6 -0.173215
7 0.119209
8 -1.044236
9 -0.861849
Name: value, dtype: float64
Using shift with a negative integer moves the row values up:
df['value'].shift(-1)
Output:
0 -0.282863
1 -1.509059
2 -1.135632
3 1.212112
4 -0.173215
5 0.119209
6 -1.044236
7 -0.861849
8 NaN
9 NaN
Name: value, dtype: float64
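Applied to the question's code, the negative shift means each row's label is the Adj. Close value from forecast_out rows later. The sketch below uses made-up prices in place of the Quandl data. The final dropna step and the name train are assumptions about a typical follow-up and do not come from the question.

```python
import math
import pandas as pd

# Hypothetical closing prices standing in for the Quandl data.
df = pd.DataFrame({"Adj. Close": [float(p) for p in range(100, 120)]})

forecast_col = "Adj. Close"
forecast_out = int(math.ceil(0.1 * len(df)))  # 10% of 20 rows

# Each row's label is the Adj. Close from forecast_out rows later,
# so the last forecast_out rows have no label and become NaN.
df["label"] = df[forecast_col].shift(-forecast_out)

print(df.head(3))
print(df.tail(3))

# Assumed follow-up: drop the rows whose future price is unknown.
train = df.dropna(subset=["label"])
```

With the real data, forecast_out is 10% of the number of rows, so each label is the adjusted close that many trading days ahead, and the last 10% of rows have NaN labels.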