如果数据如下:
Store,Dept,Date,Weekly_Sales,IsHoliday
1,1,2010-02-05,24924.5,FALSE
1,1,2010-02-12,46039.49,TRUE
1,1,2010-02-19,41595.55,FALSE
1,1,2010-02-26,19403.54,FALSE
1,1,2010-03-05,21827.9,FALSE
1,1,2010-03-12,21043.39,FALSE
1,1,2010-03-19,22136.64,FALSE
1,1,2010-03-26,26229.21,FALSE
1,1,2010-04-02,57258.43,FALSE
Run Code Online (Sandbox Code Playgroud)
我想复制IsHoliday等于TRUE的行,我可以这样做:
is_hol = df['IsHoliday'] == True
df_try = df[is_hol]
df=df.append(df_try*10)
Run Code Online (Sandbox Code Playgroud)
但有没有更好的方法来做到这一点因为我需要将假日行复制5次,如果使用上述方法我必须追加5次.
我想在同一列"Date"上加入到dataframe,代码如下:
import pandas as pd
from datetime import datetime
df_train_csv = pd.read_csv('./train.csv',parse_dates=['Date'],index_col='Date')
start = datetime(2010, 2, 5)
end = datetime(2012, 10, 26)
df_train_fly = pd.date_range(start, end, freq="W-FRI")
df_train_fly = pd.DataFrame(pd.Series(df_train_fly), columns=['Date'])
merged = df_train_csv.join(df_train_fly.set_index(['Date']), on = ['Date'], how = 'right', lsuffix='_x')
Run Code Online (Sandbox Code Playgroud)
它抱怨数据帧df_train_csv没有名为"Date"的列.我想在数据框中将"Date"设置为索引,我想知道将date作为索引加入dataframe的最佳方法是什么?
那是样本数据
Date,Weekly_Sales
2010-02-05,24924.5
2010-02-12,46039.49
2010-02-19,41595.55
2010-02-26,19403.54
2010-03-05,21827.9
2010-03-12,21043.39
2010-03-19,22136.64
2010-03-26,26229.21
2010-04-02,57258.43
2010-04-09,42960.91
2010-04-16,17596.96
2010-04-23,16145.35
2010-04-30,16555.11
2010-05-07,17413.94
2010-05-14,18926.74
2010-05-21,14773.04
2010-05-28,15580.43
2010-06-04,17558.09
2010-06-11,16637.62
2010-06-18,16216.27
2010-06-25,16328.72
2010-07-02,16333.14
2010-07-09,17688.76
2010-07-16,17150.84
2010-07-23,15360.45
2010-07-30,15381.82
2010-08-06,17508.41
2010-08-13,15536.4
2010-08-20,15740.13
2010-08-27,15793.87
2010-09-03,16241.78
2010-09-10,18194.74
2010-09-17,19354.23
2010-09-24,18122.52
2010-10-01,20094.19 …Run Code Online (Sandbox Code Playgroud) 我是pandas用于数据分析的新手,我刚刚安装了具有所需依赖项的pandas(NumPy,python-dateutil,pytz,numexpr,bottleneck和matplotlib).但是当我开始尝试最基本的代码时:
import pandas as pd
pd.set_option('display.mpl_style', 'default') # Make the graphs a bit prettier
figsize(15, 5)
Run Code Online (Sandbox Code Playgroud)
它抱怨NameError:名称'figsize'未定义.
我不确定我是否还需要其他依赖项.谁能对此有所了解?
假设我有2个Winform项目A,B.
项目A(目标.NET 2.0)必须在x86(它是一个外部库)上运行,并且由于遗留原因,项目B(目标.NET 4.0)必须在任何CPU上运行.现在我需要从项目B调用A的方法,但它会抛出一个错误:
"An unhandled exception of type 'System.BadImageFormatException' occurred in System.Windows.Forms.dll"
Additional information: Could not load file or assembly 'CSharpDemo, Version=1.0.5414.18636, Culture=neutral, PublicKeyToken=null' or one of its dependencies. An attempt was made to load a program with an incorrect format."
Run Code Online (Sandbox Code Playgroud)
但是,我发现如果我在.NET 4.5上定位项目B,那么这个问题就不会发生了.但是我仍然希望B项目定位到4.0,如何实现这一目标?