Posts by wuh*_*uha

Python Pandas: duplicating rows in a DataFrame

Suppose the data looks like this:

Store,Dept,Date,Weekly_Sales,IsHoliday
1,1,2010-02-05,24924.5,FALSE
1,1,2010-02-12,46039.49,TRUE
1,1,2010-02-19,41595.55,FALSE
1,1,2010-02-26,19403.54,FALSE
1,1,2010-03-05,21827.9,FALSE
1,1,2010-03-12,21043.39,FALSE
1,1,2010-03-19,22136.64,FALSE
1,1,2010-03-26,26229.21,FALSE
1,1,2010-04-02,57258.43,FALSE

I want to duplicate the rows where IsHoliday equals TRUE. I can do it like this:

is_hol = df['IsHoliday'] == True
df_try = df[is_hol]
df=df.append(df_try*10)

But is there a better way to do this? I need to duplicate the holiday rows 5 times, and with the approach above I would have to append 5 times.
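One possible approach, sketched here under assumptions rather than taken from the thread: build all the copies in a single `pd.concat` call, or repeat the index labels of the holiday rows. The name `n_copies` and the three-row sample are illustrative only. Note that `df_try*10` in the snippet above multiplies the values element-wise instead of repeating rows, and that `DataFrame.append` was removed in pandas 2.0, so the sketch uses `pd.concat`.

```python
import io

import numpy as np
import pandas as pd

# A few rows from the sample data in the question.
csv_text = """Store,Dept,Date,Weekly_Sales,IsHoliday
1,1,2010-02-05,24924.5,FALSE
1,1,2010-02-12,46039.49,TRUE
1,1,2010-02-19,41595.55,FALSE
"""
df = pd.read_csv(io.StringIO(csv_text))

n_copies = 5  # hypothetical name: number of extra copies of each holiday row
is_hol = df["IsHoliday"]  # read_csv parses TRUE/FALSE as booleans

# Option 1: add all the copies at the end with one concat call.
appended = pd.concat([df] + [df[is_hol]] * n_copies, ignore_index=True)

# Option 2: repeat index labels so the copies sit next to the original row.
repeats = np.where(is_hol, n_copies + 1, 1)
in_place = df.loc[df.index.repeat(repeats)].reset_index(drop=True)
```

Option 1 leaves the original rows first and the copies at the end; option 2 keeps the date order, which may matter if the frame is later treated as a time series.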

python dataframe pandas

54
Score
6
Answers
80k
Views

Python Pandas: joining DataFrames on the index

I want to join two DataFrames on the same column "Date". The code is as follows:

import pandas as pd
from datetime import datetime
df_train_csv = pd.read_csv('./train.csv',parse_dates=['Date'],index_col='Date')

start = datetime(2010, 2, 5)
end = datetime(2012, 10, 26)

df_train_fly = pd.date_range(start, end, freq="W-FRI")
df_train_fly = pd.DataFrame(pd.Series(df_train_fly), columns=['Date'])

merged = df_train_csv.join(df_train_fly.set_index(['Date']), on = ['Date'], how = 'right', lsuffix='_x')

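It complains that the DataFrame df_train_csv has no column named "Date". I want "Date" to be the index of the DataFrame, so what is the best way to join DataFrames that use the date as their index?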

Update:

Here is the sample data:

Date,Weekly_Sales
2010-02-05,24924.5
2010-02-12,46039.49
2010-02-19,41595.55
2010-02-26,19403.54
2010-03-05,21827.9
2010-03-12,21043.39
2010-03-19,22136.64
2010-03-26,26229.21
2010-04-02,57258.43
2010-04-09,42960.91
2010-04-16,17596.96
2010-04-23,16145.35
2010-04-30,16555.11
2010-05-07,17413.94
2010-05-14,18926.74
2010-05-21,14773.04
2010-05-28,15580.43
2010-06-04,17558.09
2010-06-11,16637.62
2010-06-18,16216.27
2010-06-25,16328.72
2010-07-02,16333.14
2010-07-09,17688.76
2010-07-16,17150.84
2010-07-23,15360.45
2010-07-30,15381.82
2010-08-06,17508.41
2010-08-13,15536.4
2010-08-20,15740.13
2010-08-27,15793.87
2010-09-03,16241.78
2010-09-10,18194.74
2010-09-17,19354.23
2010-09-24,18122.52
2010-10-01,20094.19 …
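A minimal sketch of one way to do this, assuming both frames carry "Date" as their index: `DataFrame.join` aligns on the index by default, so the `on=` argument (which looks for a column in the calling frame) can be dropped. The shortened sample, with one Friday deliberately left out, is an assumption made for illustration.

```python
import io

import pandas as pd

# Shortened sample data; 2010-02-19 is left out on purpose.
csv_text = """Date,Weekly_Sales
2010-02-05,24924.5
2010-02-12,46039.49
2010-02-26,19403.54
"""
df_train_csv = pd.read_csv(
    io.StringIO(csv_text), parse_dates=["Date"], index_col="Date"
)

# Every Friday in the range, used as the index of an otherwise empty frame.
fridays = pd.date_range("2010-02-05", "2010-02-26", freq="W-FRI")
df_train_fly = pd.DataFrame(index=pd.Index(fridays, name="Date"))

# Index-on-index join: no `on=` needed because "Date" is the index on both sides.
merged = df_train_csv.join(df_train_fly, how="right")

# Alternative with the same effect here: reindex onto the full Friday range.
reindexed = df_train_csv.reindex(fridays)
```

In both results the missing Friday appears as a row with NaN in Weekly_Sales, which is usually what a right join against a complete calendar is meant to expose.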

python indexing data-analysis pandas

20
Score
1
Answers
40k
Views

Python Pandas: figsize is not defined

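I am new to using pandas for data analysis, and I have just installed pandas with the required dependencies (NumPy, python-dateutil, pytz, numexpr, bottleneck and matplotlib). But when I tried the most basic code: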

import pandas as pd
pd.set_option('display.mpl_style', 'default') # Make the graphs a bit prettier
figsize(15, 5)

it complains with NameError: name 'figsize' is not defined.

I am not sure whether I need any other dependencies. Can anyone shed some light on this?
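As far as I know, `figsize` is not a pandas function; it is a helper that IPython makes available in pylab mode, so a plain Python session does not define it. A minimal sketch of an equivalent that relies only on matplotlib follows. The "ggplot" style is an assumed stand-in for the `display.mpl_style` option, and the Agg backend is chosen only so the sketch runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend; drop this line in a notebook

import matplotlib.pyplot as plt

# Assumed replacement for pd.set_option('display.mpl_style', 'default').
plt.style.use("ggplot")

# Equivalent of figsize(15, 5): set the default figure size in inches.
plt.rcParams["figure.figsize"] = (15, 5)

# Any figure created afterwards picks up the new default size.
fig, ax = plt.subplots()
ax.plot([1, 2, 3], [24924.5, 46039.49, 41595.55])
```

The size can also be passed per figure, for example `plt.subplots(figsize=(15, 5))`, when changing the global default is not wanted.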

python data-analysis pandas

7
Score
2
Answers
10k
Views

C# projects using x86 and Any CPU

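Suppose I have 2 WinForms projects, A and B.

Project A (targeting .NET 2.0) must run as x86 (it is an external library) and, for legacy reasons, project B (targeting .NET 4.0) must run as Any CPU. Now I need to call A's methods from project B, but it throws an error: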

"An unhandled exception of type 'System.BadImageFormatException' occurred in System.Windows.Forms.dll"

Additional information: Could not load file or assembly 'CSharpDemo, Version=1.0.5414.18636, Culture=neutral, PublicKeyToken=null' or one of its dependencies. An attempt was made to load a program with an incorrect format."

However, I found that if I target project B at .NET 4.5, the problem no longer occurs. But I still want project B to target 4.0. How can I achieve this?

.net c# visual-studio

5
Score
2
Answers
4351
Views

Tag statistics

pandas ×3

python ×3

data-analysis ×2

.net ×1

c# ×1

dataframe ×1

indexing ×1

visual-studio ×1