Given this DataFrame, how can I select only the rows where "Col2" is NaN?
In [56]: df = pd.DataFrame([range(3), [0, np.nan, 0], [0, 0, np.nan], range(3), range(3)], columns=["Col1", "Col2", "Col3"])
In [57]: df
Out[57]:
   Col1  Col2  Col3
0     0     1     2
1     0   NaN     0
2     0     0   NaN
3     0     1     2
4     0     1     2
The result should be this:
Out[57]:
   Col1  Col2  Col3
1     0   NaN     0
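One possible approach, sketched here under the assumption that the frame is built as in the question, is to build a boolean mask with `Series.isna()` and index the frame with it. Comparing with `== np.nan` would not work, because NaN never compares equal to itself. The name `nan_rows` is only illustrative.

```python
import numpy as np
import pandas as pd

# Same data as in the question, written out explicitly
df = pd.DataFrame(
    [[0, 1, 2], [0, np.nan, 0], [0, 0, np.nan], [0, 1, 2], [0, 1, 2]],
    columns=["Col1", "Col2", "Col3"],
)

# Boolean mask: True where Col2 is missing
mask = df["Col2"].isna()

# Keep only the rows where the mask is True
nan_rows = df[mask]
print(nan_rows)
```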
I have a DataFrame:
   time_diff  avg_trips
0   0.450000        1.0
1   0.483333        1.0
2   0.500000        1.0
3   0.516667        1.0
4   0.533333        2.0
I want to get the first quartile, the third quartile, and the median of the time_diff column. To get the median, I use np.median(df["time_diff"].values).
How can I compute the quartiles?
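A minimal sketch of one way to do this, assuming the data shown above: `Series.quantile` accepts a list of fractions, so the quartiles and the median can be requested in one call. The variable names `q1`, `median`, and `q3` are illustrative, and pandas' default linear interpolation is assumed.

```python
import pandas as pd

df = pd.DataFrame({
    "time_diff": [0.450000, 0.483333, 0.500000, 0.516667, 0.533333],
    "avg_trips": [1.0, 1.0, 1.0, 1.0, 2.0],
})

# Request the 25th, 50th and 75th percentiles at once;
# the result is a Series indexed by the requested fractions
quartiles = df["time_diff"].quantile([0.25, 0.5, 0.75])

q1 = quartiles.loc[0.25]
median = quartiles.loc[0.5]
q3 = quartiles.loc[0.75]
print(q1, median, q3)
```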
I have a DataFrame df:
df = pd.DataFrame(columns=["App","Feature1", "Feature2","Feature3",
"Feature4","Feature5",
"Feature6","Feature7","Feature8"],
data=[["SHA",0,0,1,1,1,0,1,0],
["LHA",1,0,1,1,0,1,1,0],
["DRA",0,0,0,0,0,0,1,0],
["FRA",1,0,1,1,1,0,1,1],
["BRU",0,0,1,0,1,0,0,0],
["PAR",0,1,1,1,1,0,1,0],
["AER",0,0,1,1,0,1,1,0],
["SHE",0,0,0,1,0,0,1,0]])
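I want to create a stacked bar chart in which each segment of a stack corresponds to an App, the Y axis shows the count of 1 values, and the X axis shows the Feature.
It should look similar to this bar chart; the only difference is that I now want stacked bars and a legend with colors: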
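import matplotlib.pyplot as plt
import seaborn as sns

# Count the 1 values per feature column (every column except "App")
df_c = df.iloc[:, 1:].eq(1).sum().rename_axis('Feature').reset_index(name='Count')
df_c = df_c.sort_values('Count')
plt.figure(figsize=(12,8))
ax = sns.barplot(x="Feature", y="Count", data=df_c, palette=sns.color_palette("GnBu", 10))
plt.xticks(rotation='vertical')
# Pass the on/off flag positionally; the keyword "b" was renamed to "visible" in newer Matplotlib
ax.grid(True, which='major', color='#d3d3d3', linewidth=1.0)
ax.grid(True, which='minor', color='#d3d3d3', linewidth=0.5)
plt.show()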
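One way the stacked version might be produced, as a sketch rather than a definitive answer: transpose the frame so that features become rows and apps become columns, then let pandas draw a stacked bar plot. The name `counts`, the colormap, and the axis labels are assumptions chosen for illustration.

```python
import pandas as pd
import matplotlib.pyplot as plt

df = pd.DataFrame(
    columns=["App", "Feature1", "Feature2", "Feature3", "Feature4",
             "Feature5", "Feature6", "Feature7", "Feature8"],
    data=[["SHA", 0, 0, 1, 1, 1, 0, 1, 0],
          ["LHA", 1, 0, 1, 1, 0, 1, 1, 0],
          ["DRA", 0, 0, 0, 0, 0, 0, 1, 0],
          ["FRA", 1, 0, 1, 1, 1, 0, 1, 1],
          ["BRU", 0, 0, 1, 0, 1, 0, 0, 0],
          ["PAR", 0, 1, 1, 1, 1, 0, 1, 0],
          ["AER", 0, 0, 1, 1, 0, 1, 1, 0],
          ["SHE", 0, 0, 0, 1, 0, 0, 1, 0]],
)

# Rows become features, columns become apps; each cell is 0 or 1
counts = df.set_index("App").T

# Each bar is one feature; its total height is the number of apps with a 1,
# and each colored segment (with its own legend entry) is one app
ax = counts.plot(kind="bar", stacked=True, figsize=(12, 8), colormap="GnBu")
ax.set_xlabel("Feature")
ax.set_ylabel("Count")
ax.legend(title="App")
```

Calling plt.show() afterwards displays the figure in an interactive session.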
df =
Col1 Col2 Col3
1 nan 4
2 5 4
3 3 nan
Given the DataFrame df, I want to obtain a new DataFrame df2 that contains no nan in column Col2. This is the expected result:
df2 =
Col1 Col2 Col3
2 5 4
3 3 nan
I know this is possible with pandas.isnull and dropna, but how do I restrict the filtering to one specific column?
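A minimal sketch, assuming the frame shown above: `DataFrame.dropna` takes a `subset` argument that limits the NaN check to the listed columns, so NaN values in other columns (such as Col3 here) are left alone.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "Col1": [1, 2, 3],
    "Col2": [np.nan, 5, 3],
    "Col3": [4, 4, np.nan],
})

# Drop a row only if Col2 is NaN; NaN in Col3 does not matter
df2 = df.dropna(subset=["Col2"])
print(df2)
```

An equivalent boolean-mask form would be df[df["Col2"].notna()].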
I have this data:
ID TIME
1 2
1 4
1 2
2 3
I want to group the data by ID and compute the mean time and the size of each group.
ID MEAN_TIME COUNT
1 2.67 3
2 3.00 1
If I run this code, I get the error "ValueError: cannot insert ID, already exists":
result = df.groupby(['ID']).agg({'TIME': 'mean', 'ID': 'count'}).reset_index()
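The error appears because ID ends up both as the group index and as an aggregated column, so reset_index() has nowhere to put it. One way around this, sketched under the assumption of a pandas version that supports named aggregation (0.25 or later), is to aggregate only TIME and give each output column its own name.

```python
import pandas as pd

df = pd.DataFrame({"ID": [1, 1, 1, 2], "TIME": [2, 4, 2, 3]})

# Named aggregation: output_column=(input_column, function).
# Both results are computed from TIME, so ID stays only in the index
# and reset_index() can turn it back into a column without a clash.
result = (
    df.groupby("ID")
      .agg(MEAN_TIME=("TIME", "mean"), COUNT=("TIME", "size"))
      .reset_index()
)
print(result)
```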
I want to extract all unique combinations of values of the columns Col1, Col2, and Col3. Suppose there is the following DataFrame df:
df =
Col1 Col2 Col3
12 AB 13
11 AB 13
12 AB 13
12 AC 14
The answer is:
unique =
Col1 Col2 Col3
12 AB 13
11 AB 13
12 AC 14
I know how to get the unique values of a single column, i.e. df.Col1.unique(), but I am not sure how to get unique combinations.
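A possible sketch, assuming the frame shown above: `DataFrame.drop_duplicates` keeps the first occurrence of each distinct row, and its `subset` argument restricts the comparison to the listed columns. The index reset is optional and only there to renumber the rows.

```python
import pandas as pd

df = pd.DataFrame({
    "Col1": [12, 11, 12, 12],
    "Col2": ["AB", "AB", "AB", "AC"],
    "Col3": [13, 13, 13, 14],
})

# Keep the first occurrence of every (Col1, Col2, Col3) combination
unique = (
    df.drop_duplicates(subset=["Col1", "Col2", "Col3"])
      .reset_index(drop=True)
)
print(unique)
```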
I get the error TypeError: Cannot do inplace boolean setting on mixed-types with a non np.nan value when I try to replace numeric values in multiple columns with a specific string value.
df =
TYPE VD_1 VD_2 VD_3
AAA 1234 22122 2345
AAA 1234 2345 22122
This is how I do it:
df[df.isin([22122])] = "English"
or
df[df==22122] = "English"
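One alternative that avoids in-place boolean assignment on a mixed-type frame, sketched here with the data from the question: `DataFrame.replace` returns a new frame in which every matching value is swapped, and the affected columns are upcast to object dtype as needed. The name `replaced` is illustrative.

```python
import pandas as pd

df = pd.DataFrame({
    "TYPE": ["AAA", "AAA"],
    "VD_1": [1234, 1234],
    "VD_2": [22122, 2345],
    "VD_3": [2345, 22122],
})

# Replace every occurrence of 22122, in any column, with the string
replaced = df.replace(22122, "English")
print(replaced)
```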
I created a bar chart using the matplotlib.pyplot and seaborn libraries. How can I sort the bars in increasing order of Speed? I want the bar with the lowest speed on the left and the bar with the highest speed on the right.
df =
Id Speed
1 30
1 35
1 31
2 20
2 25
3 80
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
%matplotlib inline
# Median speed per Id
result = df.groupby(["Id"])['Speed'].aggregate(np.median).reset_index()
norm = plt.Normalize(df["Speed"].values.min(), df["Speed"].values.max())
colors = plt.cm.Reds(norm(df["Speed"]))
plt.figure(figsize=(12,8))
# Plot the aggregated frame
sns.barplot(x="Id", y="Speed", data=result, palette=colors)
plt.ylabel('Speed', fontsize=12)
plt.xlabel('Id', fontsize=12)
plt.xticks(rotation='vertical')
plt.show()
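A sketch of one way to get the bars in ascending order, assuming the data above: sort the aggregated frame by Speed before plotting and let the sorted Id values define the bar order. The example uses plain matplotlib to stay self-contained; the name `sorted_result` is illustrative.

```python
import pandas as pd
import matplotlib.pyplot as plt

df = pd.DataFrame({
    "Id": [1, 1, 1, 2, 2, 3],
    "Speed": [30, 35, 31, 20, 25, 80],
})

# Median speed per Id, then sorted from slowest to fastest
result = df.groupby("Id")["Speed"].median().reset_index()
sorted_result = result.sort_values("Speed").reset_index(drop=True)

fig, ax = plt.subplots(figsize=(12, 8))
# Converting Id to strings makes matplotlib treat it as categories,
# so the bars appear in the order of the sorted frame
ax.bar(sorted_result["Id"].astype(str), sorted_result["Speed"])
ax.set_xlabel("Id", fontsize=12)
ax.set_ylabel("Speed", fontsize=12)
```

With seaborn, passing order=sorted_result["Id"] to sns.barplot should give the same left-to-right ordering.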
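I am using interact.js and I want draggable objects to be droppable only inside a predefined div, .dropzone, so that dropping them outside .dropzone is not allowed. I added restriction: ".dropzone", drag: document.getElementById('dropzone'), but it does not work properly.
I created this JSFiddle to make it clearer (I updated the link because the old one was incorrect).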
JavaScript:
// target elements with the "draggable" class
interact('.draggable').draggable({
inertia: true,
restrict: {
restriction: ".dropzone",
drag: document.getElementById('dropzone'),
endOnly: true,
elementRect: {
top: 0,
left: 0,
bottom: 1,
right: 1
}
},
autoScroll: true,
onmove: function(event) {
var target = event.target;
var x = (parseFloat(target.getAttribute('data-x')) || 0) + event.dx;
var y = (parseFloat(target.getAttribute('data-y')) || 0) + event.dy;
target.style.transform = 'translate(' + x …
How can I delete a Kafka topic using the Kafka REST Proxy? I tried the following command, but it returns an error message:
curl -X DELETE XXX.XX.XXX.XX:9092/topics/test_topic
If that is not possible, how can I delete the messages and update the topic's schema?