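I want to draw a Scatterplot Matrix with labels and my own colors. I like this example: Scatterplot Matrix.
I have one problem: I don't understand how exactly to load data from a numpy matrix into a DataFrame that seaborn can use.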
data_resc = np.random.rand(150,2)
sns.set()
df = DataFrame(data_resc)
sns.pairplot(df, hue="species", size=2.5)
sns.plt.show()
This snippet raises an error:
Traceback (most recent call last):
File "pandas\index.pyx", line 154, in pandas.index.IndexEngine.get_loc (pandas\index.c:3687)
File "pandas\hashtable.pyx", line 381, in pandas.hashtable.Int64HashTable.get_item (pandas\hashtable.c:7192)
TypeError: an integer is required
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "file.py", line 69, in <module>
main()
File "file.py", line 64, in …
I am having a problem with seaborn in the Sublime Text editor.
import pandas as pd
import seaborn as sns
data = pd.read_csv('train.csv')
sns.factorplot('Sex', data=data)
This is the error:
Error
File "C:\Anaconda\lib\site-packages\numpy\core\_methods.py", line 73, in _mean
ret = ret / rcount
TypeError: unsupported operand type(s) for /: 'str' and 'long'
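The traceback suggests that the plot is trying to compute a mean over the string-valued Sex column. A minimal sketch, assuming the goal is simply to count rows per category; the small DataFrame below is a hypothetical stand-in for train.csv, and sns.countplot is used in place of factorplot:

```python
import pandas as pd
import seaborn as sns

# Hypothetical stand-in for train.csv; only the 'Sex' column matters here.
data = pd.DataFrame({"Sex": ["male", "female", "female", "male", "male"]})

# Count the rows in each category instead of averaging the strings.
ax = sns.countplot(x="Sex", data=data)

# The same numbers, computed directly with pandas.
counts = data["Sex"].value_counts()
```

With factorplot itself (renamed catplot in later seaborn releases), passing kind="count" should have a similar effect.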
Given the following DataFrame:
import pandas as pd
df = pd.DataFrame(
{'A':['A','B','C','D'],
'C':[12355.00,12555.67,640.00,7000]
})
df
A C
0 A 12355.00
1 B 12555.67
2 C 640.00
3 D 7000.00
I would like to convert the values to dollars in thousands, like this:
A C
0 A $12.3K
1 B $12.5K
2 C $0.6K
3 D $7.0K
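The second thing I need to do is somehow get these into a Seaborn heatmap, which only accepts floats and integers. See here for more on the heatmap side.
I assume that once the floats are converted to currency they will be of object dtype, but I hope there is a way around that.
Thanks in advance!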
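One possible approach is to leave the numeric column alone and build the currency strings separately, then hand them to the heatmap as annotations. A minimal sketch, assuming truncation (not rounding) to one decimal, since the desired output shows $12.3K for 12355.00; the C_label column name is made up for illustration:

```python
import numpy as np
import pandas as pd
import seaborn as sns

df = pd.DataFrame({"A": ["A", "B", "C", "D"],
                   "C": [12355.00, 12555.67, 640.00, 7000]})

# Truncate to one decimal place in thousands (12355.00 -> 12.3).
thousands = np.floor(df["C"] / 100) / 10

# Hypothetical display column; the numeric column C stays float.
df["C_label"] = thousands.map("${:.1f}K".format)

# The heatmap colors come from the floats, the cell text from the strings.
values = df.set_index("A")[["C"]]
labels = df[["C_label"]].to_numpy()
ax = sns.heatmap(values, annot=labels, fmt="")
```

Passing fmt="" tells heatmap to print the annotation strings as they are instead of applying a numeric format.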
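I am struggling to save several plots into the same PDF. The data source is a DataFrame. I considered saving them as PNG files, but that is not practical at all.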
pdf_pages = PdfPages('kde.pdf')
for i in range(1,114):
    for j in range(0,113):
        x=t1.iloc[:,i]
        y=t1.iloc[:,j]
        fig=sns.jointplot(x=x, y=y, kind="kde",dropna=True);
        pdf_pages.savefig(fig)
When I draw a seaborn barplot with a median estimator, I get the following error. It seems to work with other estimators (e.g. sum). What am I missing?
j = sns.barplot(x='category', y='sales', data=perf, estimator=median)
for item in j.get_xticklabels():
    item.set_rotation(90)
plt.show()
NameError: name 'median' is not defined
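The NameError indicates that no function named median exists in the current namespace (sum works because it is a Python builtin). A minimal sketch, assuming NumPy's median is the intended estimator; the perf DataFrame here is made-up sample data:

```python
import numpy as np
import pandas as pd
import seaborn as sns

# Made-up sample data standing in for perf.
perf = pd.DataFrame({"category": ["a", "a", "a", "b", "b", "b"],
                     "sales": [1, 2, 30, 4, 5, 60]})

# Refer to the function through its module: np.median.
ax = sns.barplot(x="category", y="sales", data=perf, estimator=np.median)
for item in ax.get_xticklabels():
    item.set_rotation(90)
```

Writing from numpy import median at the top of the script would make the original call work unchanged.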
Story
pandas, numpy, seaborn, and matplotlib are installed correctly on the system. I took this code example from http://seaborn.pydata.org/examples/many_pairwise_correlations.html
Snippet
from string import letters
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
sns.set(style="white")
# Generate a large random dataset
rs = np.random.RandomState(33)
d = pd.DataFrame(data=rs.normal(size=(100, 26)),
columns=list(letters[:26]))
# Compute the correlation matrix
corr = d.corr()
# Generate a mask for the upper triangle
mask = np.zeros_like(corr, dtype=np.bool)
mask[np.triu_indices_from(mask)] = True
# Set up the matplotlib figure
f, ax = plt.subplots(figsize=(11, 9))
# Generate a custom diverging colormap …
In a Jupyter notebook using Python, I am drawing a hexbin joint plot from two columns of a DataFrame. The plot is drawn correctly, but I cannot resize the figure.
Here is the code:
fig, ax = plt.subplots()
fig.set_size_inches(11.7, 8.27)
sns.jointplot(x=train['max1'], y=train['intangle'], kind="hex", color="#4CB391",ax=ax)
plt.show()
But I get: inner() got multiple values for argument 'ax'
I am new to Python, so please bear with me. I cannot visualize an Excel/CSV file using seaborn's lmplot. This code:
%matplotlib inline
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns
df=pd.read_csv("C:/Users/me/Documents/Jupyter Notebooks/Seaborn/Test.csv")
sns.set_style('whitegrid')
sns.lmplot(x=df["TestX"],y=df["TestY"], data=df)
gives me this error:
---------------------------------------------------------------------------
KeyError Traceback (most recent call last)
<ipython-input-41-00ad8b882663> in <module>()
----> 1 sns.lmplot(x=df["TestX"],y=df["TestY"], data=df)
~\Anaconda3\lib\site-packages\seaborn\regression.py in lmplot(x, y, data, hue, col, row, palette, col_wrap, size, aspect, markers, sharex, sharey, hue_order, col_order, row_order, legend, legend_out, x_estimator, x_bins, x_ci, scatter, fit_reg, ci, n_boot, units, order, logistic, lowess, robust, logx, x_partial, y_partial, truncate, x_jitter, y_jitter, scatter_kws, line_kws)
550 need_cols = …
I am trying to draw a distribution plot in Python using seaborn, but I get the following error, which I cannot resolve.
Code:
sns.distplot(df['nn'])
Error:
cannot convert float NaN to integer
Is there an efficient way to work around this when drawing the histogram and distribution plot?
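The Pareto chart is a very popular diagram in Excel and Tableau. In Excel we can draw a Pareto chart easily, but I have found no simple way to draw one in Python.
I have a pandas DataFrame like this: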
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
df = pd.DataFrame({'country': [177.0, 7.0, 4.0, 2.0, 2.0, 1.0, 1.0, 1.0]})
df.index = ['USA', 'Canada', 'Russia', 'UK', 'Belgium', 'Mexico', 'Germany', 'Denmark']
print(df)
country
USA 177.0
Canada 7.0
Russia 4.0
UK 2.0
Belgium 2.0
Mexico 1.0
Germany 1.0
Denmark 1.0
How can I draw a Pareto chart, using pandas, seaborn, matplotlib, etc.?
So far I have been able to make a descending bar chart, but I still need to put the cumulative-sum line plot on top of it.
My attempt:
df.sort_values(by='country',ascending=False).plot.bar()
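One way to add the cumulative line is a second y-axis that shares the x-axis with the bars. A minimal sketch using only pandas and matplotlib; the cum_pct column name and the styling choices are illustrative assumptions:

```python
import pandas as pd
import matplotlib.pyplot as plt
from matplotlib.ticker import PercentFormatter

df = pd.DataFrame({'country': [177.0, 7.0, 4.0, 2.0, 2.0, 1.0, 1.0, 1.0]})
df.index = ['USA', 'Canada', 'Russia', 'UK', 'Belgium', 'Mexico', 'Germany', 'Denmark']

# Sort descending, then compute the cumulative share in percent.
df = df.sort_values(by='country', ascending=False)
df['cum_pct'] = df['country'].cumsum() / df['country'].sum() * 100

fig, ax = plt.subplots()
ax.bar(df.index, df['country'], color='C0')  # bars on the left axis

ax2 = ax.twinx()  # second y-axis sharing the same x-axis
ax2.plot(df.index, df['cum_pct'], color='C1', marker='o')
ax2.yaxis.set_major_formatter(PercentFormatter())
ax2.set_ylim(0, 105)

ax.tick_params(axis='x', labelrotation=45)
```

The bars keep their raw counts on the left axis, while the line reads as a percentage of the total on the right axis.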