Edu*_*scu 6 python python-2.7 pandas python-ggplot
我尝试使用ggplot for python我有以下数据:
power_data = [[ 4.13877565e+04, 2.34652000e-01],
[ 4.13877565e+04, 2.36125000e-01],
[ 4.13877565e+04, 2.34772000e-01],
...
[ 4.13882896e+04, 2.29006000e-01],
[ 4.13882896e+04, 2.29019000e-01],
[ 4.13882896e+04, 2.28404000e-01]]
Run Code Online (Sandbox Code Playgroud)
我想用ggplot代表它:
print ggplot(aes(x='TIME', y='Watts'), data=power_data) + \
geom_point(color='lightblue') + \
geom_line(alpha=0.25) + \
stat_smooth(span=.05, color='black') + \
ggtitle("Power comnsuption over 13 hours") + \
xlab("Time") + \
ylab("Watts")
Run Code Online (Sandbox Code Playgroud)
但得到错误:
File "C:\PYTHON27\lib\site-packages\ggplot\ggplot.py", line 59, in __init__
for ae, name in self.aesthetics.iteritems():
AttributeError: 'list' object has no attribute 'iteritems'
>>>
Run Code Online (Sandbox Code Playgroud)
我不知道aes(x='TIME', y='Watts')应该做什么.
如何格式化power_data列表以便我可以将它与ggplot一起使用,我想x在电源y轴上的时间轴和第二列表示第一列?
如果我尝试这个meat例子它没有显示任何它只显示
>>> print (ggplot(aes(x='date', y='beef'), data=meat) + \
... geom_line())
<ggplot: (20096197)>
>>>
Run Code Online (Sandbox Code Playgroud)
我该怎么做才能进一步展示图形?
我错过了3个重要步骤:
1)首先,数据需要采用以下格式:
[{'TIME': 41387.756495162001, 'Watts': 0.234652},
{'TIME': 41387.756500821, 'Watts': 0.236125},
{'TIME': 41387.756506480997, 'Watts': 0.23477200000000001},
{'TIME': 41387.756512141001, 'Watts': 0.23453099999999999},
...
{'TIME': 41387.756574386003, 'Watts': 0.23558699999999999},
{'TIME': 41387.756580046, 'Watts': 0.23508899999999999},
{'TIME': 41387.756585706004, 'Watts': 0.235041},
{'TIME': 41387.756591365003, 'Watts': 0.23541200000000001},
{'TIME': 41387.756597013002, 'Watts': 0.23461699999999999},
{'TIME': 41387.756602672998, 'Watts': 0.23483899999999999}]
Run Code Online (Sandbox Code Playgroud)
2)然后,数据需要被饰以DataFrame与
powd = DataFrame(data2)
Run Code Online (Sandbox Code Playgroud)
3)没有plt.show(1)情节将不会显示
以下是解决上述问题的代码:
from pandas import DataFrame
data2 = []
for i in range(0,len(power_data)):
data2.append({'TIME': power_data[i][0], 'Watts': power_data[i][1]})
powd = DataFrame(data2)
print powd
# the above can be changed with this line:
# powd = DataFrame(power_data, columns=['TIME', 'Watts'])
# see sugestion in comments
print ggplot(aes(x='TIME', y='Watts'), data=powd) + \
geom_point(color='lightblue') + \
geom_line(alpha=0.25) + \
stat_smooth(span=.05, color='black') + \
ggtitle("Power comnsuption over 13 hours") + \
xlab("Time") + \
ylab("Watts")
Run Code Online (Sandbox Code Playgroud)
或者在没有评论中for提供的一次通过中:
powd = DataFrame(power_data, columns=['TIME', 'Watts'])
print ggplot(aes(x='TIME', y='Watts'), data=powd) + \
geom_point(color='lightblue') + \
geom_line(alpha=0.25) + \
stat_smooth(span=.05, color='black') + \
ggtitle("Power comnsuption over 13 hours") + \
xlab("Time") + \
ylab("Watts")
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
3978 次 |
| 最近记录: |