我正在做一些聚类,结果是这样的字符串列表:
['5-3-2', '5-3-2', '4-3-2-1', ...]
Run Code Online (Sandbox Code Playgroud)
我想根据字符串的频率绘制条形图。是否有捷径可寻?我想我可以识别列表中的唯一元素并计算它们,但也许有更舒适的解决方案?
编辑:更多信息
import matplotlib.pyplot as plt
import numpy as np
import math as math
import Utils as ut
from sklearn.cluster import KMeans
from itertools import cycle
...
result = np.array(result)
keys, counts = np.unique(result, return_counts=True)
print('Keys: ', keys)
print('Counts: ', counts)
print(result)
plt.bar(keys,counts)
plt.show
Run Code Online (Sandbox Code Playgroud)
输出:
Keys: ['3-1-4-2' '3-2-3-2' '3-3-2-2' '4-2-2-2' '4-2-3-1' '4-4-2']
Counts: [ 21 154 23 1 48 4]
Run Code Online (Sandbox Code Playgroud)
编辑 2:绘图显示在调试模式下,断点打开plt.show,当我跨过它时它消失了。所以它在运行模式下不可见。有什么建议?
np.unique 可以返回列表中唯一元素的计数。
keys, counts = np.unique(x, return_counts=True)
Run Code Online (Sandbox Code Playgroud)
然后您可以将它们绘制为条形图。
import matplotlib.pyplot as plt
import numpy as np
x = ['5-3-2', '5-3-2', '4-3-2', "2-3-2", '4-3-2', '4-3-2', "1-2-4"]
keys, counts = np.unique(x, return_counts=True)
plt.bar(keys, counts)
plt.show()
Run Code Online (Sandbox Code Playgroud)