Blaze with scikit-learn K-Means

sac*_*ena 10 python scikit-learn blaze

I am trying to fit a Blaze data object with scikit-learn's KMeans function.

from blaze import *
from sklearn.cluster import KMeans
data_numeric = Data('data.csv')
data_cluster = KMeans(n_clusters=5)
data_cluster.fit(data_numeric)

Data sample:

A  B  C
1  32 34
5  57 92
89 67 21

It throws an error:

[Image: screenshot of the error message]

I have been able to do this with a pandas DataFrame. Is there a way to feed a Blaze object to this function?

abe*_*ger 5

I think you need to convert your pandas DataFrame to a NumPy array before fitting.

from blaze import *
import numpy

from sklearn.cluster import KMeans
data_numeric = numpy.array(Data('data.csv'))  # same Data constructor as in the question
data_cluster = KMeans(n_clusters=5)
data_cluster.fit(data_numeric)
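To illustrate the conversion step on its own, here is a minimal sketch that does not depend on Blaze. It rebuilds the three sample rows from the question as an in-memory CSV (a stand-in for data.csv), loads them with pandas, converts the result to a plain NumPy array, and fits KMeans on it. The variable names, n_clusters=2, n_init=10, and random_state=0 are assumptions chosen for the example, not part of the original post.

```python
import io

import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

# Stand-in for data.csv: the three sample rows shown in the question.
csv_text = "A,B,C\n1,32,34\n5,57,92\n89,67,21\n"
frame = pd.read_csv(io.StringIO(csv_text))

# Convert the table to a plain 2-D float array before handing it to KMeans.
features = np.asarray(frame, dtype=float)

# Assumption: n_clusters=2, because the sample only has three rows and
# KMeans needs at least as many rows as clusters.
model = KMeans(n_clusters=2, n_init=10, random_state=0)
model.fit(features)

print(features.shape)                 # rows x columns of the input array
print(model.labels_)                  # cluster index assigned to each row
print(model.cluster_centers_.shape)   # one center per cluster
```

With only the three sample rows, n_clusters=5 would be rejected by KMeans, since it requires at least as many samples as clusters, so the full data.csv needs five or more rows for the original call to work.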