Getting indices when using train_test_split in scikit-learn

Question

the*_*ala 12 python-2.7 scikit-learn

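To split my data into train and test sets, I am using the sklearn.cross_validation.train_test_split function.

When I pass my data and labels to this function as lists of lists, it returns the train and test data in two separate lists.

I want to get the indices of the train and test elements in the original data list.

Can anyone help me with this?

Thanks in advance.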

Answer 1

Chr*_*sch 23

You can pass a vector of indices as an additional argument. Using the example from the sklearn documentation:

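import numpy as np
# sklearn.cross_validation was removed in scikit-learn 0.20; newer releases
# provide train_test_split in sklearn.model_selection.
from sklearn.cross_validation import train_test_split

# 5 samples with 2 features each, 5 labels, and one index per sample
X, y, indices = (0.1 * np.arange(10)).reshape((5, 2)), range(10, 15), range(5)
X_train, X_test, y_train, y_test, indices_train, indices_test = train_test_split(
    X, y, indices, test_size=0.33, random_state=42)
indices_train, indices_test
# ([2, 0, 3], [1, 4])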
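
On newer scikit-learn releases, where train_test_split is imported from sklearn.model_selection, the same idea could be sketched roughly as below. This is only a minimal sketch and not part of the original answer: the names idx_train and idx_test, and the choice of np.arange to build the index array, are illustrative assumptions.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data: 5 samples with 2 features each, plus one label per sample.
X = (0.1 * np.arange(10)).reshape((5, 2))
y = np.arange(10, 15)

# One position per row of X; this array is split together with X and y.
indices = np.arange(len(X))

X_train, X_test, y_train, y_test, idx_train, idx_test = train_test_split(
    X, y, indices, test_size=0.33, random_state=42
)

# Each returned index points back at the row it came from in the original data.
assert np.array_equal(X[idx_train], X_train)
assert np.array_equal(y[idx_test], y_test)
print(idx_train, idx_test)
```

Because every array passed to train_test_split is shuffled with the same permutation, indexing the original X or y with idx_train or idx_test recovers exactly the rows that ended up in each split.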
