我试图将数据帧转换为嵌套字典但到目前为止没有成功.
数据帧: clean_data['Model', 'Problem', 'Size']
这是我的数据的样子:
Model Problem Size
lenovo a6020 screen broken 1
lenovo a6020a40 battery 60
bluetooth 60
buttons 60
lenovo k4 wi-fi 3
bluetooth 3
Run Code Online (Sandbox Code Playgroud)
我想要的输出:
{
"name": "Brand",
"children": [
{
"name": "Lenovo",
"children": [
{
"name": "lenovo a6020",
"children": {
"name": "screen broken",
"size": 1
}
},
{
"name": "lenovo a6020a40",
"children": [
{
"name": "battery",
"size": 60
},
{
"name": "bluetooth",
"size": 60
},
{
"name": "buttons",
"size": 60
}
]
},
{
"name": …Run Code Online (Sandbox Code Playgroud) 我想为分类问题创建合成数据。我使用make_classification的方法sklearn.datasets。我希望数据在特定范围内,比方说[80, 155],但它正在生成负数。
我尝试了很多scale和class_sep参数的组合,但没有得到想要的输出。
import pandas as pd
from sklearn.datasets import make_classification
weight = [0.2, 0.37, 0.21, 0.04, 0.11, 0.05, 0.02]
X, y = make_classification(n_samples=100, n_features=3,
n_informative=3, n_redundant=0, n_repeated=0,
n_classes=7, n_clusters_per_class=1, weights=weight,
class_sep=1,shuffle=True, random_state=41, scale= 1)
pd.DataFrame(X).describe()
Run Code Online (Sandbox Code Playgroud)

输出应该在特定范围内,但它挑选出标准偏差约为 1.33 的随机值。