我正在使用 SMOTE 重新采样我的数据(多类)。
sm = SMOTE(random_state=1)
X_res, Y_res = sm.fit_resample(X_train, Y_train)
Run Code Online (Sandbox Code Playgroud)
但是,我收到此属性错误。任何人都可以帮忙吗?
我有一个多列的表。对于其中一列AC,我需要将其替换为Result AC,如下表所示
该函数的伪代码如下:
For each unique address:
If
# of unique(AC) < Value in column B AND
len(Top 1 (unique(AC))) != len(Top 2 (unique(AC))):
return mode(air_conditioning)
Else
For each unique(AC), calculate sum(area)
If Top 1 (sum(area)) > Top 2 (SUM(area)):
return air_conditioning which has Top 1 (SUM(building_area))
Else
For each unique(AC), SUM(value)
return air_conditioning with Top 1 (SUM(value))
Run Code Online (Sandbox Code Playgroud)