Roh*_*oor 4 python regression machine-learning matplotlib
我正在实现简单的多项式回归来预测给定视频大小的视频时间,这是我自己的数据集。现在由于某种原因,我的情节得到了多个痕迹。
# Importing the libraries
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
# Importing the dataset
dataset = pd.read_csv('estSize.csv')
X = dataset.iloc[:, 0].values.reshape(-1,1)
y = dataset.iloc[:, 1].values.reshape(-1,1)
from sklearn.linear_model import LinearRegression
# Fitting Polynomial Regression to the dataset
from sklearn.preprocessing import PolynomialFeatures
poly_reg = PolynomialFeatures(degree = 2)
X_poly = poly_reg.fit_transform(X)
poly_reg.fit(X_poly, y)
lin_reg_2 = LinearRegression()
lin_reg_2.fit(X_poly, y)
# Visualising the Polynomial Regression results
plt.scatter(X, y, color = 'red')
plt.plot(X, lin_reg_2.predict(poly_reg.fit_transform(X)), color = 'blue')
plt.show()
Run Code Online (Sandbox Code Playgroud)
您的数据需要根据预测变量进行排序。
行后
dataset = pd.read_csv('estSize.csv')
Run Code Online (Sandbox Code Playgroud)
添加这一行:
dataset = dataset.sort_values(by=['col1'])
Run Code Online (Sandbox Code Playgroud)
col1文件大小值的列标题在哪里。