Pyspark 中这个操作的等价物是什么?
import pandas as pd
import numpy as np
df = pd.DataFrame({'Type':list('ABBC'), 'Set':list('ZZXY')})
df['color'] = np.where(df['Set']=='Z', 'green', 'red')
print(df)
Run Code Online (Sandbox Code Playgroud)
输出
Set Type color
0 Z A green
1 Z B green
2 X B red
3 Y C red
Run Code Online (Sandbox Code Playgroud) 我必须将数据框保存到Pickle文件中,但是会返回错误
df.saveAsPickleFile(path)
Run Code Online (Sandbox Code Playgroud)
AttributeError:“ Dataframe”对象没有属性“ saveAsPickleFile”