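I have a SparkR DataFrame, and I want to get the mode (the most common `value`) for each unique `name`. How can I do this? There doesn't seem to be a built-in `mode` function. Either a SparkR or a PySpark solution would be fine.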
# Create DF
df <- data.frame(name = c("Thomas", "Thomas", "Thomas", "Bill", "Bill", "Bill"),
                 value = c(5, 5, 4, 3, 3, 7))
DF <- createDataFrame(df)
name   | value
-----------------
Thomas | 5
Thomas | 5
Thomas | 4
Bill   | 3
Bill   | 3
Bill   | 7
# What I want to get
name   | mode(value)
-----------------
Thomas | 5
Bill   | 3