我有一个RDD[String]包含以下数据:
数据格式: ('Movie Name','Actress Name')
('Night of the Demons (2009) (uncredited)', '"Steff", Stefanie Oxmann Mcgaha')
('The Bad Lieutenant: Port of Call - New Orleans (2009) (uncredited)', '"Steff", Stefanie Oxmann Mcgaha')
('"Please Like Me" (2013) {All You Can Eat (#1.4)}', '$haniqua')
('"Please Like Me" (2013) {French Toast (#1.2)}', '$haniqua')
('"Please Like Me" (2013) {Horrible Sandwiches (#1.6)}', '$haniqua')
Run Code Online (Sandbox Code Playgroud)
我希望将其转换RDD[String,String]为第一个元素,这' '将是我在RDD中的第一个字符串,其中的第二个元素' '将是我在RDD中的第二个字符串.
我试过这个:
val rdd1 = sc.textFile("/home/user1/Documents/TestingScala/actress"
val splitRdd = rdd1.map( line => line.split(",") ) …Run Code Online (Sandbox Code Playgroud)