小编use*_*508的帖子

Scala RDD [String]到RDD [String,String]

我有一个RDD[String]包含以下数据:

数据格式: ('Movie Name','Actress Name')

('Night of the Demons (2009)  (uncredited)', '"Steff", Stefanie Oxmann Mcgaha')
('The Bad Lieutenant: Port of Call - New Orleans (2009)  (uncredited)', '"Steff", Stefanie Oxmann Mcgaha') 
('"Please Like Me" (2013) {All You Can Eat (#1.4)}', '$haniqua') 
('"Please Like Me" (2013) {French Toast (#1.2)}', '$haniqua') 
('"Please Like Me" (2013) {Horrible Sandwiches (#1.6)}', '$haniqua')
Run Code Online (Sandbox Code Playgroud)

我希望将其转换RDD[String,String]为第一个元素,这' '将是我在RDD中的第一个字符串,其中的第二个元素' '将是我在RDD中的第二个字符串.

我试过这个:

val rdd1 = sc.textFile("/home/user1/Documents/TestingScala/actress"
val splitRdd = rdd1.map( line => line.split(",") ) …
Run Code Online (Sandbox Code Playgroud)

dictionary scala scala-collections apache-spark rdd

1
推荐指数
1
解决办法
1563
查看次数

标签 统计

apache-spark ×1

dictionary ×1

rdd ×1

scala ×1

scala-collections ×1