当我尝试打印RDD的内容时,会打印下面显示的内容,如何打印内容?谢谢!
scala> lines
res15: org.apache.spark.rdd.RDD[Array[String]] = MapPartitionsRDD[3] at filter at <console>:23
scala> lines.take(5).foreach(println)
[Ljava.lang.String;@6d3db5d1
[Ljava.lang.String;@6e6be45e
[Ljava.lang.String;@6d5e0ff4
[Ljava.lang.String;@3a699444
[Ljava.lang.String;@69851a51
Run Code Online (Sandbox Code Playgroud) 我有一个有2个节点的火花簇,master(172.17.0.229)和slave(172.17.0.228).我已经编辑spark-env.sh,添加SPARK_MASTER_IP=127.17.0.229和奴隶,补充说172.17.0.228.
我正在使用start-master.sh和从节点启动我的主节点start-slaves.sh.
我可以看到webUI的主节点没有worker,但是worker节点的日志如下:
Spark Command: /usr/lib/jvm/java-7-oracle/jre/bin/java -cp /usr/local/src/spark-1.5.2-bin-hadoop2.6/sbin/../conf/:/usr/local/src/spark-1.5.2-bin-hadoop$
========================================
Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
15/12/18 14:17:25 INFO Worker: Registered signal handlers for [TERM, HUP, INT]
15/12/18 14:17:26 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
15/12/18 14:17:26 INFO SecurityManager: Changing view acls to: ujjwal
15/12/18 14:17:26 INFO SecurityManager: Changing modify acls to: ujjwal
15/12/18 14:17:26 INFO SecurityManager: SecurityManager: …Run Code Online (Sandbox Code Playgroud)