我每次都会收到这个错误......我用的是苏打水...我的conf-file:
***"spark.driver.memory 65g
spark.python.worker.memory 65g
spark.master local[*]"***
Run Code Online (Sandbox Code Playgroud)
数据量约为5 Gb.没有关于此错误的其他信息......有人知道它为什么会发生吗?谢谢!
***"ERROR:py4j.java_gateway:Error while sending or receiving.
Traceback (most recent call last):
File "/data/analytics/Spark1.6.1/python/lib/py4j-0.9-src.zip/py4j/java_gateway.py", line 746, in send_command
raise Py4JError("Answer from Java side is empty")
Py4JError: Answer from Java side is empty
ERROR:py4j.java_gateway:An error occurred while trying to connect to the Java server
Traceback (most recent call last):
File "/data/analytics/Spark1.6.1/python/lib/py4j-0.9-src.zip/py4j/java_gateway.py", line 690, in start
self.socket.connect((self.address, self.port))
File "/usr/local/anaconda/lib/python2.7/socket.py", line 228, in meth
return getattr(self._sock,name)(*args)
error: [Errno 111] Connection refused
ERROR:py4j.java_gateway:An error occurred while …Run Code Online (Sandbox Code Playgroud) 我使用 Sparklung Water,并且正在从镶木地板文件中读取数据。
我的spark-default.conf的部分:
`spark.serializer org.apache.spark.serializer.KryoSerializer spark.kryoserializer.buffer.max 1g
spark.driver.memory 40g
spark.executor.memory 40g
spark.driver.maxResultSize 0
spark.python.worker.memory 30g
spark.executor.extraJavaOptions -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -XX:+PrintTenuringDistribution
spark.storage.safetyFraction 0.9
spark.storage.memoryFraction 0.0
Run Code Online (Sandbox Code Playgroud)
`
15/11/26 11:44:46 WARN MemoryStore: Not enough space to cache rdd_7_472 in memory! (computed 3.2 MB so far)
15/11/26 11:44:46 WARN ParquetRecordReader: Can not initialize counter due to context is not a instance of TaskInputOutputContext, but is org.apache.hadoop.mapreduce.task.TaskAttemptContextImpl
15/11/26 11:44:46 WARN ParquetRecordReader: Can not initialize counter due to context is not a instance of TaskInputOutputContext, but …Run Code Online (Sandbox Code Playgroud) apache-spark ×2