小编Ant*_*lli的帖子

必须用Hive构建Spark(spark 1.5.0)

下载spark 1.5.0预建并通过pyspark运行这个简单的代码

from pyspark.sql import Row
l = [('Alice', 1)]
sqlContext.createDataFrame(l).collect
Run Code Online (Sandbox Code Playgroud)

产量误差:

15/09/30 06:48:48 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MResourceUri" is tagged as "embedded-only" so do
es not have its own datastore table.
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "c:\bigdata\spark-1.5\spark-1.5.0\python\pyspark\sql\context.py", line 408, in createDataFrame
    jdf = self._ssql_ctx.applySchemaToPythonRDD(jrdd.rdd(), schema.json())
  File "c:\bigdata\spark-1.5\spark-1.5.0\python\pyspark\sql\context.py", line 660, in _ssql_ctx
    "build/sbt assembly", e)
Exception: ("You must build Spark with Hive. Export 'SPARK_HIVE=true' and run build/sbt assembly", Py4JJavaError(u'An error occurred
 while calling …
Run Code Online (Sandbox Code Playgroud)

python hive maven apache-spark spark-dataframe

7
推荐指数
1
解决办法
2608
查看次数

标签 统计

apache-spark ×1

hive ×1

maven ×1

python ×1

spark-dataframe ×1