IntelliJ Idea 14.1 中 Spark 1.4.1 的 winutils 二进制文件

Gis*_*gen 0 scala intellij-idea maven apache-spark

我试图让 Spark 1.4.1 与 IntelliJ Idea 14.1 中的 Scala 2.11.7 一起工作,但我不断收到此错误:

ERROR Shell: Failed to locate the winutils binary in the hadoop binary path
java.io.IOException: Could not locate executable null\bin\winutils.exe in the `Hadoop binaries.`
Run Code Online (Sandbox Code Playgroud)

有人知道我需要下载哪些二进制文件吗?

我来自 spark 的 pom.xml 是这个(开始):

    <project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
  <modelVersion>4.0.0</modelVersion>
  <parent>
    <groupId>org.apache</groupId>
    <artifactId>apache</artifactId>
    <version>14</version>
  </parent>
  <groupId>org.apache.spark</groupId>
  <artifactId>spark-parent_2.10</artifactId>
  <version>1.4.1</version>
  <packaging>pom</packaging>
  <name>Spark Project Parent POM</name>
  <url>http://spark.apache.org/</url>
  <licenses>
    <license>
      <name>Apache 2.0 License</name>
      <url>http://www.apache.org/licenses/LICENSE-2.0.html</url>
      <distribution>repo</distribution>
    </license>
  </licenses>
  <scm>
    <connection>scm:git:git@github.com:apache/spark.git</connection>
    <developerConnection>scm:git:https://git-wip-us.apache.org/repos/asf/spark.git</developerConnection>
    <url>scm:git:git@github.com:apache/spark.git</url>
    <tag>HEAD</tag>
  </scm>
  <developers>
    <developer>
      <id>matei</id>
      <name>Matei Zaharia</name>
      <email>matei.zaharia@gmail.com</email>
      <url>http://www.cs.berkeley.edu/~matei</url>
      <organization>Apache Software Foundation</organization>
      <organizationUrl>http://spark.apache.org</organizationUrl>
    </developer>
  </developers>
  <issueManagement>
    <system>JIRA</system>
    <url>https://issues.apache.org/jira/browse/SPARK</url>
  </issueManagement>

  <prerequisites>
    <maven>3.0.4</maven>
  </prerequisites>
Run Code Online (Sandbox Code Playgroud)

降级到 Scala 2.10.5 是否更好?

Tob*_*obi 5

我有完全相同的设置。看一下

它可以从 HortonWorks 服务器下载。将 放在winutils.exe某个文件夹中,并在您的 Scala 程序中按以下方式引用它:

System.setProperty("hadoop.home.dir", "D:\\YOUR_PATH_TO_WINUTILS")
Run Code Online (Sandbox Code Playgroud)