Replacing local-disk HDFS with S3 fails with an error (org.apache.hadoop.service.AbstractService)

Vik*_*dia 7 hadoop amazon-s3 hdfs

We are trying to set up Cloudera 5.5 so that HDFS works on S3 only. To that end, we have already configured the necessary properties in core-site.xml:

<property>
    <name>fs.s3a.access.key</name>
    <value>################</value>
</property>
<property>
    <name>fs.s3a.secret.key</name>
    <value>###############</value>
</property>
<property>
    <name>fs.default.name</name>
    <value>s3a://bucket_Name</value>
</property>
<property>
    <name>fs.defaultFS</name>
    <value>s3a://bucket_Name</value>
</property>

After setting this up, we were able to browse the files in the S3 bucket with the command:

hadoop fs -ls /

It shows only the files available on S3.

However, when we start the YARN services, the JobHistory server fails to start with the error below, and we get the same error when launching Pig jobs:

PriviledgedActionException as:mapred (auth:SIMPLE) cause:org.apache.hadoop.fs.UnsupportedFileSystemException: No AbstractFileSystem for scheme: s3a
ERROR   org.apache.hadoop.mapreduce.v2.jobhistory.JobHistoryUtils   
Unable to create default file context [s3a://kyvosps]
org.apache.hadoop.fs.UnsupportedFileSystemException: No AbstractFileSystem for scheme: s3a
    at org.apache.hadoop.fs.AbstractFileSystem.createFileSystem(AbstractFileSystem.java:154)
    at org.apache.hadoop.fs.AbstractFileSystem.get(AbstractFileSystem.java:242)
    at org.apache.hadoop.fs.FileContext$2.run(FileContext.java:337)
    at org.apache.hadoop.fs.FileContext$2.run(FileContext.java:334)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)

While searching the Internet, we found that we need to set the following properties in core-site.xml:

<property>
  <name>fs.s3a.impl</name>
  <value>org.apache.hadoop.fs.s3a.S3AFileSystem</value>
  <description>The implementation class of the S3A Filesystem</description>
</property>
<property>
    <name>fs.AbstractFileSystem.s3a.impl</name>
    <value>org.apache.hadoop.fs.s3a.S3AFileSystem</value>
    <description>The FileSystem for  S3A Filesystem</description>
</property>

After setting the properties above, we get the following error:

org.apache.hadoop.service.AbstractService   
Service org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager failed in state INITED; cause: java.lang.RuntimeException: java.lang.NoSuchMethodException: org.apache.hadoop.fs.s3a.S3AFileSystem.<init>(java.net.URI, org.apache.hadoop.conf.Configuration)
java.lang.RuntimeException: java.lang.NoSuchMethodException: org.apache.hadoop.fs.s3a.S3AFileSystem.<init>(java.net.URI, org.apache.hadoop.conf.Configuration)
    at org.apache.hadoop.fs.AbstractFileSystem.newInstance(AbstractFileSystem.java:131)
    at org.apache.hadoop.fs.AbstractFileSystem.createFileSystem(AbstractFileSystem.java:157)
    at org.apache.hadoop.fs.AbstractFileSystem.get(AbstractFileSystem.java:242)
    at org.apache.hadoop.fs.FileContext$2.run(FileContext.java:337)
    at org.apache.hadoop.fs.FileContext$2.run(FileContext.java:334)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671)
    at org.apache.hadoop.fs.FileContext.getAbstractFileSystem(FileContext.java:334)
    at org.apache.hadoop.fs.FileContext.getFileContext(FileContext.java:451)
    at org.apache.hadoop.fs.FileContext.getFileContext(FileContext.java:473)
    at org.apache.hadoop.mapreduce.v2.jobhistory.JobHistoryUtils.getDefaultFileContext(JobHistoryUtils.java:247)

The required JARs are in place, but we still get the error. Any help would be great. Thanks in advance.

Update

I tried removing the property fs.AbstractFileSystem.s3a.impl, but that gives me the same first exception I was getting before:

org.apache.hadoop.security.UserGroupInformation 
PriviledgedActionException as:mapred (auth:SIMPLE) cause:org.apache.hadoop.fs.UnsupportedFileSystemException: No AbstractFileSystem for scheme: s3a
ERROR   org.apache.hadoop.mapreduce.v2.jobhistory.JobHistoryUtils   
Unable to create default file context [s3a://bucket_name]
org.apache.hadoop.fs.UnsupportedFileSystemException: No AbstractFileSystem for scheme: s3a
    at org.apache.hadoop.fs.AbstractFileSystem.createFileSystem(AbstractFileSystem.java:154)
    at org.apache.hadoop.fs.AbstractFileSystem.get(AbstractFileSystem.java:242)
    at org.apache.hadoop.fs.FileContext$2.run(FileContext.java:337)
    at org.apache.hadoop.fs.FileContext$2.run(FileContext.java:334)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671)
    at org.apache.hadoop.fs.FileContext.getAbstractFileSystem(FileContext.java:334)
    at org.apache.hadoop.fs.FileContext.getFileContext(FileContext.java:451)
    at org.apache.hadoop.fs.FileContext.getFileContext(FileContext.java:473)

Man*_*lur 6

The problem is not the location of the JARs.

The problem is with this setting:

<property>
    <name>fs.AbstractFileSystem.s3a.impl</name>
    <value>org.apache.hadoop.fs.s3a.S3AFileSystem</value>
    <description>The FileSystem for  S3A Filesystem</description>
</property>

This setting is not needed. Because of it, Hadoop searches the S3AFileSystem class for the following constructor, and no such constructor exists:

S3AFileSystem(URI theUri, Configuration conf);

The following exception clearly says that it cannot find a constructor for S3AFileSystem with URI and Configuration parameters:

java.lang.RuntimeException: java.lang.NoSuchMethodException: org.apache.hadoop.fs.s3a.S3AFileSystem.<init>(java.net.URI, org.apache.hadoop.conf.Configuration)
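The constructor lookup behind that exception can be reproduced without Hadoop. The sketch below is only an illustration: Conf, FsStyle and AfsStyle are hypothetical stand-ins (not Hadoop classes) for Configuration, a FileSystem-style class that offers only a no-argument constructor, and an AbstractFileSystem-style class that offers a (URI, Configuration) constructor.

```java
import java.net.URI;

class ConstructorLookupSketch {
    /** Hypothetical stand-in for org.apache.hadoop.conf.Configuration. */
    static class Conf {}

    /** Hypothetical FileSystem-style class: only a no-argument constructor. */
    static class FsStyle {
        public FsStyle() {}
    }

    /** Hypothetical AbstractFileSystem-style class: has a (URI, Conf) constructor. */
    static class AfsStyle {
        public AfsStyle(URI uri, Conf conf) {}
    }

    /** Returns true if clazz declares a constructor taking (URI, Conf). */
    static boolean hasUriConfConstructor(Class<?> clazz) {
        try {
            clazz.getDeclaredConstructor(URI.class, Conf.class);
            return true;
        } catch (NoSuchMethodException e) {
            // Same exception type as the one wrapped in the stack trace above.
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("FsStyle has (URI, Conf) constructor:  "
                + hasUriConfConstructor(FsStyle.class));
        System.out.println("AfsStyle has (URI, Conf) constructor: "
                + hasUriConfConstructor(AfsStyle.class));
    }
}
```

Under these assumptions the lookup fails for the FileSystem-style class and succeeds for the AbstractFileSystem-style one, which mirrors what happens when fs.AbstractFileSystem.s3a.impl points at S3AFileSystem.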

To resolve this, remove the fs.AbstractFileSystem.s3a.impl setting from core-site.xml. Having just the fs.s3a.impl setting in core-site.xml should solve your problem.

EDIT: org.apache.hadoop.fs.s3a.S3AFileSystem is only a FileSystem implementation.

Hence, you cannot set the value of fs.AbstractFileSystem.s3a.impl to org.apache.hadoop.fs.s3a.S3AFileSystem, because org.apache.hadoop.fs.s3a.S3AFileSystem does not implement AbstractFileSystem.

I am using Hadoop 2.7.0, and in this version s3a is not exposed as an AbstractFileSystem.

There is a JIRA ticket, https://issues.apache.org/jira/browse/HADOOP-11262, to implement this, and the fix is available in Hadoop 2.8.0.
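As a rough illustration of why one API can work while the other fails: as far as I know, the FileSystem API and the FileContext/AbstractFileSystem API resolve a scheme through two separate configuration keys. The sketch below uses a plain Map as a hypothetical stand-in for the Hadoop Configuration; the helper names are made up for this example, and only the two key patterns come from the settings discussed above.

```java
import java.util.HashMap;
import java.util.Map;

class SchemeKeySketch {
    /** Key pattern for the FileSystem implementation of a scheme. */
    static String fileSystemKey(String scheme) {
        return "fs." + scheme + ".impl";
    }

    /** Key pattern for the AbstractFileSystem (FileContext) binding of a scheme. */
    static String abstractFileSystemKey(String scheme) {
        return "fs.AbstractFileSystem." + scheme + ".impl";
    }

    public static void main(String[] args) {
        // Hypothetical stand-in for core-site.xml with only fs.s3a.impl set.
        Map<String, String> conf = new HashMap<>();
        conf.put(fileSystemKey("s3a"), "org.apache.hadoop.fs.s3a.S3AFileSystem");

        String fsKey = fileSystemKey("s3a");
        String afsKey = abstractFileSystemKey("s3a");
        System.out.println(fsKey + " configured: " + conf.containsKey(fsKey));
        System.out.println(afsKey + " configured: " + conf.containsKey(afsKey));
    }
}
```

With only fs.s3a.impl present, the second key stays unresolved, which is the situation in which FileContext users such as the JobHistory server report "No AbstractFileSystem for scheme: s3a".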

Assuming your JAR exposes s3a as an AbstractFileSystem, you need to set the following for fs.AbstractFileSystem.s3a.impl:

<property>
    <name>fs.AbstractFileSystem.s3a.impl</name>
    <value>org.apache.hadoop.fs.s3a.S3A</value>
</property>

This will solve your problem.
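One way to check whether the JARs on a node expose the AbstractFileSystem binding is to probe the classpath for both class names mentioned above. This is a minimal sketch, not part of Hadoop; the class name S3aBindingCheck and the helper isOnClasspath are made up for this example.

```java
class S3aBindingCheck {
    /** Returns true if the named class can be loaded from this class's classpath. */
    static boolean isOnClasspath(String className) {
        try {
            // initialize=false: only locate the class, do not run static initializers.
            Class.forName(className, false, S3aBindingCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String[] names = {
            "org.apache.hadoop.fs.s3a.S3AFileSystem", // FileSystem implementation
            "org.apache.hadoop.fs.s3a.S3A"            // AbstractFileSystem binding
        };
        for (String name : names) {
            System.out.println(name + " on classpath: " + isOnClasspath(name));
        }
    }
}
```

It needs to run with the cluster's Hadoop JARs on the classpath, for example `java -cp "$(hadoop classpath):." S3aBindingCheck`. If the second class is reported as missing, setting fs.AbstractFileSystem.s3a.impl to org.apache.hadoop.fs.s3a.S3A would presumably not help on that installation.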