问题: 我们正在尝试在集群的特定主机上运行少量命令.我们选择了SSH Action.我们一直面临这个SSH问题.这可能是什么真正的问题?请指出我的解决方案.
日志:
AUTH_FAILED:无法执行操作[ssh -o PasswordAuthentication = no -o KbdInteractiveDevices = no -o StrictHostKeyChecking = no -o ConnectTimeout = 20 USER@1.2.3.4 mkdir -p oozie-oozi/0000000-131008185935754-oozie-oozi-W/action1 - ssh /] | ErrorStream:警告:永久性地将主机1.2.3.4(RSA)添加到已知主机列表中.权限被拒绝(publickey,gssapi-keyex,gssapi-with-mic,密码).
org.apache.oozie.action.ActionExecutorException:AUTH_FAILED:无法执行操作[ssh -o PasswordAuthentication = no -o KbdInteractiveDevices = no -o StrictHostKeyChecking = no -o ConnectTimeout = 20 user@1.2.3.4 mkdir -p oozie-oozi/0000000-131008185935754-oozie-oozi-W/action1 - ssh /] | ErrorStream:警告:永久性地将1.2.3.4,192.168.34.208(RSA)添加到已知主机列表中.权限被拒绝(publickey,gssapi-keyex,gssapi-with-mic,密码).
at org.apache.oozie.action.ssh.SshActionExecutor.execute(SshActionExecutor.java:589)
at org.apache.oozie.action.ssh.SshActionExecutor.start(SshActionExecutor.java:204)
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:211)
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:59)
at org.apache.oozie.command.XCommand.call(XCommand.java:277)
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:326)
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:255)
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Run Code Online (Sandbox Code Playgroud)
引起:java.io.IOException:无法执行操作[ssh -o PasswordAuthentication = no -o KbdInteractiveDevices = no -o StrictHostKeyChecking = no -o ConnectTimeout = 20 user@1.2.3.4 mkdir -p oozie-oozi/0000000- 131008185935754-oozie-oozi-W/action1 - ssh /] | ErrorStream:警告:永久性地将"1.2.3.4,1.2.3.4"(RSA)添加到已知主机列表中.权限被拒绝(publickey,gssapi-keyex,gssapi-with-mic,密码).
at org.apache.oozie.action.ssh.SshActionExecutor.executeCommand(SshActionExecutor.java:340)
at org.apache.oozie.action.ssh.SshActionExecutor.setupRemote(SshActionExecutor.java:373)
at org.apache.oozie.action.ssh.SshActionExecutor$1.call(SshActionExecutor.java:206)
at org.apache.oozie.action.ssh.SshActionExecutor$1.call(SshActionExecutor.java:204)
at org.apache.oozie.action.ssh.SshActionExecutor.execute(SshActionExecutor.java:547)
... 10 more
Run Code Online (Sandbox Code Playgroud)
2013-10-09 12:48:25,982 WARN org.apache.oozie.command.wf.ActionStartXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W]行动[0000000-131008185935754-oozie-oozi-W @ action1]暂停工作流程工作ID = 0000000-131008185935754-oozie-oozi-W 2013-10-09 12:48:27,204 WARN org.apache.oozie.command.coord.CoordActionUpdateXCommand :USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W]动作[0000000-131008185935754-oozie-oozi-W @ action1] E1100:命令前置条件未保持之前执行,[,coord action为null],错误代码:E1100 2013-10-09 12:59:57,477 INFO org.apache.oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [测试]工作[0000000-131008185935754-oozie-oozi-W]行动[ - ] STARTED WorkflowKillXCommand for jobId = 0000000-131008185935754-oozie-oozi-W 2013-10-09 12:59:57,685 WARN org.apache.oozie.command.coord.CoordActionUpdateXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W] ACTION [ - ] E1100:命令前提条件执行前未保持,[,coord action为null],错误代码:E1100 2013-10-09 12:59:57,686 INFO org.apache.oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [ ] APP [测试]工作[0000000-131008185935754-oozie-oozi-W]行动[ - ] ENDED WorkflowKillXCommand for jobId = 0000000-131008185935754-oozie-oozi-W 2013-10-09 13:41:32,654 WARN org.apache. oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W] ACTION [ - ] E0725:工作流实例无法被杀死,0000000- 131008185935754-oozie-oozi-W,错误代码:E0725 2013-10-09 13:41:45,199 WARN org.apache.oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W] ACTION [ - ] E0725:工作流程实例无法被杀死,0000000-131008185935754-oozie-oozi-W,错误代码:E0725 2013-10-09 13:42:04,869 WARN org.apache.oozie.command.wf.ResumeXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie- oozi-W] ACTION [ - ] E1100:执行前命令前置条件不成立,[工作流程状态为KILLED未暂停],错误代码:E1100 2013-10-09 13:45:55,857 WARN org.apache.oozie.command .wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W] ACTION [ - ] E0725:工作流实例无法被杀死,0000000-131008185935754-oozie -oozi-W,错误代码:E07250000000-131008185935754-oozie-oozi-W,错误代码:E0725 2013-10-09 13:42:04,869警告org.apache.oozie.command.wf.ResumeXCommand:USER [user] GROUP [ - ] TOKEN [] APP [ [测试] JOB [0000000-131008185935754-oozie-oozi-W]操作[ - ] E1100:执行前命令前提条件未成立,[工作流程状态为KILLED未暂停],错误代码:E1100 2013-10-09 13:45 :53,557 WARN org.apache.oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W]操作[ - ] E0725:工作流实例无法杀死,0000000-131008185935754-oozie-oozi-W,错误代码:E07250000000-131008185935754-oozie-oozi-W,错误代码:E0725 2013-10-09 13:42:04,869警告org.apache.oozie.command.wf.ResumeXCommand:USER [user] GROUP [ - ] TOKEN [] APP [ [测试] JOB [0000000-131008185935754-oozie-oozi-W]操作[ - ] E1100:执行前命令前提条件未成立,[工作流程状态为KILLED未暂停],错误代码:E1100 2013-10-09 13:45 :53,557 WARN org.apache.oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [Test] JOB [0000000-131008185935754-oozie-oozi-W]操作[ - ] E0725:工作流实例无法杀死,0000000-131008185935754-oozie-oozi-W,错误代码:E0725命令前提条件在执行前不成立,[工作流程的状态为KILLED不是SUSPENDED],错误代码:E1100 2013-10-09 13:45:55,857 WARN org.apache.oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [测试] JOB [0000000-131008185935754-oozie-oozi-W]操作[ - ] E0725:工作流实例无法被杀死,0000000-131008185935754-oozie-oozi-W,错误代码:E0725命令前提条件在执行前不成立,[工作流程的状态为KILLED不是SUSPENDED],错误代码:E1100 2013-10-09 13:45:55,857 WARN org.apache.oozie.command.wf.KillXCommand:USER [user] GROUP [ - ] TOKEN [] APP [测试] JOB [0000000-131008185935754-oozie-oozi-W]操作[ - ] E0725:工作流实例无法被杀死,0000000-131008185935754-oozie-oozi-W,错误代码:E0725
接近试过:
谢谢;
卡萨.
我刚刚遇到了类似的问题.我有一个案例,我可以作为用户运行:
ssh -o PasswordAuthentication=no -o KbdInteractiveDevices=no -o StrictHostKeyChecking=no -o ConnectTimeout=20 USER@1.2.3.4 mkdir -p oozie-oozi/0000000-131008185935754-oozie-oozi-W/action1--ssh/
Run Code Online (Sandbox Code Playgroud)
在命令行上手动工作,但是当通过Oozie作为USER启动它失败了.
在我的情况下,失败的原因是我在oozie服务器上的USER和远程机器上的USER之间设置了无密码ssh.需要做的是在oozie服务器上的oozie和远程机器上的USER之间设置无密码ssh.换句话说,在oozie服务器上su到oozie并手动运行上面的命令.如果失败,它将在Oozie中失败.如果它工作,那么它应该在Oozie中工作(假设所有其他正确,如dir权限等)
看一下您的oozie服务器运行的用户:
ps -ef | grep oozie
Run Code Online (Sandbox Code Playgroud)
无论用户在远程机器上需要无密码ssh到USER.
| 归档时间: |
|
| 查看次数: |
7923 次 |
| 最近记录: |