在运行 中的 wordcount 示例时Hadoop,我遇到以下错误。
saying "JAR does not exist or is not a normal file:
/usr/local/hadoop/share/hadoop/mapreduce/hadoop-mapreduceexamples-2.2.0.jar"
Run Code Online (Sandbox Code Playgroud)
我的输入命令是:
hadoop jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduceexamples-2.2.0.jar wordcount input output
Run Code Online (Sandbox Code Playgroud) 这是两个表: 输入:
Employees table:
+-------------+----------+
| employee_id | name |
+-------------+----------+
| 2 | Crew |
| 4 | Haven |
| 5 | Kristian |
+-------------+----------+
Salaries table:
+-------------+--------+
| employee_id | salary |
+-------------+--------+
| 5 | 76071 |
| 1 | 22517 |
| 4 | 63539 |
+-------------+--------+
Run Code Online (Sandbox Code Playgroud)
我想要一个像你如何执行连接但使用联合的输出。
联合的输出应如下所示:
employee_id | name | salary
2 crew. null
4. haven. 63539
5. Kristian 76071
1. null. 22517
Run Code Online (Sandbox Code Playgroud)
并发布工会我想执行选择,通过选择没有姓名或没有薪水的员工的employee_id
目前我正在处理的查询:
select * from
(select employee_id, name, null …Run Code Online (Sandbox Code Playgroud)