Spark DataFrame 1:
+------+-------+---------+----+---+-------+
|city  |product|date     |sale|exp|wastage|
+------+-------+---------+----+---+-------+
|city 1|prod 1 |9/29/2017|358 |975|193    |
|city 1|prod 2 |8/25/2017|50  |687|201    |
|city 1|prod 3 |9/9/2017 |236 |431|169    |
|city 2|prod 1 |9/28/2017|358 |975|193    |
|city 2|prod 2 |8/24/2017|50  |687|201    |
|city 3|prod 3 |9/8/2017 |236 |431|169    |
+------+-------+---------+----+---+-------+
Spark DataFrame 2:
+------+-------+---------+----+---+-------+
|city  |product|date     |sale|exp|wastage|
+------+-------+---------+----+---+-------+
|city 1|prod 1 |9/29/2017|358 |975|193    |
|city 1|prod 2 |8/25/2017|50  |687|201    |
|city 1|prod 3 |9/9/2017 |230 |430|160    |
|city 1|prod 4 |9/27/2017|350 |90 |190    |
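|city 2|prod 2 …

The Spark SQL documentation specifies that join() supports the following join types:
Must be one of: inner, cross, outer, full, full_outer, left, left_outer, right, right_outer, left_semi, and left_anti.
What is the difference between outer and full_outer? I suspect there is none and that they are simply synonyms for each other, but I would like to be sure.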
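To make the suspected synonym relationship concrete, here is a minimal pure-Python sketch of how the listed names could collapse into a smaller set of join types. It is an illustrative model, not Spark's code: the helper `canonical_join_type` and the table `JOIN_TYPE_ALIASES` are hypothetical names, and the grouping (including the assumption that case and underscores are ignored) reflects my understanding of how join() treats these strings rather than anything stated in the excerpt above.

```python
# Illustrative model only: this is NOT Spark's implementation.
# Assumption: join-type strings are compared case-insensitively with
# underscores removed, and several spellings map to the same join type.
JOIN_TYPE_ALIASES = {
    "inner": "inner",
    "cross": "cross",
    "outer": "full_outer",
    "full": "full_outer",
    "fullouter": "full_outer",
    "left": "left_outer",
    "leftouter": "left_outer",
    "right": "right_outer",
    "rightouter": "right_outer",
    "leftsemi": "left_semi",
    "leftanti": "left_anti",
}


def canonical_join_type(how: str) -> str:
    """Map a join-type string to one canonical name (hypothetical helper)."""
    key = how.lower().replace("_", "")
    if key not in JOIN_TYPE_ALIASES:
        raise ValueError(f"Unsupported join type: {how!r}")
    return JOIN_TYPE_ALIASES[key]


# Under this model, "outer" and "full_outer" name the same join.
print(canonical_join_type("outer") == canonical_join_type("full_outer"))
```

To check this against a real Spark session rather than a model, one option is to run the same join twice, once with "outer" and once with "full_outer", and compare the plans printed by explain() on each result.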