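I've been looking for a way to feed reduced data back into further map steps in Hadoop. I have objects of class A as input data and objects of class B as output data. The problem is that the map step generates not only Bs but also new As.
This is what I want to achieve: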
1.1 input: a list of As
1.2 map result: for each A a list of new As and a list of Bs is generated
1.3 reduce: filtered Bs are saved as output, filtered As are added to the map jobs
2.1 input: a list of As produced by the first map/reduce
2.2 map result: for each A a list of new As and a list of Bs is generated
2.3 …
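
To make the intended data flow concrete, here is a minimal in-memory sketch of the rounds described above. It does not use any Hadoop API. `A`, `B`, `map_a`, and `run_rounds` are hypothetical names, the mapping rule is invented purely for illustration, and the "filtering" is assumed to mean dropping duplicate Bs and already-processed As.

```python
from typing import Iterable, List, Set, Tuple

A = int  # hypothetical stand-in for class A
B = str  # hypothetical stand-in for class B


def map_a(a: A) -> Tuple[List[A], List[B]]:
    """Toy mapper: each A yields a list of new As and a list of Bs.

    The rule below is made up; a real job would apply its own logic.
    """
    new_as = [a - 1, a - 2] if a > 1 else []
    bs = [f"b{a}"]
    return new_as, bs


def run_rounds(initial_as: Iterable[A], max_rounds: int = 100) -> List[B]:
    """Repeat map/reduce rounds until no unprocessed As remain."""
    seen: Set[A] = set()      # As that have already been mapped
    output: Set[B] = set()    # filtered Bs collected across all rounds
    pending: Set[A] = set(initial_as)
    rounds = 0
    while pending and rounds < max_rounds:
        seen |= pending
        next_as: Set[A] = set()
        for a in pending:                 # "map" phase of this round
            new_as, bs = map_a(a)
            next_as.update(new_as)
            output.update(bs)             # "reduce": keep distinct Bs
        pending = next_as - seen          # "reduce": keep only unseen As
        rounds += 1
    return sorted(output)


if __name__ == "__main__":
    print(run_rounds([3]))
```

In Hadoop itself the same loop would presumably live in a driver that submits one job per round, with the As written by one round used as the input of the next; that part is an assumption about how the question could be approached, not something the sketch implements.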