将PIG输出存储为Ctrl a分隔输出以导入hive?

ade*_*sin 1 hadoop apache-pig

如何将PIG输出存储为Ctrl-a分隔输出以存储到配置单元?

Bec*_*tor 6

要获得预期结果,您可以按照下面提到的过程
使用以下命令存储您的关系

STORE <Relation> INTO '<file_path>' USING PigStorage('\u0001');
Run Code Online (Sandbox Code Playgroud)

参考生成的文件公开hive表

hive>CREATE EXTERNAL TABLE TEMP(
c1 INT,
c2 INT,
c3 INT,
c4 INT
.....
)
ROW FORMAT
DELIMITED FIELDS TERMINATED BY '\001'
LINES TERMINATED BY '\n'
STORED AS TEXTFILE
LOCATION '<file_path>';
Run Code Online (Sandbox Code Playgroud)

如果输出文件存在于linux本地目录中,则创建表

hive>CREATE TABLE TEMP(
c1 INT,
c2 INT,
c3 INT,
c4 INT
.....
)
ROW FORMAT
DELIMITED FIELDS TERMINATED BY '\001'
LINES TERMINATED BY '\n'
STORED AS TEXTFILE;
Run Code Online (Sandbox Code Playgroud)

并将数据加载到表中

hive> load data local inpath '<file_path>' into table temp;
Run Code Online (Sandbox Code Playgroud)