我需要将文件夹从本地文件系统复制到HDFS.我找不到任何将文件夹(包括其所有子文件夹)移动到HDFS的示例
$ hadoop fs -copyFromLocal /home/ubuntu/Source-Folder-To-Copy HDFS-URI
Ash*_*ith 79
你可以尝试:
hadoop fs -put /path/in/linux /hdfs/path
Run Code Online (Sandbox Code Playgroud)
甚至
hadoop fs -copyFromLocal /path/in/linux /hdfs/path
Run Code Online (Sandbox Code Playgroud)
在默认情况下都put与copyFromLocal将上传目录递归到HDFS.
Kum*_*mar 28
如果从本地复制文件夹,则会将包含其所有子文件夹的文件夹复制到HDFS.
要将文件夹从本地复制到hdfs,您可以使用
hadoop fs -put localpath
Run Code Online (Sandbox Code Playgroud)
要么
hadoop fs -copyFromLocal localpath
Run Code Online (Sandbox Code Playgroud)
要么
hadoop fs -put localpath hdfspath
Run Code Online (Sandbox Code Playgroud)
要么
hadoop fs -copyFromLocal localpath hdfspath
Run Code Online (Sandbox Code Playgroud)
注意:
如果未指定hdfs path,则文件夹副本将复制到具有该文件夹相同名称的hdfs.
从hdfs复制到本地
hadoop fs -get hdfspath localpath
Run Code Online (Sandbox Code Playgroud)
mrs*_*vas 21
hdfs dfs -put <localsrc> <dest>
在将文件放入HDFS之前检查源和目标
[cloudera@quickstart ~]$ ll files/
total 132
-rwxrwxr-x 1 cloudera cloudera 5387 Nov 14 06:33 cloudera-manager
-rwxrwxr-x 1 cloudera cloudera 9964 Nov 14 06:33 cm_api.py
-rw-rw-r-- 1 cloudera cloudera 664 Nov 14 06:33 derby.log
-rw-rw-r-- 1 cloudera cloudera 53655 Nov 14 06:33 enterprise-deployment.json
-rw-rw-r-- 1 cloudera cloudera 50515 Nov 14 06:33 express-deployment.json
[cloudera@quickstart ~]$ hdfs dfs -ls
Found 1 items
drwxr-xr-x - cloudera cloudera 0 2017-11-14 00:45 .sparkStaging
Run Code Online (Sandbox Code Playgroud)
使用-put或-copyFromLocal命令复制文件HDFS
[cloudera@quickstart ~]$ hdfs dfs -put files/ files
Run Code Online (Sandbox Code Playgroud)
验证HDFS中的结果
[cloudera@quickstart ~]$ hdfs dfs -ls
Found 2 items
drwxr-xr-x - cloudera cloudera 0 2017-11-14 00:45 .sparkStaging
drwxr-xr-x - cloudera cloudera 0 2017-11-14 06:34 files
[cloudera@quickstart ~]$ hdfs dfs -ls files
Found 5 items
-rw-r--r-- 1 cloudera cloudera 5387 2017-11-14 06:34 files/cloudera-manager
-rw-r--r-- 1 cloudera cloudera 9964 2017-11-14 06:34 files/cm_api.py
-rw-r--r-- 1 cloudera cloudera 664 2017-11-14 06:34 files/derby.log
-rw-r--r-- 1 cloudera cloudera 53655 2017-11-14 06:34 files/enterprise-deployment.json
-rw-r--r-- 1 cloudera cloudera 50515 2017-11-14 06:34 files/express-deployment.json
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
169376 次 |
| 最近记录: |