如何定义 STFP Operator 在 Airflow 上的操作?

Raj*_*Raj 3 python operators operation directed-acyclic-graphs airflow

class SFTPOperation(object):
    PUT = 'put'
    GET = 'get'  

operation=SFTPOperation.GET,
NameError: name 'SFTPOperation' is not defined
Run Code Online (Sandbox Code Playgroud)

我在这里定义了操作符,但我在互联网上找不到与操作相关的任何内容

class sftpplugin(AirflowPlugin):
    name = "sftp_plugin"
    operators = [SFTPOperator]
Run Code Online (Sandbox Code Playgroud)

任何帮助将不胜感激!

谢谢,

Mig*_*ejo 6

通过注意到SFTP操作员使用 ssh_hook 打开 sftp 传输通道,您应该需要提供ssh_hookssh_conn_id用于文件传输。首先,让我们看一个提供参数的示例ssh_conn_id

from airflow.providers.sftp.operators import sftp_operator
from airflow import DAG
import datetime

dag = DAG(
'test_dag',
start_date = datetime.datetime(2020,1,8,0,0,0),
schedule_interval = '@daily'
)

put_operation = SFTPOperator(
            task_id="operation",
            ssh_conn_id="ssh_default",
            local_filepath="route_to_local_file",
            remote_filepath="remote_route_to_copy",
            operation="put",
            dag=dag
            )
get_operation = SFTPOperator(....,
            operation = "get",
            dag = dag
            )

put_operation >> get_operation
Run Code Online (Sandbox Code Playgroud)

请注意,应根据您的任务的需要安排 dag,这里的示例考虑从中午开始的每日计划。现在,如果您提供 SSHhook,则需要对上述代码进行以下更改

from airflow.contrib.hooks.ssh_hook import SSHHook
...

put_operation = SFTPOperator(
            task_id="operation",
            ssh_hook=SSHHook("Name_of_variable_defined"),
            ...
            dag=dag
            )
....
Run Code Online (Sandbox Code Playgroud)

"Name_of_variable_defined"Airflow界面的Admin -> Connections中创建的位置。