clu*_*s92 2 python postgresql airflow
我正在使用 Airflow 的 python 运算符来调用 python 函数。错误发生在 try/ except 块中。
def python_callable_new():
print("Inside python callable ...")
import psycopg2
try:
print("attempting database connection from python method.. ")
conn = psycopg2.connect('postgres_defined_connection')
print("success. ")
except Exception as error:
print("failed: ")
print (error)
return 'End of callable. '
with dag:
start_task = DummyOperator( task_id= "start" )
stop_task = DummyOperator( task_id= "stop" )
do_python_task = PythonOperator(
task_id = 'do-py-operation',
python_callable= python_callable_new,
)
extract_source_data = PostgresOperator(
task_id='extract-cb-source-data',
postgres_conn_id='postgres_defined_connection',
sql='./sql_scripts/extract_csv_data.sql'
)
# csv_to_postgres
start_task >> do_python_task >> extract_source_data >> stop_task
Run Code Online (Sandbox Code Playgroud)
基本上,我的问题是
(仅供参考 - 我将 postgres_define_connection 存储在使用 sqlalchemy 引擎和 PostgresHook 的单独的connections.py 中)
psycopg2.connect需要连接参数。如果将连接参数格式化为以空格分隔的键/值对,则可以向它们传递单个字符串。这就是为什么它给你错误消息缺少“=”。
请参阅psycopg 文档以获取更多信息。
要连接到 Airflow 中的 Postgres 数据库,只要您已创建连接,就可以利用PostgresHook 。
from airflow.hooks.postgres_hook import PostgresHook
def execute_query_with_conn_obj(query):
hook = PostgresHook(postgres_conn_id='my_connection')
conn = hook.get_conn()
cur = conn.cursor()
cur.execute(query)
def execute_query_with_hook(query):
hook = PostgresHook(postgres_conn_id='my_connection')
hook.run(sql=query)
Run Code Online (Sandbox Code Playgroud)
您也可以使用纯 Python 代码来完成此操作。
def execute_query_with_psycopg(query):
conn_args = dict(
host='myhost',
user='admin',
password='password',
dbname='my_schema',
port=5432)
conn = psycopg2.connect(**conn_args)
cur = conn.cursor()
cur.execute(query)
def execute_query_with_psycopg_string(query):
conn = psycopg2.connect("dbname=test user=postgres password=secret")
cur = conn.cursor()
cur.execute(query)
Run Code Online (Sandbox Code Playgroud)