Iva*_*vin 6 sqlalchemy vertica python-3.x pandas
我尝试使用 sqlalchemy 通过 Pandas 附加我的 vertica(SQL 类型)表
import pandas as pd
import sqlalchemy as sa
Run Code Online (Sandbox Code Playgroud)
为vertica创建引擎:
def get_engine(base):
engine = sa.create_engine("{sys}+{dri}://{user}:" + \
"{password}@{host}:{port}/{database}".format(**login[base]))
return engine
engine = get_engine('vertica')
Run Code Online (Sandbox Code Playgroud)
为了清楚起见,一个简单的查询:
table = '***'
sql =\
'''
select *
from public.{table}
'''.format(table=table)
connection = engine.connect()
data = pd.read_sql(sql, connection)
connection.close()
Run Code Online (Sandbox Code Playgroud)
数据不为空:
print(len(data))
569955
Run Code Online (Sandbox Code Playgroud)
并尝试写入同一张表:
fields = list(data.columns)
connection = engine.connect()
data.to_sql(table, connection, schema='public', index=False, if_exists='append', chunksize=30000,
dtype={fields[0]:sa.types.Integer,
fields[1]:sa.types.VARCHAR,
fields[2]:sa.types.Integer,
fields[3]:sa.types.Integer,
fields[4]:sa.types.Integer,
fields[5]:sa.types.VARCHAR,
fields[6]:sa.types.VARCHAR,
fields[7]:sa.types.VARCHAR,
fields[8]:sa.types.VARCHAR,
fields[9]:sa.types.VARCHAR,
fields[10]:sa.types.VARCHAR,
fields[11]:sa.types.VARCHAR,
fields[12]:sa.types.DateTime
})
connection.close()
Run Code Online (Sandbox Code Playgroud)
并得到这个错误:
...
\Anaconda3\lib\site-packages\sqlalchemy\engine\default.py in do_executemany(self, cursor, statement, parameters, context)
465
466 def do_executemany(self, cursor, statement, parameters, context=None):
--> 467 cursor.executemany(statement, parameters)
468
469 def do_execute(self, cursor, statement, parameters, context=None):
\Anaconda3\lib\site-packages\vertica_python\vertica\cursor.py in executemany(self, operation, seq_of_parameters)
153 else:
154 raise NotImplementedError(
--> 155 "executemany is implemented for simple INSERT statements only")
156
157 def fetchone(self):
NotImplementedError: executemany is implemented for simple INSERT statements only
Run Code Online (Sandbox Code Playgroud)
当我尝试使用 sqlalchemy 将数据写入 vertica 时,出现了同样的错误。就我而言,问题在于列名称。它似乎无法写入包含特殊字符的列名。我可以通过从 pandas 中的列名称中删除所有“_”、“%”和空格字符来修复错误,然后使用 df.to_sql() 将其写入 vertica。
| 归档时间: |
|
| 查看次数: |
1081 次 |
| 最近记录: |