eda*_*lls 5 python mysql iterator cursor
我试图使用Python和MySQLdb接口实现以下功能:
对我来说,迭代每一行,即时处理然后在运行中将每个新行插入到新表中似乎是明智的.
这有效:
import MySQLdb
import MySQLdb.cursors
conn=MySQLdb.connect(
host="somehost",user="someuser",
passwd="somepassword",db="somedb")
cursor1 = conn.cursor(MySQLdb.cursors.Cursor)
query1 = "SELECT * FROM table1"
cursor1.execute(query1)
cursor2 = conn.cursor(MySQLdb.cursors.Cursor)
for row in cursor1:
values = some_function(row)
query2 = "INSERT INTO table2 VALUES (%s, %s, %s)"
cursor2.execute(query2, values)
cursor2.close()
cursor1.close()
conn.commit()
conn.close()
Run Code Online (Sandbox Code Playgroud)
但这很慢且占用内存,因为它使用客户端游标进行SELECT
查询.如果我改为使用服务器端游标进行SELECT
查询:
cursor1 = conn.cursor(MySQLdb.cursors.SSCursor)
Run Code Online (Sandbox Code Playgroud)
然后我收到2014年的错误:
Exception _mysql_exceptions.ProgrammingError: (2014, "Commands out of sync; you can't run this command now") in <bound method SSCursor.__del__ of <MySQLdb.cursors.SSCursor object at 0x925d6ec>> ignored
Run Code Online (Sandbox Code Playgroud)
所以它似乎不喜欢在迭代服务器端游标时启动另一个游标.这似乎让我陷入了一个非常缓慢的客户端迭代器.
有什么建议?
您需要与数据库的单独连接,因为第一个连接被困在流式传输结果集上,因此您无法运行插入查询。
尝试这个:
import MySQLdb
import MySQLdb.cursors
conn=MySQLdb.connect(
host="somehost",user="someuser",
passwd="somepassword",db="somedb")
cursor1 = conn.cursor(MySQLdb.cursors.SSCursor)
query1 = "SELECT * FROM table1"
cursor1.execute(query1)
insertConn=MySQLdb.connect(
host="somehost",user="someuser",
passwd="somepassword",db="somedb")
cursor2 = inserConn.cursor(MySQLdb.cursors.Cursor)
for row in cursor1:
values = some_function(row)
query2 = "INSERT INTO table2 VALUES (%s, %s, %s)"
cursor2.execute(query2, values)
cursor2.close()
cursor1.close()
conn.commit()
conn.close()
insertConn.commit()
insertConn.close()
Run Code Online (Sandbox Code Playgroud)
归档时间: |
|
查看次数: |
2368 次 |
最近记录: |