小编eda*_*lls的帖子

使用MySQLdb的嵌套查询

我试图使用Python和MySQLdb接口实现以下功能:

读取具有几百万行的表的内容.
处理并修改每一行的输出.
将修改后的行放入另一个表中.

对我来说,迭代每一行,即时处理然后在运行中将每个新行插入到新表中似乎是明智的.

这有效:

import MySQLdb
import MySQLdb.cursors

conn=MySQLdb.connect(
    host="somehost",user="someuser",
    passwd="somepassword",db="somedb")

cursor1 = conn.cursor(MySQLdb.cursors.Cursor)
query1 = "SELECT * FROM table1"
cursor1.execute(query1)

cursor2 = conn.cursor(MySQLdb.cursors.Cursor)

for row in cursor1:
    values = some_function(row)
    query2 = "INSERT INTO table2 VALUES (%s, %s, %s)"
    cursor2.execute(query2, values)

cursor2.close()
cursor1.close()
conn.commit()
conn.close()

Run Code Online (Sandbox Code Playgroud)

但这很慢且占用内存,因为它使用客户端游标进行SELECT查询.如果我改为使用服务器端游标进行SELECT查询:

cursor1 = conn.cursor(MySQLdb.cursors.SSCursor)

Run Code Online (Sandbox Code Playgroud)

然后我收到2014年的错误:

Exception _mysql_exceptions.ProgrammingError: (2014, "Commands out of sync; you can't run this command now") in <bound method SSCursor.__del__ of <MySQLdb.cursors.SSCursor object at 0x925d6ec>> ignored …

Run Code Online (Sandbox Code Playgroud)

python mysql iterator cursor

eda*_*lls

2013 04-08

5
推荐指数

1
解决办法

2368
查看次数

带有子查询的MySQL UPDATE查询永远占用

我有一个MySQL UPDATE查询,需要很长时间才能完成.我错过了一种更简单的方法来实现相同的结果吗？

"UPDATE table2, table1
SET table2.id_occurrences = (SELECT SUM(IF(id = table2.id, 1, 0)) FROM table1)
WHERE table2.id = table1.id;"

Run Code Online (Sandbox Code Playgroud)

table2包含所有可能的值id,每个值只有一个记录.
table1包含一些值id,但有一些值的多个记录.
我需要更新的记录中table2显示的对应值的出现次数id中table1.上面的查询完成了这项工作,但当table1包含500条记录和table230,000条记录时,大约需要3分钟.我有更大的表来处理所以这太长了:)

提前致谢.

mysql sql performance

eda*_*lls

lucky-day

3
推荐指数

1
解决办法

2225
查看次数