让 SQLAlchemy 在 ORM 级联中执行“ON DUPLICATE KEY UPDATE”(在 MySQL 中)

use*_*813 1 python mysql orm sqlalchemy

我正在从 python reddit 频道交叉发布这个,我还没有设法得到回应。我很惊讶,因为这似乎是一个非常标准的场景。

我的问题是在 SQLAlchemy 的 ORM 接口(在 mysql 上)的级联插入中,是否有一种简单的方法可以处理子对象的主键冲突。举一个具体的例子,我有一个句子表和一个“字典”表,当我创建一个句子记录时,我想在字典中插入所有相关的单词。当然,经常有一些词已经在字典中,所以我得到一个主键错误(这个词是关键)。如果我使用原始 SQL,我会执行 ON DUPLICATE KEY UPDATE 来获取现有记录的 id 并将其与句子相关联。我希望能够直接在 ORM 中执行此操作,因为级联非常方便(上面的示例实际上是简化的……涉及很多表)。由于 ORM 执行级联插入,我不

这似乎是一个非常标准的问题,但我见过的大多数响应都不使用 ORM。我看过很多相关的问题(关于 ON DUPLICATE KEY UPDATE),答案似乎是一个自定义的@compiles,如:

https://github.com/bedwards/sqlalchemy_mysql_ext/blob/master/duplicate.py

我不明白的是(因为我还在学习 SQLAlchemy)是否可以在 ORM 中使用它?如果没有,还有其他解决方案吗?

我已经尝试了 session.merge() 而不是 session.add() 但这没有用。

任何帮助,将不胜感激。

编辑:

感谢下面 iuridiniz 的帮助,我现在可以提供一些演示问题的具体代码。假设我们有一个与用户一对多关系的组表,并且每个用户都与(电子邮件)地址具有一对一关系,并且后者必须是唯一的。我们现在可以创建两个具有相同地址的用户,如果我们直接对用户进行合并,它的行为是正确的(不会因为电子邮件地址重复而引发错误)。但是,如果我们创建两个具有相同地址的用户,然后将它们添加到一个组并尝试在该组上进行合并,则会引发唯一键错误(即在尝试添加地址之前它不会检查地址是否存在):

from sqlalchemy import create_engine, Column, types
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy.orm import sessionmaker, scoped_session
from sqlalchemy.orm import Session

from sqlalchemy import ForeignKey
from sqlalchemy.orm import relationship, backref


engine = create_engine('sqlite:///:memory:', echo=False)
Base = declarative_base()

session = scoped_session(sessionmaker(bind=engine))

class Group(Base):
    __tablename__ = "groups"
    gid = Column(types.Integer, primary_key=True)
    name = Column(types.String(255))

    users = relationship("User", backref="group")

    def __repr__(self):
        ret =  "Group(name=%r)" % self.name
        for user in self.users:
            ret += str(user)

class User(Base):
    __tablename__ = "users"
    login = Column(types.String(50), primary_key=True)
    name = Column(types.String(255))
    group_id = Column(types.Integer, ForeignKey('groups.gid'))
    address = Column(types.String(200), 
                            ForeignKey('addresses.email_address'))
    email = relationship("Address")

    def __repr__(self):
        return "User(login=%r, name=%r)\n%s" % (self.login, self.name,
                str(self.email))
Run Code Online (Sandbox Code Playgroud)

关于解决这个问题的优雅方式的任何想法?

很抱歉使用“答案”最初发布此编辑时堆栈溢出礼仪不好。

iur*_*niz 6

AFAIK SQLAlchemy ORM 层无法执行ON DUPLICATE KEY UPDATE. 幸运的是session.merge()功能,可以为你做这一点,但只有当此键是一个主键(你的情况)。

session.merge(o)通过发出 a 检查是否存在具有相同主键值的行SELECT,如果为真,则发出 aUPDATE而不是INSERT

看这个例子:

from sqlalchemy import create_engine, Column, types
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy.orm import sessionmaker, scoped_session

engine = create_engine('sqlite:///:memory:', echo=False)
Base = declarative_base()
session = scoped_session(sessionmaker(bind=engine))

class User(Base):
    __tablename__ = "user"
    login = Column(types.String(50), primary_key=True)
    name = Column(types.String(255))

    def __repr__(self):
        return "User(login=%r, name=%r)" % (self.login, self.name)

Base.metadata.create_all(engine)

if __name__ == '__main__':
    # create two users    
    u1 = User(login='iuridiniz', name="Iuri Diniz")
    u2 = User(login='someuser', name="Some User")
    session.merge(u1) # could be session.add(u1)
    session.merge(u2) # could be session.add(u2)
    session.commit()

    # print all users
    print("First two users")
    for u in session.query(User):
        print(u)

    # create more two users, one with the same login
    u3 = User(login='iuridiniz', name="Iuri Gomes Diniz")
    u4 = User(login='anotheruser', name="Another User")
    session.merge(u3) # session.add(u3) will raise a sqlalchemy.exc.IntegrityError
    session.merge(u4) # could be session.add(u4)
    session.commit()

    print("More two users")
    for u in session.query(User):
        print(u)
Run Code Online (Sandbox Code Playgroud)

输出:

First two users
User(login=u'iuridiniz', name=u'Iuri Diniz')
User(login=u'someuser', name=u'Some User')
More two users
User(login=u'iuridiniz', name=u'Iuri Gomes Diniz')
User(login=u'someuser', name=u'Some User')
User(login=u'anotheruser', name=u'Another User')
Run Code Online (Sandbox Code Playgroud)

更改engine = create_engine('sqlite:///:memory:', echo=False)engine = create_engine('sqlite:///:memory:', echo=True)以查看执行的查询:

[INFO Engine] SELECT CAST('test plain returns' AS VARCHAR(60)) AS anon_1
[INFO Engine] ()
[INFO Engine] SELECT CAST('test unicode returns' AS VARCHAR(60)) AS anon_1
[INFO Engine] ()
[INFO Engine] PRAGMA table_info("user")
[INFO Engine] ()
[INFO Engine] 
CREATE TABLE user (
    login VARCHAR(50) NOT NULL, 
    name VARCHAR(255), 
    PRIMARY KEY (login)
)


[INFO Engine] ()
[INFO Engine] COMMIT
[INFO Engine] BEGIN (implicit)
[INFO Engine] SELECT user.login AS user_login, user.name AS user_name 
FROM user 
WHERE user.login = ?
[INFO Engine] ('iuridiniz',)
[INFO Engine] INSERT INTO user (login, name) VALUES (?, ?)
[INFO Engine] ('iuridiniz', 'Iuri Diniz')
[INFO Engine] SELECT user.login AS user_login, user.name AS user_name 
FROM user 
WHERE user.login = ?
[INFO Engine] ('someuser',)
[INFO Engine] INSERT INTO user (login, name) VALUES (?, ?)
[INFO Engine] ('someuser', 'Some User')
[INFO Engine] COMMIT
[INFO Engine] BEGIN (implicit)
[INFO Engine] SELECT user.login AS user_login, user.name AS user_name 
FROM user 
WHERE user.login = ?
[INFO Engine] ('iuridiniz',)
[INFO Engine] UPDATE user SET name=? WHERE user.login = ?
[INFO Engine] ('Iuri Gomes Diniz', 'iuridiniz')
[INFO Engine] SELECT user.login AS user_login, user.name AS user_name 
FROM user 
WHERE user.login = ?
[INFO Engine] ('anotheruser',)
[INFO Engine] INSERT INTO user (login, name) VALUES (?, ?)
[INFO Engine] ('anotheruser', 'Another User')
[INFO Engine] COMMIT
Run Code Online (Sandbox Code Playgroud)