Sqlalchemy 在多个表上使用带有外连接的 func

Mad*_*war 3 python sql sqlalchemy pyramid

我在 sqlalchemy 中有以下表格:-

class Post(Base):
    __tablename__ = 'posts'
    id = Column(Integer, primary_key=True)
    compare_url =Column(String(200))
    url = Column(String(200))
    postedby = Column(Integer)
    category = Column(String(50))
    title  = Column(String(500),nullable=False)
    author  = Column(String(500),default="Unspecified")
    content = Column(Text(),default="could not fetch this content you will have to read it externally")
    summary = Column(Text())
    time = Column(TIMESTAMP(),default=now())
    post_type=Column(Text())
    Reads = relationship("Read", backref="Post")
    Reposts = relationship("RePost", backref="Post")
    Votes = relationship("Vote", backref="Post")



class Read(Base):
    __tablename__ = 'reads'
    id = Column(Integer, primary_key=True)
    post_read = Column(Integer, ForeignKey('posts.id'))
    #post = relationship("Post", backref=backref('Reads', order_by=id))
    time = Column(TIMESTAMP(),default=now())
    user_id = Column(String(50))


class Vote(Base):
    __tablename__ = 'votes'
    id = Column(Integer, primary_key=True)
    post_read = Column(Integer, ForeignKey('posts.id'))
    time = Column(TIMESTAMP(),default=now())
    user_id = Column(String(50))
    user_vote = Column(Boolean(),nullable=False)
Run Code Online (Sandbox Code Playgroud)

我有这个查询

posts = session.query(Post, func.count(Read.id).label('total'),func.sum(Vote.user_vote).label('votes'),User.username).outerjoin(Post.Reads).outerjoin(Post.Votes)
Run Code Online (Sandbox Code Playgroud)

我试图获得投票数和帖子被阅读的次数。投票值可以是 -1 或 1

问题是我在每个帖子上的阅读次数和投票数获得相同的值

例如,当我的阅读表有

id  post_read   time             user_id
1   7       2012-09-19 09:32:06  1
Run Code Online (Sandbox Code Playgroud)

和投票表有

id  post_read   time                 user_id    user_vote
1   7 [->]         2012-09-19 09:42:27  1   1
2   7 [->]         2012-09-19 09:42:27  2   1
Run Code Online (Sandbox Code Playgroud)

但我仍然得到了投票的价值并读为两个。

van*_*van 6

它看起来就好像你可以通过简单地更换解决这个特定的问题func.count(Read.id).label('total')func.count(func.distinct(Read.id)).label('total')。事实上,这将解决读取次数的问题。

但是,如果您突然为您的帖子增加了另一个读者(最终有 2 个读者和 2 个投票者),那么您的所有选票也将被计算两次。

对此的最佳解决方案就是不要在同一查询中聚合不同的项目。您可以使用子查询来解决这个问题:

subq_read = (session.query(
                Post.id, 
                func.count(Read.id).label("total_read")
            ).
            outerjoin(Post.Reads).
            group_by(Read.post_read)
            ).subquery()

subq_vote = (session.query(
                Post.id, 
                func.sum(Vote.user_vote).label("total_votes")
            ).
            outerjoin(Post.Votes).
            group_by(Vote.post_read)
            ).subquery()

posts = (session.query(
            Post, 
            subq_read.c.total_read,
            subq_vote.c.total_votes,
        ).
        outerjoin(subq_read, subq_read.c.id == Post.id).
        outerjoin(subq_vote, subq_vote.c.id == Post.id)
        .group_by(Post)
        )
Run Code Online (Sandbox Code Playgroud)

注意:您User.username的查询中有 a ,但我在查询中没有看到任何join子句。您可能也想检查一下。