Django QuerySet update_or_create创建重复条目

Sau*_*mar 4 django django-queryset

最近,我在update_or_create方法中遇到问题。首先让我给出完整的解释。

模型:

class TransactionPageVisits(models.Model):
    transactionid = models.ForeignKey(
        Transaction,
        on_delete=models.CASCADE,
        db_column='transactionid',
    )
    sessionid = models.CharField(max_length=40, db_index=True)
    ip_address = models.CharField(max_length=39, editable=False)
    user_agent = models.TextField(null=True, editable=False)
    page = models.CharField(max_length=100, null=True, db_index=True)
    method = models.CharField(max_length=20, null=True)
    url = models.TextField(null=False, editable=False)
    created_dtm = models.DateTimeField(auto_now_add=True)

    class Meta(object):
        ordering = ('created_dtm',)
Run Code Online (Sandbox Code Playgroud)

功能:

def _tracking(self, request, response, **kwargs):
    txn_details = kwargs.get('txn_details')
    data = {
        'sessionid': request.session.session_key,
        'ip_address': get_ip_address(request),
        'user_agent': get_user_agent(request),
        'method': request.method,
        'url': request.build_absolute_uri(),
        'transactionid': txn_details.txn_object,
        'page': kwargs.get('page')
    }

    # Keep updating/creating tracking data to model
    obj, created = TransactionPageVisits.objects.update_or_create(**data)
Run Code Online (Sandbox Code Playgroud)

笔记:

我知道我没有将任何默认参数传递给update_or_create(),因为在编写代码时它不是必需的(仅当每个数据的所有列都具有唯一性时,才希望创建新行)。_tracking()也在中间件中,并且将在每个请求和响应中调用。

一切进展顺利,直到今天我得到以下例外:

File "trackit.py", line 65, in _tracking
    obj, created = TransactionPageVisits.objects.update_or_create(**data)
  File "/usr/local/lib/python2.7/dist-packages/Django-1.10.4-py2.7.egg/django/db/models/manager.py", line 85, in manager_method
    return getattr(self.get_queryset(), name)(*args, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/Django-1.10.4-py2.7.egg/django/db/models/query.py", line 488, in update_or_create
    obj = self.get(**lookup)
  File "/usr/local/lib/python2.7/dist-packages/Django-1.10.4-py2.7.egg/django/db/models/query.py", line 389, in get
    (self.model._meta.object_name, num)
MultipleObjectsReturned: get() returned more than one TransactionPageVisits -- it returned 2!
Run Code Online (Sandbox Code Playgroud)

我注意到在表中创建了两个条目,它们的值完全相同(created_dtm除外,因为它具有auto_add_now = True):

| id    | sessionid                        | ip_address     | user_agent                                                                     | page | method | url                                                                                                    | created_dtm                | transactionid |
| 32858 | nrq2vwxbtsjp8yoibotpsur0zit5jhoq | xx.xxx.xxx.xxx | Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:58.0) Gecko/20100101 Firefox/58.0 |      | GET    | https://www.example.com/example_url/?jobid=5a9f2acb4cedfd00011c7d5d&transactionid=XXXXXXXXXXXX | 2018-03-06 23:57:00.061280 | XXXXXXXXXXXX  |
| 32859 | nrq2vwxbtsjp8yoibotpsur0zit5jhoq | xx.xxx.xxx.xxx | Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:58.0) Gecko/20100101 Firefox/58.0 |      | GET    | https://www.example.com/example_url/?jobid=5a9f2acb4cedfd00011c7d5d&transactionid=XXXXXXXXXXXX | 2018-03-06 23:57:00.062121 | XXXXXXXXXXXX  |
Run Code Online (Sandbox Code Playgroud)

为什么首先在表中创建重复条目?

Pau*_*ida 6

update_or_create文档中所述容易发生竞争:

如上文在get_or_create()中所述,此方法易于出现竞争条件,如果在数据库级别未强制唯一性,则可能导致同时插入多行。

您可以unique_together在模型中使用,如另一个答案所建议。我从未对此进行过测试,但是显然Django抓住了IntegrityError这些竞争条件引起的问题

  • 是的,我认为您是对的,有一个[commit](https://github.com/django/django/commit/d44afd889275473c97474cca19467d1509e0fcc1)已修复。 (2认同)