Django模型中域名的正则表达式匹配

Cod*_*alk 3 python regex django model

我有一张桌子,看起来像:

class Tld(models.Model):
    domainNm = models.CharField(validators=[ RegexValidator('^[0-9]^[a-z]','yourdomain.com only','Invalid Entry')], max_length=40)
    dtCreated = models.DateField()
Run Code Online (Sandbox Code Playgroud)

对于domainNm - 我想验证任何看起来像这样的条目:

  • domain.com
  • 1domain.com
  • domain1.com

它必须遵循这样的方式:<domainname>.[com|biz|net]等等,并且是字母数字.

如何在django模型的模型级别上执行此操作?

谢谢

Adr*_*ián 11

如果要验证HTTP URL,请忘记正则表达式并使用内置验证器.

如果您只想要没有任何协议的域,请尝试:

def full_domain_validator(hostname):
    """
    Fully validates a domain name as compilant with the standard rules:
        - Composed of series of labels concatenated with dots, as are all domain names.
        - Each label must be between 1 and 63 characters long.
        - The entire hostname (including the delimiting dots) has a maximum of 255 characters.
        - Only characters 'a' through 'z' (in a case-insensitive manner), the digits '0' through '9'.
        - Labels can't start or end with a hyphen.
    """
    HOSTNAME_LABEL_PATTERN = re.compile("(?!-)[A-Z\d-]+(?<!-)$", re.IGNORECASE)
    if not hostname:
        return
    if len(hostname) > 255:
        raise ValidationError(_("The domain name cannot be composed of more than 255 characters."))
    if hostname[-1:] == ".":
        hostname = hostname[:-1]  # strip exactly one dot from the right, if present
    for label in hostname.split("."):
        if len(label) > 63:
            raise ValidationError(
                _("The label '%(label)s' is too long (maximum is 63 characters).") % {'label': label})
        if not HOSTNAME_LABEL_PATTERN.match(label):
            raise ValidationError(_("Unallowed characters in label '%(label)s'.") % {'label': label})
Run Code Online (Sandbox Code Playgroud)

用法:

from django.core.validators import URLValidator

field = models.CharField(_('host name'), max_length=255, validators=[URLValidator])
Run Code Online (Sandbox Code Playgroud)

要么

field = models.CharField(_('host name'), max_length=255, validators=[full_domain_validator])
Run Code Online (Sandbox Code Playgroud)


Adi*_*bar 7

To recap the clarifications above: You want to match only domains with a single alphanumeric label and a TLD of up to 4 characters, eg. "domain.com" or "someotherdomain.info" or "345xyz.pdq1" but not "subdomain.domain.com", "http://domain.com", "www.domain.com", or "345xyz.abcde". This regex will do it:

^[a-z0-9]+\.[a-z0-9]{1,4}$
Run Code Online (Sandbox Code Playgroud)