JavaScript equivalents of Python's urllib.quote() and urllib.unquote()

Cam*_*ron 31 javascript python url encoding

Are there any JavaScript functions equivalent to Python's urllib.quote() and urllib.unquote()?

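The closest I've come across are escape(), encodeURI(), and encodeURIComponent() (and their corresponding decoding functions), but as far as I can tell they don't encode/decode the same set of special characters.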

Thanks,
Cameron

mjh*_*jhm 71

For the record:

JavaScript               |  Python
----------------------------------- 
encodeURI(str)           |  urllib.quote(str, safe='~@#$&()*!+=:;,.?/\'')
-----------------------------------
encodeURIComponent(str)  |  urllib.quote(str, safe='~()*!.\'')

  • For an even more pedantic record: `encodeURIComponent` uses UTF-8, whereas `urllib.quote(u'é')`, for example, raises an exception. The equivalent Python encoding should be `urllib.quote(unicode(str).encode('utf-8'), safe=...)` (21 upvotes)
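
To illustrate the table and the UTF-8 comment above, here is a minimal sketch in Python 3. It rests on an assumption that is not in the thread: on Python 3 the function is `urllib.parse.quote()`, which encodes `str` input as UTF-8 by default, so no explicit `.encode('utf-8')` is needed. The helper names `encode_uri` and `encode_uri_component` are hypothetical, chosen only to mirror the JavaScript names.

```python
from urllib.parse import quote, unquote

# Characters the JavaScript functions leave unescaped, in addition to
# letters, digits, and "_.-" (which quote() never escapes).
ENCODE_URI_SAFE = "~@#$&()*!+=:;,.?/'"
ENCODE_URI_COMPONENT_SAFE = "~()*!.'"


def encode_uri(text):
    """Hypothetical helper: approximates JavaScript's encodeURI()."""
    return quote(text, safe=ENCODE_URI_SAFE)


def encode_uri_component(text):
    """Hypothetical helper: approximates JavaScript's encodeURIComponent()."""
    return quote(text, safe=ENCODE_URI_COMPONENT_SAFE)


sample = "a b/c?é"
print(encode_uri(sample))            # slashes and "?" are kept
print(encode_uri_component(sample))  # slashes and "?" are escaped
print(unquote(encode_uri_component(sample)) == sample)
```

The only difference between the two helpers is the `safe` set, which matches how `encodeURI()` preserves URL delimiters while `encodeURIComponent()` escapes them.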

Cam*_*ron 6

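OK, I think I'm going to use a hybrid set of custom functions:

Encode: use encodeURIComponent(), then put the slashes back in.
Decode: decode any %hex values found.

Here's a more complete variant of what I ended up using (it also handles Unicode correctly):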

function quoteUrl(url, safe) {
    if (typeof(safe) !== 'string') {
        safe = '/';    // Don't escape slashes by default
    }

    url = encodeURIComponent(url);

    // Unescape characters that were in the safe list
    var toUnencode = [];    // Declared with var to avoid creating an implicit global
    for (var i = safe.length - 1; i >= 0; --i) {
        var encoded = encodeURIComponent(safe.charAt(i));
        if (encoded !== safe.charAt(i)) {    // Ignore safe char if it wasn't escaped
            toUnencode.push(encoded);
        }
    }

    url = url.replace(new RegExp(toUnencode.join('|'), 'ig'), decodeURIComponent);

    return url;
}


var unquoteUrl = decodeURIComponent;    // Make alias to have symmetric function names

Note that if you don't need "safe" characters when encoding ('/' by default in Python), you can just use the built-in encodeURIComponent() and decodeURIComponent() functions directly.

Also, if the string contains Unicode characters (i.e. characters with code points >= 128), then to stay compatible with JavaScript's encodeURIComponent(), the Python quote_url() would have to be:

import urllib  # Python 2 module; on Python 3, quote()/unquote() live in urllib.parse

def quote_url(url, safe):
    """URL-encodes a string (either str (i.e. ASCII) or unicode);
    uses de-facto UTF-8 encoding to handle Unicode codepoints in given string.
    """
    return urllib.quote(unicode(url).encode('utf-8'), safe)

And unquote_url() would be:

def unquote_url(url):
    """Decodes a URL that was encoded using quote_url.
    Returns a unicode instance.
    """
    return urllib.unquote(url).decode('utf-8')
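
For readers on Python 3, the two helpers above might be ported roughly as follows. This is only a sketch under assumptions that are not in the original answer: it targets Python 3, where `urllib.parse.quote()` and `urllib.parse.unquote()` work on `str` and use UTF-8 by default, and it adds a `safe='/'` default to mirror the JavaScript quoteUrl() shown earlier.

```python
from urllib.parse import quote, unquote


def quote_url(url, safe="/"):
    """URL-encode a str, using UTF-8 for non-ASCII code points.

    The safe='/' default is an assumption added for this sketch; it mirrors
    the default of the JavaScript quoteUrl() helper.
    """
    return quote(url, safe=safe)


def unquote_url(url):
    """Decode a URL produced by quote_url(); returns a str."""
    return unquote(url)


original = "/path/é x"
encoded = quote_url(original)
print(encoded)
print(unquote_url(encoded) == original)   # round trip
print(quote_url("a/b", safe=""))          # slashes escaped when safe is empty
```

Passing `safe=""` gives the same treatment of slashes as a plain encodeURIComponent() call, which is the case the note above describes.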


Mil*_*ric 6

The requests library is a fairly popular option, if you don't mind the extra dependency:

from requests.utils import quote
quote(str)