Cam*_*ron 31 javascript python url encoding
是否有Python的任何等效Javascript函数urllib.quote()和urllib.unquote()?
我遇到的最接近的是escape(),encodeURI()和encodeURIComponent()(和它们相应的非编码函数),但就我所知,它们不会对同一组特殊字符进行编码/解码.
谢谢,
卡梅隆
mjh*_*jhm 71
作为记录:
JavaScript | Python
-----------------------------------
encodeURI(str) | urllib.quote(str, safe='~@#$&()*!+=:;,.?/\'');
-----------------------------------
encodeURIComponent(str) | urllib.quote(str, safe='~()*!.\'')
Run Code Online (Sandbox Code Playgroud)
好的,我想我将使用混合自定义函数集:
编码:使用encodeURIComponent(),然后将斜杠放回.
解码:解码找到的任何%十六进制值.
这是我最终使用的更完整的变体(它也正确处理Unicode):
function quoteUrl(url, safe) {
if (typeof(safe) !== 'string') {
safe = '/'; // Don't escape slashes by default
}
url = encodeURIComponent(url);
// Unescape characters that were in the safe list
toUnencode = [ ];
for (var i = safe.length - 1; i >= 0; --i) {
var encoded = encodeURIComponent(safe[i]);
if (encoded !== safe.charAt(i)) { // Ignore safe char if it wasn't escaped
toUnencode.push(encoded);
}
}
url = url.replace(new RegExp(toUnencode.join('|'), 'ig'), decodeURIComponent);
return url;
}
var unquoteUrl = decodeURIComponent; // Make alias to have symmetric function names
Run Code Online (Sandbox Code Playgroud)
请注意,如果在编码时不需要"安全"字符('/'默认情况下在Python中),那么您可以直接使用内置函数encodeURIComponent()和decodeURIComponent()函数.
此外,如果字符串中有Unicode字符(即代码点> = 128的字符),那么为了保持与JavaScript的兼容性encodeURIComponent(),Python quote_url()必须是:
def quote_url(url, safe):
"""URL-encodes a string (either str (i.e. ASCII) or unicode);
uses de-facto UTF-8 encoding to handle Unicode codepoints in given string.
"""
return urllib.quote(unicode(url).encode('utf-8'), safe)
Run Code Online (Sandbox Code Playgroud)
而且unquote_url()将是:
def unquote_url(url):
"""Decodes a URL that was encoded using quote_url.
Returns a unicode instance.
"""
return urllib.unquote(url).decode('utf-8')
Run Code Online (Sandbox Code Playgroud)
该请求库是一个比较受欢迎的,如果你不介意额外的依赖
from requests.utils import quote
quote(str)
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
22696 次 |
| 最近记录: |