Chinese characters in UTF-8

Dav*_*542 3 python unicode utf-8

>>> s='未作評級'
>>> s
'\xe6\x9c\xaa\xe4\xbd\x9c\xe8\xa9\x95\xe7\xb4\x9a'
>>> s = unicode(s)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe6 in position 0: ordinal not in range(128)

How can I convert 未作評級 to Unicode?

Tim*_*ker 6

Use a Unicode string from the start:

>>> s = u'未作評級'

Or decode the string from its current encoding (which looks like UTF-8). You then get a Unicode string.

>>> s = '未作評級'.decode("utf-8")
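
To see why the call in the question fails, here is a minimal sketch using the exact bytes from the question's output. With no encoding given, `unicode(s)` falls back to the ASCII codec, which rejects any byte above 127. The names `raw`, `text`, and `ascii_ok` are illustrative and not from the question, and the snippet is written so that it should run on both Python 2 and Python 3:

```python
# The byte sequence shown in the question's repr output.
raw = b'\xe6\x9c\xaa\xe4\xbd\x9c\xe8\xa9\x95\xe7\xb4\x9a'

# Decoding as ASCII fails, because 0xe6 is outside the range 0-127.
try:
    raw.decode('ascii')
    ascii_ok = True
except UnicodeDecodeError:
    ascii_ok = False

# Naming the actual encoding succeeds and yields a Unicode string.
text = raw.decode('utf-8')

print(ascii_ok)   # False
print(len(raw))   # 12 (bytes: three per character)
print(len(text))  # 4 (characters)
```

In Python 2, `unicode(s, "utf-8")` should give the same result as `s.decode("utf-8")`.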