Print a string with UTF-8 encoded characters, e.g. "\u00c5\u009b"

Question

dan*_*iel 2 python python-3.x python-unicode

I want to print a string that is encoded like this: "Cze\u00c5\u009b\u00c4\u0087", but I don't know how. The example string should print as "Cześć".

What I have tried:

text = "Cze\u00c5\u009b\u00c4\u0087"
print(text)
# gives: CzeÅÄ

str_bytes = text.encode("unicode_escape")
print(str_bytes)
# gives: b'Cze\\xc5\\x9b\\xc4\\x87'

text = str_bytes.decode("utf8")
print(text)
# gives: Cze\xc5\x9b\xc4\x87


whereas

print(b"Cze\xc5\x9b\xc4\x87".decode("utf8"))


gives "Cześć", but I don't know how to convert the string "Cze\xc5\x9b\xc4\x87" into the bytes b"Cze\xc5\x9b\xc4\x87".

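I also know that the problem is the extra backslashes in the bytes representation after encoding the original string with "unicode_escape", but I don't know how to get rid of them: str_bytes.replace(b'\\\\', b'\\') doesn't work.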
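
A short sketch of why that replace call has no effect, assuming the same input string (the variable names are illustrative). The doubled backslashes appear only in the printed repr of the bytes object. The data itself contains a single backslash byte per escape, followed by the literal characters x, c, 5 and so on, so there is no pair of backslashes to replace.

```python
text = "Cze\u00c5\u009b\u00c4\u0087"
escaped = text.encode("unicode_escape")

# Each escape such as \xc5 is stored as four separate bytes: \, x, c, 5.
backslash_count = escaped.count(b"\\")
escaped_length = len(escaped)  # 3 bytes for "Cze" plus 4 escapes of 4 bytes each

# b'\\\\' means two consecutive backslash bytes, which never occur here,
# so replace() returns the data unchanged.
replaced = escaped.replace(b"\\\\", b"\\")
unchanged = replaced == escaped

print(backslash_count, escaped_length, unchanged)
```

Even a replace that did match would not help, because the escape sequences are plain ASCII text at this point and not the raw bytes 0xC5, 0x9B, 0xC4, 0x87.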

Answer 1

Mat*_*ero 5

Use raw_unicode_escape:

text = 'Cze\u00c5\u009b\u00c4\u0087'
text_bytes = text.encode('raw_unicode_escape')
print(text_bytes.decode('utf8')) # outputs Cześć

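A minimal sketch of why this appears to work, assuming every character in the input is below U+0100 as in the example. For such code points raw_unicode_escape writes one raw byte per character, so the result is the original UTF-8 byte sequence, which can then be decoded normally. Under the same assumption, encoding with latin-1 should give identical bytes. The variable names are illustrative.

```python
text = "Cze\u00c5\u009b\u00c4\u0087"

# Code points below U+0100 are emitted as single raw bytes.
raw = text.encode("raw_unicode_escape")
print(raw)  # b'Cze\xc5\x9b\xc4\x87'

# Those bytes are valid UTF-8, so decoding recovers the intended text.
fixed = raw.decode("utf-8")
print(fixed)  # Cześć

# For input limited to U+0000..U+00FF, latin-1 yields the same bytes.
same_as_latin1 = text.encode("latin-1") == raw

# Caveat: code points above U+00FF become literal \uXXXX text, not raw bytes.
escaped_high = "\u015b".encode("raw_unicode_escape")
print(escaped_high)  # b'\\u015b'
```

If the input can mix mis-decoded text with genuine characters above U+00FF, neither codec round-trips cleanly, and the strings would need to be repaired at the point where the wrong decoding happened.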
