标签: unicode

"Unicode错误"unicodeescape"编解码器无法解码字节...无法在Python 3中打开文本文件

我在Windows 7机器上使用python 3.1.俄语是默认的系统语言,utf-8是默认编码.

看看上一个问题的答案,我尝试使用"编解码器"模块给我一点运气.以下是一些例子:

>>> g = codecs.open("C:\Users\Eric\Desktop\beeline.txt", "r", encoding="utf-8")
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 2-4: truncated \UXXXXXXXX escape (<pyshell#39>, line 1)
>>> g = codecs.open("C:\Users\Eric\Desktop\Site.txt", "r", encoding="utf-8")
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 2-4: truncated \UXXXXXXXX escape (<pyshell#40>, line 1)
>>> g = codecs.open("C:\Python31\Notes.txt", "r", encoding="utf-8")
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 11-12: malformed \N character escape (<pyshell#41>, line 1)
>>> g = codecs.open("C:\Users\Eric\Desktop\Site.txt", …

Run Code Online (Sandbox Code Playgroud)

python unicode python-3.x

Eri*_*ric

2017 05-23

223
推荐指数

6
解决办法

48万
查看次数

C++中_tmain()和main()有什么区别？

如果我使用以下main()方法运行我的C++应用程序,一切正常:

int main(int argc, char *argv[]) 
{
   cout << "There are " << argc << " arguments:" << endl;

   // Loop through each argument and print its number and value
   for (int i=0; i<argc; i++)
      cout << i << " " << argv[i] << endl;

   return 0;
}

Run Code Online (Sandbox Code Playgroud)

我得到了我的期望,我的论点被打印出来了.

但是,如果我使用_tmain:

int _tmain(int argc, char *argv[]) 
{
   cout << "There are " << argc << " arguments:" << endl;

   // Loop through each argument and print its number and value
   for …

Run Code Online (Sandbox Code Playgroud)

c++ unicode arguments

jos*_*ley

lucky-day

222
推荐指数

4
解决办法

14万
查看次数

将Unicode文本写入文本文件？

我正在从Google文档中提取数据,处理数据并将其写入文件(最终我将粘贴到Wordpress页面).

它有一些非ASCII符号.如何将这些安全地转换为可以在HTML源中使用的符号？

目前我正在将所有内容转换为Unicode,在Python字符串中将它们连接在一起,然后执行:

import codecs
f = codecs.open('out.txt', mode="w", encoding="iso-8859-1")
f.write(all_html.encode("iso-8859-1", "replace"))

Run Code Online (Sandbox Code Playgroud)

最后一行有编码错误:

UnicodeDecodeError:'ascii'编解码器无法解码位置12286中的字节0xa0:序数不在范围内(128)

部分解决方案:

这个Python运行时没有错误:

row = [unicode(x.strip()) if x is not None else u'' for x in row]
all_html = row[0] + "<br/>" + row[1]
f = open('out.txt', 'w')
f.write(all_html.encode("utf-8"))

Run Code Online (Sandbox Code Playgroud)

但是如果我打开实际的文本文件,我会看到许多符号,如:

Qur‚Äôan

Run Code Online (Sandbox Code Playgroud)

也许我需要写一些文本文件以外的东西？

python unicode character-encoding python-2.x

sim*_*mon

2019 06-20

219
推荐指数

5
解决办法

26万
查看次数

UnicodeDecodeError,无效的连续字节

为什么以下项目失败？为什么它会成功使用"latin-1"编解码器？

o = "a test of \xe9 char" #I want this to remain a string as this is what I am receiving
v = o.decode("utf-8")

Run Code Online (Sandbox Code Playgroud)

结果是:

 Traceback (most recent call last):  
 File "<stdin>", line 1, in <module>  
 File "C:\Python27\lib\encodings\utf_8.py",
 line 16, in decode
     return codecs.utf_8_decode(input, errors, True) UnicodeDecodeError:
 'utf8' codec can't decode byte 0xe9 in position 10: invalid continuation byte

Run Code Online (Sandbox Code Playgroud)

python unicode decode

Rui*_*iDC

2011 04-05

217
推荐指数

10
解决办法

51万
查看次数