uWSGI 无法在从 Python 的标准输出日志重定向的日志文件中写入 unicode 数据

err*_*ata 5 python unicode logging uwsgi pyramid

我正在使用 uWSGI (2.0.11.2) 和 Python (3.4.3) 在 Ubuntu 14.04 上为我的 Pyramid (1.5.7) 应用程序提供服务。我注意到我的 uWSGI 日志中出现了与 unicode 解码相关的错误:

#
# one of the situations when exception is raised is
# when SQLAlchemy (which has set INFO logging level)
# tries to write an SQL statement containing unicode charater
# into log file
#
2016-02-26 16:01:38,734 INFO  [sqlalchemy.engine.base.Engine][b'uWSGIWorker5Core0'] BEGIN (implicit)
2016-02-26 16:01:38,735 INFO  [sqlalchemy.engine.base.Engine][b'uWSGIWorker5Core0'] SELECT * FROM staging WHERE company_name = %(company_name_1)s AND time = %(time_1)s AND ship_name = %(ship_name_1)s
# exact place (missing line) where SQLAlchemy is trying to print out
# query parameters, which in this case include unicode character
--- Logging error ---
Traceback (most recent call last):
  File "/usr/lib/python3.4/logging/__init__.py", line 980, in emit
    stream.write(msg)
UnicodeEncodeError: 'ascii' codec can't encode character '\xfa' in position 132: ordinal not in range(128)
Call stack:
  File "/home/mk/.virtualenvs/api/lib/python3.4/site-packages/sqltap/wsgi.py", line 42, in __call__
    return self.app(environ, start_response)
  File "/home/mk/.virtualenvs/api/lib/python3.4/site-packages/pyramid/router.py", line 242, in __call__
    response = self.invoke_subrequest(request, use_tweens=True)
  #
  # the stack continues...
  # full stack here -> https://bpaste.net/show/8e12af790372
  #
  File "/home/mk/.virtualenvs/api/lib/python3.4/site-packages/sqlalchemy/engine/base.py", line 1010, in _execute_clauseelement
    compiled_sql, distilled_params
  File "/home/mk/.virtualenvs/api/lib/python3.4/site-packages/sqlalchemy/engine/base.py", line 1100, in _execute_context
    sql_util._repr_params(parameters, batches=10)
Unable to print the message and arguments - possible formatting error.
Use the traceback above to help find the error.
Run Code Online (Sandbox Code Playgroud)

我还注意到,将相同的行写入 Pyramid 生成的日志文件(不涉及 uWSGI)工作得非常好,没有任何错误,并且正确插入了 unicode 字符。

我正在使用以下命令运行 uWSGI:

/usr/local/bin/uwsgi --emperor /etc/uwsgi/vassals
Run Code Online (Sandbox Code Playgroud)

vassals文件夹中,我从我的 Pyramid 应用程序中符号链接了 uWSGI 配置,它看起来像这样:

[uwsgi]

host = %h
username = mk
project_name = api
project_root = /shared/projects/python/%(project_name)

env = PYTHONIOENCODING=UTF-8

; this env var is generated based on host name
env = APP_INI_FILE=develop.ini

; folders config
home_folder = /home/%(username)
virtualenv_folder = %(home_folder)/.virtualenvs/%(project_name)
logs_folder = %(home_folder)/logs/%(project_name)
chdir = %(project_root)
socket = %(project_root)/%(project_name).sock
pidfile = %(project_root)/%(project_name).pid
virtualenv = %(virtualenv_folder)
daemonize = %(logs_folder)/uwsgi.log

; core stuff
master = true
vacuum = true
processes = 5
enable-threads = true

; socket conf
chmod-socket = 666  # invoking the One
chown-socket = %(username)
uid = %(username)
gid = %(username)

; log conf
log-reopen = true
logfile-chown = %(username)
logfile-chmod = 644

; app conf
module = wsgi:application
harakiri = 120
max-requests = 500
post-buffering = 1
paste = config:%p
paste-logger = $p
Run Code Online (Sandbox Code Playgroud)

定义所有日志记录的 Pyramid 配置文件如下所示:

###
# app configuration
# http://docs.pylonsproject.org/projects/pyramid/en/1.5-branch/narr/environment.html
###

[DEFAULT]
home_dir = /home/mk

[app:main]
use = egg:api

pyramid.reload_templates = false
pyramid.debug_authorization = false
pyramid.debug_notfound = false
pyramid.debug_routematch = false
pyramid.default_locale_name = en

sqlalchemy.url = postgresql://XXX:YYY@12.13.14.15:5432/ze_database?client_encoding=utf8

[server:main]
use = egg:waitress#main
host = 0.0.0.0
port = 6543

###
# logging configuration
# http://docs.pylonsproject.org/projects/pyramid/en/1.5-branch/narr/logging.html
###

[loggers]
keys = root, sqlalchemy

[handlers]
keys = console, debuglog

[formatters]
keys = generic, short

[logger_root]
level = DEBUG
handlers = console, debuglog

[logger_sqlalchemy]
level = INFO
handlers =
qualname = sqlalchemy.engine
# "level = INFO" logs SQL queries.
# "level = DEBUG" logs SQL queries and results.
# "level = WARN" logs neither.  (Recommended for production systems.)

[handler_console]
class = StreamHandler
args = (sys.stderr,)
level = DEBUG
formatter = generic

[handler_debuglog]
class = handlers.RotatingFileHandler
args = ('%(home_dir)s/logs/api/pyramid_debug.log', 'a', 1024000000, 10)
level = DEBUG
formatter = generic

[formatter_generic]
format = %(asctime)s %(levelname)-5.5s [%(name)s][%(threadName)s] %(message)s

[formatter_short]
format = %(asctime)s %(message)s
Run Code Online (Sandbox Code Playgroud)

最后,我的 Pyramidwsgi.py文件非常简单:

import os
from pyramid.paster import get_app, setup_logging

here = os.path.dirname(os.path.abspath(__file__))
conf = os.path.join(here, os.environ.get('APP_INI_FILE'))  # APP_INI_FILE variable is set in uwsgi.ini
setup_logging(conf)

application = get_app(conf, 'main')
Run Code Online (Sandbox Code Playgroud)

本质上,我将我的应用程序日志重定向到stderr(或者stdout,就我所注意到的而言,它完全相同),并且同时将其写入单独的文件 ( pyramid_debug.log) 中。stderr在我的情况下是 uWSGI 守护进程的日志文件,这就是错误发生的地方。

虽然LC_ALL和相关变量在系统上设置为en_EN.UTF-8,但我也尝试使用各种与本地化相关的环境变量并在我的 Pyramid 的 wsgi 应用程序中明确设置它们,但运气不佳 - 例如,仅PYTHONIOENCODING=UTF-8在 uWSGI 配置中设置变量解决了问题在我的本地机器上,但在我部署后不在服务器上。

我的明显问题是 - 在这种情况下如何正确处理 uWSGI 在日志文件中写入 unicode 字符?

per*_*oud 1

编辑File "/usr/lib/python3.4/logging/__init__.py", line 980和更改

\n\n
stream.write(msg)\n
Run Code Online (Sandbox Code Playgroud)\n\n

\n\n
stream.write(msg.encode(\'utf-8\'))\n
Run Code Online (Sandbox Code Playgroud)\n\n

最有可能的是,流类型的更改方式不应该杀死您的 UTF-8 编码功能,但实际上是由于 Pythonics 造成的。(似乎无论世界如何大声喊叫“UTF-8”,Python 都会忽略这个问题上的所有人。)

\n\n

例如,如果您正在处理文件,则您尝试的所有环境变量都将被忽略:

\n\n
# test.py\nimport sys\nsys.stdout.write(u\'\\u00f6\\n\')\n
Run Code Online (Sandbox Code Playgroud)\n\n

测试:

\n\n
max% python test.py\n\xc3\xb6\nmax% python test.py > f\nTraceback (most recent call last):\n  File "test.py", line 2, in <module>\n    sys.stdout.write(u\'\\u00f6\\n\')\nUnicodeEncodeError: \'ascii\' codec can\'t encode character u\'\\xf6\' in position 0: ordinal not in range(128)\n
Run Code Online (Sandbox Code Playgroud)\n