将MIMEText编码为引用的printables

the*_*ega 12 python encoding quoted-printable

Python支持一个相当的功能MIME图书馆email.mime.

我想要实现的是获取包含普通UTF-8文本的MIME部分,将其编码为带引号的printables而不是base64.虽然库中提供了所有功能,但我没有设法使用它:

例:

import email.mime.text, email.encoders
m=email.mime.text.MIMEText(u'This is the text containing ünicöde', _charset='utf-8')
m.as_string()
# => Leads to a base64-encoded message, as base64 is the default.

email.encoders.encode_quopri(m)
m.as_string()
# => Leads to a strange message
Run Code Online (Sandbox Code Playgroud)

最后一个命令导致一个奇怪的消息:

Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: base64
Content-Transfer-Encoding: quoted-printable

GhpcyBpcyB0aGUgdGV4dCBjb250YWluaW5nIMO8bmljw7ZkZQ=3D=3D
Run Code Online (Sandbox Code Playgroud)

这显然不是编码为引用的printables,双头transfer-encoding最后是奇怪的(如果不是非法).

如何在mime-message中将我的文本编码为带引号的printables?

the*_*ega 9

好吧,我有一个非常hacky的解决方案,但至少它导致了某个方向:MIMEText假设base64而我不知道如何改变它.出于这个原因,我使用MIMENonMultipart:

import email.mime, email.mime.nonmultipart, email.charset
m=email.mime.nonmultipart.MIMENonMultipart('text', 'plain', charset='utf-8')

#Construct a new charset which uses Quoted Printables (base64 is default)
cs=email.charset.Charset('utf-8')
cs.body_encoding = email.charset.QP

#Now set the content using the new charset
m.set_payload(u'This is the text containing ünicöde', charset=cs)
Run Code Online (Sandbox Code Playgroud)

现在消息似乎编码正确:

Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: quoted-printable

This is the text containing =C3=BCnic=C3=B6de
Run Code Online (Sandbox Code Playgroud)

甚至可以构建一个隐藏复杂性的新类:

class MIMEUTF8QPText(email.mime.nonmultipart.MIMENonMultipart):
  def __init__(self, payload):
    email.mime.nonmultipart.MIMENonMultipart.__init__(self, 'text', 'plain',
                                                      charset='utf-8')

    utf8qp=email.charset.Charset('utf-8')
    utf8qp.body_encoding=email.charset.QP

    self.set_payload(payload, charset=utf8qp) 
Run Code Online (Sandbox Code Playgroud)

并像这样使用它:

m = MIMEUTF8QPText(u'This is the text containing ünicöde')
m.as_string()
Run Code Online (Sandbox Code Playgroud)


Ill*_*mov 8

在 Python 3 中你不需要 hack:

\n
import email\n\n# Construct a new charset which uses Quoted Printables (base64 is default)\ncs = email.charset.Charset(\'utf-8\')\ncs.body_encoding = email.charset.QP\n\nm = email.mime.text.MIMEText(u\'This is the text containing \xc3\xbcnic\xc3\xb6de\', \'plain\', _charset=cs)\n\nprint(m.as_string())\n
Run Code Online (Sandbox Code Playgroud)\n

  • 公平地说,Python 2 中需要 hack。你的答案仅适用于 Python 3。所以基本上你可以说原来的问题可以通过切换到 Python 3 来解决。 (2认同)

mmo*_*oya 5

改编自问题1525919并在python 2.7上测试:

from email.Message import Message
from email.Charset import Charset, QP

text = "\xc3\xa1 = \xc3\xa9"
msg = Message()

charset = Charset('utf-8')
charset.header_encoding = QP
charset.body_encoding = QP

msg.set_charset(charset)
msg.set_payload(msg._charset.body_encode(text))

print msg.as_string()
Run Code Online (Sandbox Code Playgroud)

会给你:

MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable

=C3=A1 =3D =C3=A9
Run Code Online (Sandbox Code Playgroud)

另请参阅Python提交者的此响应.