tim*_*ger 3 python python-3.x python-docx
我编写了一个使用 python-docx 来搜索 Word 文档(通过搜索运行)的参考编号和技术关键词的脚本,然后创建一个总结搜索结果的表格,该表格附加到 Word 文档的末尾。
有些文档有 100 多页,所以我想通过在搜索结果表中创建内部超链接来让用户更轻松,这样它会将您带到文档中检测到搜索结果的位置。
一旦找到参考运行,我不知道如何将其标记为书签或如何在结果表中创建指向该书签的超链接。
我能够使用本页中的代码创建外部网址的书签 通过使用 python-docx 在 MSWord 中添加超链接
我也尝试过创建书签,我找到了此页面: https: //github.com/python-openxml/python-docx/issues/109
标题与创建书签有关,但代码似乎在word中生成数字。
我觉得这两个解决方案可以放在一起,但我对 xml/word 文档没有足够的了解,无法做到这一点。
更新: 我发现一些代码可以将书签添加到Word文档中,现在需要的是一种使用Word文档中的链接链接到此的方法 https://github.com/python-openxml/python-docx/issues /403
*from docx import Document
def add_bookmark(paragraph, bookmark_text, bookmark_name):
run = paragraph.add_run()
tag = run._r # for reference the following also works: tag = document.element.xpath('//w:r')[-1]
start = docx.oxml.shared.OxmlElement('w:bookmarkStart')
start.set(docx.oxml.ns.qn('w:id'), '0')
start.set(docx.oxml.ns.qn('w:name'), bookmark_name)
tag.append(start)
text = docx.oxml.OxmlElement('w:r')
text.text = bookmark_text
tag.append(text)
end = docx.oxml.shared.OxmlElement('w:bookmarkEnd')
end.set(docx.oxml.ns.qn('w:id'), '0')
end.set(docx.oxml.ns.qn('w:name'), bookmark_name)
tag.append(end)
doc = Document("test_input_1.docx")
# add a bookmakr to every paragraph
for paranum, paragraph in enumerate(doc.paragraphs):
add_bookmark(paragraph=paragraph, bookmark_text=f"temp{paranum}", bookmark_name=f"temp{paranum+1}")
doc.save("output.docx")*
Run Code Online (Sandbox Code Playgroud)
已解决:我从这篇文章中得到它,将超链接添加到书签
这是关键行
hyperlink.set(docx.oxml.shared.qn('w:anchor'), link_to,)
Run Code Online (Sandbox Code Playgroud)
作为奖励,我添加了向链接添加工具提示的功能:
享受
答案如下:
from docx import Document
import docx
from docx.enum.dml import MSO_THEME_COLOR_INDEX
def add_bookmark(paragraph, bookmark_text, bookmark_name):
run = paragraph.add_run()
tag = run._r
start = docx.oxml.shared.OxmlElement('w:bookmarkStart')
start.set(docx.oxml.ns.qn('w:id'), '0')
start.set(docx.oxml.ns.qn('w:name'), bookmark_name)
tag.append(start)
text = docx.oxml.OxmlElement('w:r')
text.text = bookmark_text
tag.append(text)
end = docx.oxml.shared.OxmlElement('w:bookmarkEnd')
end.set(docx.oxml.ns.qn('w:id'), '0')
end.set(docx.oxml.ns.qn('w:name'), bookmark_name)
tag.append(end)
def add_link(paragraph, link_to, text, tool_tip=None):
# create hyperlink node
hyperlink = docx.oxml.shared.OxmlElement('w:hyperlink')
# set attribute for link to bookmark
hyperlink.set(docx.oxml.shared.qn('w:anchor'), link_to,)
if tool_tip is not None:
# set attribute for link to bookmark
hyperlink.set(docx.oxml.shared.qn('w:tooltip'), tool_tip,)
new_run = docx.oxml.shared.OxmlElement('w:r')
rPr = docx.oxml.shared.OxmlElement('w:rPr')
new_run.append(rPr)
new_run.text = text
hyperlink.append(new_run)
r = paragraph.add_run()
r._r.append(hyperlink)
r.font.name = "Calibri"
r.font.color.theme_color = MSO_THEME_COLOR_INDEX.HYPERLINK
r.font.underline = True
# test the functions
if __name__ == "__main__":
# input test document
doc = Document(r"test_input_1.docx")
# add a bookmark to every paragraph
for paranum, paragraph in enumerate(doc.paragraphs):
add_bookmark(paragraph=paragraph,
bookmark_text=f"{paranum}", bookmark_name=f"temp{paranum+1}")
# add page to the end to put your link
doc.add_page_break()
paragraph = doc.add_paragraph("This is where the internal link will live")
# add a link to the first paragraph
add_link(paragraph=paragraph, link_to="temp0",
text="this is a link to ", tool_tip="your message here")
doc.save(r"output.docx")
Run Code Online (Sandbox Code Playgroud)