Ben*_*ale 0 python xml beautifulsoup
我遇到了 BeautifulSoup(更具体地说是 xml 解析器)的问题,其中似乎将“名称”作为标签属性重载了一些底层函数。
鉴于以下代码:
#!/usr/bin/env python3
from bs4 import BeautifulSoup
siteconfig="""
<?xml version="1.0" encoding="utf-8"?>
<sites version="180201">
<site name="au" location="oceana">
<addresslist="IPv4">
<address>192.168.1.10/32</address>
<address>192.168.2.10/32</address>
</addresslist>
<addresslist="IPv6">
<address>fc00:07bc:5ae6:75d0::26/128</address>
<address>fc00:07bc:5ae6:75d1::26/128</address>
</addresslist>
</site>
<site name="us" location="americas">
<addresslist="IPv4">
<address>192.168.4.13/32</address>
<address>192.168.5.13/32</address>
</addresslist>
<addresslist="IPv6">
<address>fc00:07bc:5ae6:75d0::45/128</address>
<address>fc00:07bc:5ae6:75d1::45/128</address>
</addresslist>
</site>
</sites>
"""
soup = BeautifulSoup(siteconfig,"xml")
print(soup.find("site", name="us"))
Run Code Online (Sandbox Code Playgroud)
我收到以下错误:
Traceback (most recent call last):
File "./siteConfig.py", line 33, in <module>
print(soup.find("site", name="us"))
TypeError: find() got multiple values for argument 'name'
Run Code Online (Sandbox Code Playgroud)
但是,如果我将最后一行更改为:
print(soup.find("site", location="americas"))
Run Code Online (Sandbox Code Playgroud)
我得到以下输出:
<site location="americas" name="us">
<addresslist>="IPv4">
<address>192.168.4.13/32</address>
<address>192.168.5.13/32</address>
</addresslist>
<addresslist>="IPv6">
<address>fc00:07bc:5ae6:75d0::45/128</address>
<address>fc00:07bc:5ae6:75d1::45/128</address>
</addresslist>
</site>
Run Code Online (Sandbox Code Playgroud)
为什么会这样?
name是第一个参数的名称find(在您的情况下,“站点”):
find(name=None, attrs={}, recursive=True, text=None, **kwargs) bs4.BeautifulSoup 实例的方法
使用属性字典soup.find("site", {"name":"us"})来避免冲突。