BeautifulSoup 类型错误

Ben*_*ale 0 python xml beautifulsoup

我遇到了 BeautifulSoup(更具体地说是 xml 解析器)的问题,其中似乎将“名称”作为标签属性重载了一些底层函数。

鉴于以下代码:

#!/usr/bin/env python3

from bs4 import BeautifulSoup

siteconfig="""
<?xml version="1.0" encoding="utf-8"?>
<sites version="180201">
  <site name="au" location="oceana">
    <addresslist="IPv4">
      <address>192.168.1.10/32</address>
      <address>192.168.2.10/32</address>
    </addresslist>
    <addresslist="IPv6">
      <address>fc00:07bc:5ae6:75d0::26/128</address>
      <address>fc00:07bc:5ae6:75d1::26/128</address>
    </addresslist>
  </site>
  <site name="us" location="americas">
    <addresslist="IPv4">
      <address>192.168.4.13/32</address>
      <address>192.168.5.13/32</address>
    </addresslist>
    <addresslist="IPv6">
      <address>fc00:07bc:5ae6:75d0::45/128</address>
      <address>fc00:07bc:5ae6:75d1::45/128</address>
    </addresslist>
  </site>
</sites>
"""
soup = BeautifulSoup(siteconfig,"xml")
print(soup.find("site", name="us"))
Run Code Online (Sandbox Code Playgroud)

我收到以下错误:

Traceback (most recent call last):
  File "./siteConfig.py", line 33, in <module>
    print(soup.find("site", name="us"))
TypeError: find() got multiple values for argument 'name'
Run Code Online (Sandbox Code Playgroud)

但是,如果我将最后一行更改为:

print(soup.find("site", location="americas"))
Run Code Online (Sandbox Code Playgroud)

我得到以下输出:

<site location="americas" name="us">
  <addresslist>="IPv4"&gt;
    <address>192.168.4.13/32</address>
    <address>192.168.5.13/32</address>
  </addresslist>
  <addresslist>="IPv6"&gt;
    <address>fc00:07bc:5ae6:75d0::45/128</address>
    <address>fc00:07bc:5ae6:75d1::45/128</address>
  </addresslist>
</site>
Run Code Online (Sandbox Code Playgroud)

为什么会这样?

DYZ*_*DYZ 5

name是第一个参数的名称find(在您的情况下,“站点”):

find(name=None, attrs={}, recursive=True, text=None, **kwargs) bs4.BeautifulSoup 实例的方法

使用属性字典soup.find("site", {"name":"us"})来避免冲突。