Posts by cor*_*ore

Getting the text between spans with BeautifulSoup

I am trying to scrape various sites using BeautifulSoup in Python. Say I have the following HTML excerpt:

<div class="member_biography">
<h3>Biography</h3>
<span class="sub_heading">District:</span> AnyState - At Large<br/>
<span class="sub_heading">Political Highlights:</span> AnyTown City Council, 19XX-XX<br/>
<span class="sub_heading">Born:</span> June X, 19XX; AnyTown, Calif.<br/>
<span class="sub_heading">Residence:</span> Some Town<br/>
<span class="sub_heading">Religion:</span> Episcopalian<br/>
<span class="sub_heading">Family:</span> Wife, Some Name; two children<br/>
<span class="sub_heading">Education:</span> Some State College, A.A. 19XX; Some Other State College, B.A. 19XX<br/>
<span class="sub_heading">Elected:</span> 19XX<br/>
</div>

I need the results in the following format:

District:              AnyState - At Large
Political Highlights:  AnyTown City Council, 19XX-XX
Born:                  June X, 19XX; AnyTown, Calif.
Residence:             Some Town
Religion:              Episcopalian …
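One possible approach, offered as a minimal sketch rather than a definitive answer: each value is a bare text node sitting right after its `<span class="sub_heading">`, so it can be collected by walking the span's following siblings until the next `<br/>`. The helper names `extract_fields` and `format_fields`, the `SAMPLE_HTML` constant, and the choice of the built-in `html.parser` backend are assumptions made for illustration; the sample is a shortened copy of the excerpt above.

```python
from bs4 import BeautifulSoup, NavigableString

# Shortened copy of the excerpt from the question (assumed sample data).
SAMPLE_HTML = """
<div class="member_biography">
<h3>Biography</h3>
<span class="sub_heading">District:</span> AnyState - At Large<br/>
<span class="sub_heading">Political Highlights:</span> AnyTown City Council, 19XX-XX<br/>
<span class="sub_heading">Residence:</span> Some Town<br/>
</div>
"""


def extract_fields(html):
    """Return a list of (label, value) pairs, one per sub_heading span."""
    soup = BeautifulSoup(html, "html.parser")
    fields = []
    for span in soup.find_all("span", class_="sub_heading"):
        label = span.get_text(strip=True)
        parts = []
        for sibling in span.next_siblings:
            # Stop at the <br/> that ends this entry, or at the next span.
            if getattr(sibling, "name", None) in ("br", "span"):
                break
            # Keep only the bare text nodes that follow the label.
            if isinstance(sibling, NavigableString):
                parts.append(str(sibling))
        fields.append((label, "".join(parts).strip()))
    return fields


def format_fields(fields):
    """Left-align labels in a column two characters wider than the longest."""
    width = max(len(label) for label, _ in fields) + 2
    return "\n".join(f"{label:<{width}}{value}" for label, value in fields)


if __name__ == "__main__":
    print(format_fields(extract_fields(SAMPLE_HTML)))
```

Walking `next_siblings` instead of calling `get_text()` on the whole `<div>` keeps each label paired with its own value, which matters because the values are not wrapped in any tag of their own.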

python lxml beautifulsoup

