H.C*_*hoi 2 python xml parsing beautifulsoup hyperlink
我有这样的xml:
<link>
www.link1.com
</link>
<link>
www.link2.com
</link>
Run Code Online (Sandbox Code Playgroud)
我试过这段代码:
from BeautifulSoup import BeautifulStoneSoup
soup = BeautifulStoneSoup(results2) #Beautiful Soup
linklist = soup.findAll('link')
print soup
Run Code Online (Sandbox Code Playgroud)
使用此代码,输出是
[<link>www.link1.com</link>,<link>www.link2.com</link>]
Run Code Online (Sandbox Code Playgroud)
但我想要这样的输出
[www.link1.com, www.link2.com]
Run Code Online (Sandbox Code Playgroud)
你有没有尝试过:
linklist = [el.string for el in soup.findAll('link')]
Run Code Online (Sandbox Code Playgroud)