meh*_*dxx 1 python python-requests
import requests
import xml.etree.ElementTree as ET
import re
gen_news_list=[]
r_milligenel = requests.get('http://www.milliyet.com.tr/D/rss/rss/Rss_4.xml')
root_milligenel = ET.fromstring(r_milligenel.text)
for entry in root_milligenel:
for channel in entry:
for item in channel:
title = re.search(".*title.*",item.tag)
if title:
gen_news_list.append(item.text)
link = re.search(".*link.*",item.tag)
if link:
gen_news_list.append(item.text)
r = requests.get(item.text)
print(r.text)
Run Code Online (Sandbox Code Playgroud)
我有一个名为gen_news_list的列表,我正在尝试将标题,摘要,链接等附加到此列表中.但是当我尝试请求链接时出现错误:
Traceback (most recent call last):
File "/home/deniz/Masaüstü/Çal??malar/Python/Bot/xmlcek.py", line 23, in <module>
r = requests.get(item.text)
File "/usr/lib/python3/dist-packages/requests/api.py", line 55, in get
return request('get', url, **kwargs)
File "/usr/lib/python3/dist-packages/requests/api.py", line 44, in request
return session.request(method=method, url=url, **kwargs)
File "/usr/lib/python3/dist-packages/requests/sessions.py", line 456, in request
resp = self.send(prep, **send_kwargs)
File "/usr/lib/python3/dist-packages/requests/sessions.py", line 553, in send
adapter = self.get_adapter(url=request.url)
File "/usr/lib/python3/dist-packages/requests/sessions.py", line 608, in get_adapter
raise InvalidSchema("No connection adapters were found for '%s'" % url)
requests.exceptions.InvalidSchema: No connection adapters were found for '
http://www.milliyet.com.tr/tbmm-baskani-cicek-programlarini/siyaset/detay/2037301/default.htm
Run Code Online (Sandbox Code Playgroud)
第一个链接成功运行.但第二个出错了.我无法添加内容以列出此错误的原因.这是我的循环问题吗?代码有什么问题?
如果添加行print(repr(item.text))问题的行之前r = requests.get(item.text)你看到开始第二次item.text具有\n在开始和这是不允许的URL结尾.
'\nhttp://www.milliyet.com.tr/tbmm-baskani-cicek-programlarini/siyaset/detay/2037301/default.htm\n'
Run Code Online (Sandbox Code Playgroud)
我使用repr它因为它\n在输出中将字符串显示为字符串.
你的问题的解决方法是调用strip上item.text删除这些换行符:
r = requests.get(item.text.strip())
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
15642 次 |
| 最近记录: |