我需要解析一个网站,但我得到一个错误403 Forbidden.这是一个代码:
url = 'http://worldagnetwork.com/'
result = requests.get(url)
print(result.content.decode())
Run Code Online (Sandbox Code Playgroud)
它的输出:
<html>
<head><title>403 Forbidden</title></head>
<body bgcolor="white">
<center><h1>403 Forbidden</h1></center>
<hr><center>nginx</center>
</body>
</html>
Run Code Online (Sandbox Code Playgroud)
请说出问题所在.
我正在尝试登录网站。当我查看 print(g.text) 时,我没有取回我期望的网页,而是一个 cloudflare 页面,上面写着“在访问之前检查您的浏览器”
import requests
import time
s = requests.Session()
s.get('https://www.off---white.com/en/GB/')
headers = {'Referer': 'https://www.off---white.com/en/GB/login'}
payload = {
'utf8':'?',
'authenticity_token':'',
'spree_user[email]': 'EMAIL@gmail.com',
'spree_user[password]': 'PASSWORD',
'spree_user[remember_me]': '0',
'commit': 'Login'
}
r = s.post('https://www.off---white.com/en/GB/login', data=payload, headers=headers)
print(r.status_code)
g = s.get('https://www.off---white.com/en/GB/account')
print(g.status_code)
print(g.text)
Run Code Online (Sandbox Code Playgroud)
为什么在我设置会话后会发生这种情况?