the*_*row 1 python urllib2 socks python-requests
我正在尝试nzxj65x32vh2fkhk.onion使用请求访问以下域.
我已经运行了,我正确配置了会话的对象代理.
import requests
session = requests.session()
session.proxies = {'http': 'socks5://localhost:9050',
'https': 'socks5://localhost:9050'}
print(session.get('http://httpbin.org/ip').text) # prints {"origin": "67.205.146.164" }
print(requests.get('http://httpbin.org/ip').text) # prints {"origin": "5.102.254.76" }
Run Code Online (Sandbox Code Playgroud)
但是当我尝试使用.onion域访问URL时,我收到以下错误:
session.get('http://nzxj65x32vh2fkhk.onion/all')
ConnectionError: SOCKSHTTPConnectionPool(host='nzxj65x32vh2fkhk.onion', port=80): Max retries exceeded with url: /all (Caused by NewConnectionError('<requests.packages.urllib3.contrib.socks.SOCKSConnection object at 0x7f5e8c2dbbd0>: Failed to establish a new connection: [Errno -2] Name or service not known',))
Run Code Online (Sandbox Code Playgroud)
我也尝试localhost用127.0.0.1其中一个答案中的建议替换.不幸的是,结果是一样的.
使用urllib2执行相同的请求就可以了.
import socks, socket, urllib2
def create_connection(address, timeout=None, source_address=None):
sock = socks.socksocket()
sock.connect(address)
return sock
socks.setdefaultproxy(socks.PROXY_TYPE_SOCKS5, '127.0.0.1', 9050)
socket.socket = socks.socksocket
socket.create_connection = create_connection
print(urllib2.urlopen('http://nzxj65x32vh2fkhk.onion/all').read()) # Prints the URL's contents
Run Code Online (Sandbox Code Playgroud)
cURL还可以正确检索页面的内容.
我正在使用Python 2.7.13,请求2.13.0和PySocks 1.6.7.Tor使用以下命令通过docker容器运行:
sudo docker run -it -p 8118:8118 -p 9050:9050 -d dperson/torproxy
Run Code Online (Sandbox Code Playgroud)
我在这做错了什么?如何使请求识别.onion URL,我需要做什么?
解决方案是使用该socks5h协议,以便在本地DNS解析过程失败的情况下启用远程DNS解析.请参阅https://github.com/kennethreitz/requests/blob/e3f89bf23c53b98593e4248054661472aacac820/requests/packages/urllib3/contrib/socks.py#L158
以下代码按预期工作:
import requests
session = requests.session()
session.proxies = {'http': 'socks5h://localhost:9050',
'https': 'socks5h://localhost:9050'}
print(session.get('http://httpbin.org/ip').text) # prints {"origin": "67.205.146.164" }
print(requests.get('http://httpbin.org/ip').text) # prints {"origin": "5.102.254.76" }
print(session.get('http://nzxj65x32vh2fkhk.onion/all').text) # Prints the contents of the page
Run Code Online (Sandbox Code Playgroud)