使用硒或要求填写表格

Sát*_*iro 7 python selenium web-scraping python-3.x python-requests

我试图进入此站点以检索我的银行帐户,首先我尝试使用硒,但仅填写了用户名(可能是因为它具有2种形式):

from selenium import webdriver

driver = webdriver.Firefox()
driver.get("https://www.santandertotta.pt/pt_PT/Particulares.html")
user = driver.find_element_by_name("usr")
user.send_keys("user")
pas = driver.find_element_by_name("claveConsultiva")
pas.send_keys("password")
login = driver.find_element_by_id("login_button").click()
Run Code Online (Sandbox Code Playgroud)

然后,我进入了rambo模式:)试图弄清楚为什么我不能填充密码空间,以及使用请求的表单的隐藏值是什么,这是代码:

url = "https://www.particulares.santandertotta.pt/pagina/indice/0,,276_1_2,00.html"     
user_agent = {"user-agent" : "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/..."}
session = requests.session()
r = session.get(url)
soup = BeautifulSoup(r.text, "html.parser")    
data = {t['name']:t.get('value') for t in soup.find_all('input', attrs={'type': 'hidden'})}
print(data)    
Run Code Online (Sandbox Code Playgroud)

但是刚收到一个空洞的字典。通过登录和抓取进入网站的最佳方法是什么?

And*_*son 6

您无法访问密码字段,因为它不在主页上。要处理密码字段,您必须单击登录按钮才能进入登录页面。您还需要切换到包含身份验证表单的 iframe

from selenium.webdriver.support.ui import WebDriverWait as wait
from selenium.webdriver.support import expected_conditions as EC

driver.get("https://www.santandertotta.pt/pt_PT/Particulares.html")
driver.find_element_by_xpath("//input[@title='Login de Particulares']").click()
wait(driver, 10).until(EC.frame_to_be_available_and_switch_to_it("ws"))
user = driver.find_element_by_name("identificacionUsuario")
user.send_keys("user")
pas = driver.find_element_by_name("claveConsultiva")
pas.send_keys("password")
pas.submit()
Run Code Online (Sandbox Code Playgroud)


Deb*_*anB 4

首先访问 url 后,您必须单击带有Loginhttps://www.santandertotta.pt/pt_PT/Particulares.html文本的元素,然后仅显示NomePassword字段,但要访问这些字段,您必须切换到带有as ws诱导WebDriverWait的框架。接下来要找到Nome元素,您必须再次引发WebDriverWait,如下所示:id

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

driver=webdriver.Firefox(executable_path=r'C:\Utility\BrowserDrivers\geckodriver.exe')
driver.get("https://www.santandertotta.pt/pt_PT/Particulares.html")
driver.find_element_by_xpath("//input[@class='ttAH_button03']").click()
WebDriverWait(driver, 20).until(EC.frame_to_be_available_and_switch_to_it((By.ID, "ws")))
WebDriverWait(driver, 20).until(EC.element_to_be_clickable((By.XPATH, "//input[@class='inputlong' and @id='identificacionUsuario']"))).send_keys("your_name")
driver.find_element_by_xpath("//input[@id='claveConsultiva' and @name='claveConsultiva']").send_keys("your_password")
driver.find_element_by_link_text("Entrar no NetBanco Particulares").click()
Run Code Online (Sandbox Code Playgroud)

在这里你可以找到关于iframe下处理#document的方法的相关讨论