G.T*_*.T. 2 r web-scraping rvest
这是我的代码:
library(rvest)
#login
url <- "https://secure.usnews.com/member/login?ref=https%3A%2F%2Fpremium.usnews.com%2Fbest-graduate-schools%2Ftop-medical-schools%2Fresearch-rankings"
session <- html_session(url)
form <- html_form(read_html(url))[[1]]
filled_form <- set_values(form,
username = "notmyrealemail",
password = "notmyrealpassword")
submit_form(session, filled_form)
Run Code Online (Sandbox Code Playgroud)
这是我得到的输出后submit_form:
<session> https://premium.usnews.com/best-graduate-schools/top-medical-schools/research-rankings
Status: 200
Type: text/html; charset=utf-8
Size: 286846
Run Code Online (Sandbox Code Playgroud)
我认为这意味着它有效吗?如果是这样,我如何read_html在登录后出现的页面上?
Nvm,通过使用它来工作 url <- jump_to(session, "https://premium.usnews.com/best-graduate-schools/top-medical-schools/research-rankings")
| 归档时间: |
|
| 查看次数: |
3496 次 |
| 最近记录: |