Web scraping in R: extracting names from `href` tags

Lau*_*ura 5 r web-scraping

Here is my code:

library(rvest)
library(XML)
library(xml2)
url_imb <- 'https://www.imdb.com/search/title/?count=100&release_date=2016,2016&title_type=feature'
web_page<-read_html(url_imb)

I want to extract the names of all the directors associated with the adv_li_dr_0 tag.

This is what I have tried so far. CSS selector:

directors_0<-html_text(html_nodes(web_page,"p a"))

XPath selector:

directors_0<-html_attr(html_nodes(web_page,xpath='//p[@class=""]//a'),"href")

Of course, this is incomplete. Can you help me? How do I extract the elements associated with the href in the tag?
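
One possible direction, sketched under assumptions: if the director links are the `<a>` elements whose `href` contains the string `adv_li_dr_0`, an attribute-substring selector can pick them out directly. The HTML snippet below is a hypothetical stand-in for a single search result (the names, IDs, and surrounding markup are invented for illustration, and the real page may be structured differently), so the selector may need adjusting against the live page.

```r
library(rvest)

# Hypothetical sample of one result entry; the real page markup may differ.
sample_html <- '
<div class="lister-item-content">
  <p class="">
    Director:
    <a href="/name/nm0000001/?ref_=adv_li_dr_0">Director One</a>
    <span class="ghost">|</span>
    Stars:
    <a href="/name/nm0000002/?ref_=adv_li_st_0">Star One</a>
    <a href="/name/nm0000003/?ref_=adv_li_st_1">Star Two</a>
  </p>
</div>'

page <- read_html(sample_html)

# CSS: keep only <a> nodes whose href contains "adv_li_dr_0"
director_nodes <- html_nodes(page, "a[href*='adv_li_dr_0']")

# XPath: the same filter expressed with contains()
director_nodes_xp <- html_nodes(page, xpath = "//a[contains(@href, 'adv_li_dr_0')]")

# Link text gives the name; html_attr() gives the href itself
directors      <- html_text(director_nodes, trim = TRUE)
director_links <- html_attr(director_nodes, "href")
```

Against the live page, `page` would be replaced by `web_page` from the question. Matching a shorter substring such as `adv_li_dr_` would presumably also catch second and later directors, if the page numbers them that way.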