min*_*tuz 3 html c# html-parsing html-agility-pack
我正在尝试从YouTube的搜索结果页面中检索所有视频ID.
每个结果都有这个代码:
<a href="/watch?v=aYIC-ebAD3o" class="ux-thumb-wrap result-item-thumb">
<span class="video-thumb ux-thumb-128 ">
<span class="clip">
<img onload="tn_load(5)" alt="Thumbnail" src="//i2.ytimg.com/vi/aYIC-ebAD3o/default.jpg" >
</span>
</span>
<span class="video-time">4:16</span>
<span dir="ltr" class="yt-uix-button-group addto-container short video-actions" data-video-ids="aYIC-ebAD3o" data-feature="thumbnail">
<button type="button" class="start master-sprite yt-uix-button yt-uix-button-short yt-uix-tooltip" onclick=";return false;" title="" data-button-action="yt.www.addtomenu.add" role="button" aria-pressed="false">
<img class="yt-uix-button-icon yt-uix-button-icon-addto" src="//s.ytimg.com/yt/img/pixel-vfl3z5WfW.gif" alt="">
<span class="yt-uix-button-content">
<span class="addto-label">Add to</span>
</span>
</button>
<button type="button" class="end yt-uix-button yt-uix-button-short yt-uix-tooltip yt-uix-button-empty" onclick=";return false;" title="" data-button-menu-id="shared-addto-menu" data-button-action="yt.www.addtomenu.load" role="button" aria-pressed="false">
<img class="yt-uix-button-arrow" src="//s.ytimg.com/yt/img/pixel-vfl3z5WfW.gif" alt="">
</button>
</span>
<span class="video-in-quicklist">Added to queue </span>
</a>
<div class="result-item-main-content">
Run Code Online (Sandbox Code Playgroud)
我正在尝试解析"data-video-ids"类数据.什么是使用HTML Agility Pack执行此操作的最佳方法?
我试过这个:
foreach(HtmlNode node in doc.DocumentNode.
SelectNodes("//span[@class='data-video-ids']"))
{
string text = node.InnerText;
lblTest2.Text += text + Environment.NewLine;
}
Run Code Online (Sandbox Code Playgroud)
有任何想法吗?
如果您使用YouTube的API之一,我认为您在longrun中会更好.
当没有API时,我只会使用Web请求和HtmlAgilityPack作为最后的手段.造成这种情况的主要原因是,如果YouTube更改了网页,则会破坏您的代码.开放API通常适合向后兼容,因此您的应用程序在大多数情况下应无限期地工作.
以下是Youtube API的代码示例:
YouTubeQuery query = new YouTubeQuery(YouTubeQuery.DefaultVideoUri);
//order results by the number of views (most viewed first)
query.OrderBy = "viewCount";
// search for puppies and include restricted content in the search results
// query.SafeSearch could also be set to YouTubeQuery.SafeSearchValues.Moderate
query.Query = "puppy";
query.SafeSearch = YouTubeQuery.SafeSearchValues.None;
Feed<Video> videoFeed = request.Get<Video>(query);
printVideoFeed(videoFeed);
Run Code Online (Sandbox Code Playgroud)
看起来很简单吧?
| 归档时间: |
|
| 查看次数: |
2438 次 |
| 最近记录: |