C#HtmlAgilityPack解析<ul>

use*_*096 3 html c# parsing html-agility-pack

我想解析以下HTML.

我现在拥有的是

var node = document.DocumentNode.SelectSingleNode("//div[@class='wrapper']");
Run Code Online (Sandbox Code Playgroud)

html是

<div class="wrapper">
    <ul>
                <li data="334040566050326217">
                    <span>test1</span>
                </li>
                <li data="334040566050326447">
                    <span>test2</span>
                </li>
    </ul>
Run Code Online (Sandbox Code Playgroud)

我需要从标签中获取数字li data和值span.任何帮助赞赏.

Ich*_*lay 6

这样的事情可能适合您的需求.

//Assumes your document is loaded into a variable named 'document'

List<string> dataAttribute = new List<string>(); //This will contain the long # in the data attribute
List<string> spanText = new List<string>();      //This will contain the text between the <span> tags
HtmlNodeCollection nodeCollection = document.DocumentNode.SelectNodes("//div[@class='wrapper']//li");

foreach (HtmlNode node in nodeCollection)
{
    dataAttribute.Add(node.GetAttributeValue("data", "null"));
    spanText.Add(node.SelectSingleNode("span").InnerText);
}
Run Code Online (Sandbox Code Playgroud)