相关疑难解决方法(0)

寻找C#HTML解析器

可能重复:
在C#中解析html的最佳方法是什么？

我想提取HTML文档的结构 - 所以标签比内容更重要.理想情况下,它也能够在一定程度上合理地处理格式错误的HTML.

有人知道一个可靠而有效的解析器吗？

.net html c# parsing

ben*_*ual

2017 05-23

112
推荐指数

0
解决办法

6万
查看次数

Jsoup喜欢C++/C的解析器？

有没有像C++/C一样的开源Jsoup/jQuery解析器/选择器引擎？

c c++ jquery jsoup

new*_*bie

lucky-day

12
推荐指数

1
解决办法

3364
查看次数

使用正则表达式在多个HTML标记之间获取文本

使用正则表达式,我希望能够在多个DIV标记之间获取文本.例如,以下内容:

<div>first html tag</div>
<div>another tag</div>

Run Code Online (Sandbox Code Playgroud)

输出:

first html tag
another tag

Run Code Online (Sandbox Code Playgroud)

我使用的正则表达式模式只匹配我的最后一个div标签并错过了第一个.码:

    static void Main(string[] args)
    {
        string input = "<div>This is a test</div><div class=\"something\">This is ANOTHER test</div>";
        string pattern = "(<div.*>)(.*)(<\\/div>)";

        MatchCollection matches = Regex.Matches(input, pattern);
        Console.WriteLine("Matches found: {0}", matches.Count);

        if (matches.Count > 0)
            foreach (Match m in matches)
                Console.WriteLine("Inner DIV: {0}", m.Groups[2]);

        Console.ReadLine();
    }

Run Code Online (Sandbox Code Playgroud)

输出:

匹配发现:1

内部DIV:这是另一个测试

html c# regex

ben*_*ben

lucky-day

8
推荐指数

2
解决办法

6万
查看次数