Remove HTML from a string

Mic*_*rdy 5 c# windows-phone-7

I am trying to strip the HTML markup from my RSS feed. I can't figure out how to change the following code so that it takes the HTML out.

var rssFeed = XElement.Parse(e.Result);

var currentFeed = this.DataContext as app.ViewModels.FeedViewModel;
var items = from item in rssFeed.Descendants("item")                            
            select new ATP_Tennis_App.ViewModels.FeedItemViewModel()
            {

                Title = item.Element("title").Value,
                DatePublished = DateTime.Parse(item.Element("pubDate").Value),
                Url = item.Element("link").Value,
                Description = item.Element("description").Value
            };

foreach (var item in items)
    currentFeed.Items.Add(item);

Ped*_*mas 19

Just use the following code:

var withHtml = "<p>hello <b>there</b></p>";
var withoutHtml = Regex.Replace(withHtml, "<.+?>", string.Empty);

This strips the HTML tags and leaves only the text, so you get "hello there".

So you can copy and use this function:

string RemoveHtmlTags(string html) {
    return Regex.Replace(html, "<.+?>", string.Empty);
}
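One caveat that may matter for feed descriptions: in .NET regular expressions `.` does not match a newline by default, so a tag that wraps across lines would slip past `<.+?>`, and entities such as `&amp;` are left encoded. The variant below is only a minimal sketch. The name `HtmlText.Strip` is hypothetical, and it assumes `System.Net.WebUtility.HtmlDecode` is available on your platform (on Windows Phone 7, `System.Net.HttpUtility.HtmlDecode` may be the closest equivalent). Like any regex approach it is not a real HTML parser, so malformed markup can still get through.

```csharp
using System;
using System.Net;
using System.Text.RegularExpressions;

static class HtmlText
{
    // Hypothetical helper: strips tags, then decodes entities such as &amp;.
    public static string Strip(string html)
    {
        if (string.IsNullOrEmpty(html))
            return string.Empty;

        // Singleline lets '.' match '\n', so tags broken across lines are removed too.
        string noTags = Regex.Replace(html, "<.+?>", string.Empty, RegexOptions.Singleline);

        // Turn entities like &amp; back into plain characters and trim the edges.
        return WebUtility.HtmlDecode(noTags).Trim();
    }
}

static class Program
{
    static void Main()
    {
        string sample = "<p>hello <b\nclass=\"x\">there</b> &amp; welcome</p>";
        Console.WriteLine(HtmlText.Strip(sample)); // hello there & welcome
    }
}
```

If you only need the tags removed and want to keep the behaviour of the one-line function above, you can drop the `HtmlDecode` call and the `Trim()`.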

Your code would then look like this:

var rssFeed = XElement.Parse(e.Result);
var currentFeed = this.DataContext as app.ViewModels.FeedViewModel;
var items = from item in rssFeed.Descendants("item")                            
            select new ATP_Tennis_App.ViewModels.FeedItemViewModel()
            {

                Title = RemoveHtmlTags(item.Element("title").Value),
                DatePublished = DateTime.Parse(item.Element("pubDate").Value),
                Url = item.Element("link").Value,
                Description = RemoveHtmlTags(item.Element("description").Value)
            };
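To try the whole thing outside the app, here is a self-contained sketch of the same query. The `FeedItem` class and the sample XML are hypothetical stand-ins for illustration; in the real app the view model is `FeedItemViewModel` and the XML string comes from `e.Result`.

```csharp
using System;
using System.Linq;
using System.Text.RegularExpressions;
using System.Xml.Linq;

// Hypothetical stand-in for the app's FeedItemViewModel.
class FeedItem
{
    public string Title { get; set; }
    public string Description { get; set; }
}

static class Program
{
    static string RemoveHtmlTags(string html)
    {
        return Regex.Replace(html, "<.+?>", string.Empty);
    }

    static void Main()
    {
        // Made-up feed; in the app this string would be e.Result.
        string xml =
            "<rss><channel><item>" +
            "<title>Match &lt;b&gt;report&lt;/b&gt;</title>" +
            "<description><![CDATA[<p>Final in <i>three</i> sets</p>]]></description>" +
            "</item></channel></rss>";

        var rssFeed = XElement.Parse(xml);

        var items = (from item in rssFeed.Descendants("item")
                     select new FeedItem
                     {
                         // .Value is already XML-unescaped, so it holds literal <b> tags here.
                         Title = RemoveHtmlTags(item.Element("title").Value),
                         Description = RemoveHtmlTags(item.Element("description").Value)
                     }).ToList();

        foreach (var item in items)
            Console.WriteLine(item.Title + " | " + item.Description);
        // Match report | Final in three sets
    }
}
```

Because `XElement.Value` returns the unescaped text (including CDATA content), the tags are stripped whether the feed escapes its markup or wraps it in a CDATA section.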