修剪html文本c＃的一部分，但不修剪html標籤

public static string StripHTML(string HTMLText) 
{ 
    string ret = HTMLText.Replace("<br>", "\n").Replace("<br />", "\n"); 
    Regex reg = new Regex("<[^>]+>", RegexOptions.IgnoreCase); 
    return HttpUtility.HtmlDecode(reg.Replace(ret, "")); 
}

如果您喜歡的東西下面的代碼測試此代碼..

string longHtmlText = "<html>This is a &quot;<b>long &amp; bolded</b> <a href=\"http://en.wikipedia.org/wiki/HTML\">HTML</a> text</html>&quot;"; 
string excerpt = StripHTML(longHtmlText); 
excerpt = excerpt.Substring(0, 30) + "(..)";

..the結果將是..

這是一個「長&加粗的HTML（..）

..應該回答你的問題。

請記住，正如Albireo注意到的，Regex不是HTML解析...但如果您需要快速HTML剝離和修剪（對於簡單的HTML文本），無需外部組件，此代碼可能已足夠。

來源

2011-08-22 15:56:08 Fulvio

修剪html文本c＃的一部分，但不修剪html標籤

回答

相關問題