regex to trap img tag, both versions
I开发者_StackOverflow need to remove image tags from text, so both versions of the tag:
<img src="" ... ></img>
<img src="" ... />
In C# you can use following code:
string html = @"
<h1>
<img src="" ... >
</img>
<img></img>-bad
<img/>-bad
<img src="" ... />
</h1>";
string result = Regex.Replace(html, @"<img\s[^>]*>(?:\s*?</img>)?", "", RegexOptions.IgnoreCase);
But be aware that modifying HTML with Regex isn't good approach, its better to use Html Agility Pack or other HTML parsers.
This is not something you do with regular expressions. Use the xpath-query //img
instead.
精彩评论