Back to General and Gameplay Programming

[.net] regex removing HTML tags with Match

General and Gameplay Programming Programming

Started by Dragon_Strike December 18, 2008 08:04 PM

3 comments, last by Koori 15 years, 4 months ago

Dragon_Strike

264

Author

December 18, 2008 08:04 PM

i would like to create a regex that matches everything in a string except characters within "<" ">"; tags. i know this would be quite simple with regex.Replace("<[^>]*>") but i need it to be with regex.Match... is there anyway to do this? something like Match(/*negate*/"<[^>]*>") EDIT: i would also like to point out that tehre isnt always whitespace between the tags and the text

alex_myrpg

351

December 19, 2008 10:50 AM

I can't be sure without knowing what you're trying to accomplish in the end - but I would recommend avoiding regex (which is really overkill here) and using text parsing functions instead. You should be able to write an enumerator function (using yield return, etc.) to do the job in less than 5 lines. You'll also have something much more readable if you can avoid regexes.

About Me | Blog | Website

Oluseyi

2,123

December 19, 2008 01:09 PM

Quote:Original post by Dragon_Strike
i know this would be quite simple with regex.Replace("<[^>]*>")

but i need it to be with regex.Match...

is there anyway to do this?

No. You can't remove text with match. All you can do is determine if your input text matches the specified pattern, and optionally capture subgroups. You'll need additional processing to yield the equivalent of replace - match all the desired groups and then concatenate them.

Spodi

642

December 19, 2008 06:50 PM

Like Fingon said, can you elaborate a bit on what you want? I'm a bit unclear.

NetGore - Open source multiplayer RPG engine

Koori

122

December 23, 2008 05:48 PM

If I understand correctly You want to remove HTML tags from text.
I usually do it with

stringWithHtml = Regex.Replace(stringWithHtml, @"<(.|\n)*?>", string.Empty);

Matching is for finding parts of text. Replacing is for manipulation of matching fragments.

[.net] regex removing HTML tags with Match

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

[.net] regex removing HTML tags with Match

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Reticulating splines