开发者

Newbie to RegEx

I have this sample string :

≪! [If Gte Mso 9]>≪Xml>  ≪Br /> ≪O:Office Document Set开发者_StackOverflow社区tings>  ≪Br /> ≪O:Allow Png/>  ≪Br /> ≪/O:Off...

And I would like to target specifically anything that begins in an "≪" and ends in a ">", and replace it with no-space "".

Been using Rubular, but I'm having a tricky time learning how to set this one up.

Any idaes?


result = subject.gsub(/≪[^>]*>/, '')

should do the trick.

[^>]* means: Match any number of characters except >.


The patter is simple as that:

≪[^>]*>


Just a helpful hint, I use Rubular to help with regex writing a debugging.


It sure looks like you're trying to parse XML with regular expressions, which is a very difficult and fragile way to extract the data you need from that document.

You might be better off parsing it and selecting the information you need using XPath or DOM.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜