RegExp get string inside string

2023-01-12 08:53 问答作者：

Let presume we have something like this:

<div1>
    <h1>text1</h1>
    <h1>text2</h1>
</div1>
<div2>
    <h1>text3</h1>
</div2>

Using RegExp we need to get text1 and text2 but not text3.

How to do this?

Thanks in advance.

EDIT: This is just an example. The text I'm parsing could be just plain text. The main thing I want to accomplish is list all strings from a specific section of a document. I gave this HTML code for example as it perfectly resembles the thing I need to get.

(?siU)<h1>(.*)</h1> would parse all three strings, but how to get only first two?

EDIT2: Here is another rather dumb example. :)

Section1

This is a "very" nice sentence.
It has "just" a few words.

Section2

This is "only" an example.

The End

I need quoted words from first but not from second section.

Yet again, 开发者_运维问答(?siU)"(.*)" returns quoted words from whole text, and I need only those between words Section1 and Section2.

This is for the "Rainmeter" application, which apparently uses Perl regex syntax.

I'm sorry, but I can't explain it better. :)

For the general case of the two examples provided -- for use in Rainmeter regex -- you can use:

(?siU)<h1>(.*)</h1>(?=.+<div2>) for the first sample and

(?siU)"(.*)"(?=.+Section2) for the second.

Note that Rainmeter seems to escape things for you, but you might need to change " to \", above.

These both use Positive Lookahead but beware: both solutions will fail in the case of nested tags/structures or if there are mutiple Section1's and Section2's. Regex is not the best tool for this kind of parsing.

But maybe this is good enough for your current needs?

Use a DOM library and getElementsByTagName('div') and you'll get a nodeList back. You can reference the first item with ->item(0) and then getElementsByTagName('h1') using the div as a context node, grab the text with ->nodeValue property.

继续阅读：perl regex

RegExp get string inside string

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？