Regexp to remove an entire paragraph based on its content?

2023-02-11 21:59 Author:

Hey guys, I'm a regexp noob. Is it possible with preg_replace to remove an entire paragraph tag?

<p><div class="vidwrapper"> lot of content with other divs etc. </div></p>

The paragraph should only be removed if the div that follows it has the class vidwrapper.

Is that even possible? Any idea what this regexp would look like? Thank you for your help.

If it's a fixed occurrence, then the following might work:

$html = preg_replace('#<p>[^<]*<div[^>]+class="vidwrapper"[^>]*>.*?</p>#is', "", $html);

Matching nested HTML would normally require a recursive regex, which is why something like phpQuery or QueryPath is often simpler:

$html = pq($html)->find("p div.vidwrapper")->parent()->remove()->html();

It's a bad idea to do this using a regex, unless you know that there will be no paragraph (or anything that might superficially be interpreted as a paragraph) inside of the vidwrapper.

If you don't, writing a regex for something like this will be very hard:

<p><div class="vidwrapper"> Hello there. <p>Wee.</p> Yoink. </div></p>

<p><div class="vidwrapper"> Hello there. <!-- <p>Wee.</p> --> Yoink. </div></p>
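
To make the problem concrete, here is a small sketch that runs the pattern from the answer above on a flat input and on the first nested example. The variable names and the flat sample string are made up for illustration; only the pattern and the nested string come from this thread.

```php
<?php
// The pattern suggested in the earlier answer.
$pattern = '#<p>[^<]*<div[^>]+class="vidwrapper"[^>]*>.*?</p>#is';

// Flat case (hypothetical sample): no <p> inside the wrapper,
// so the whole wrapping paragraph is removed.
$flat = '<p>intro</p><p><div class="vidwrapper"> video </div></p><p>outro</p>';
$flatResult = preg_replace($pattern, '', $flat);
// $flatResult is '<p>intro</p><p>outro</p>'

// Nested case: the lazy .*? stops at the first </p> it finds,
// which is the inner one, so part of the wrapper is left behind.
$nested = '<p><div class="vidwrapper"> Hello there. <p>Wee.</p> Yoink. </div></p>';
$nestedResult = preg_replace($pattern, '', $nested);
// $nestedResult is ' Yoink. </div></p>'
```

Switching to a greedy .* does not help either, because it would then swallow everything up to the last </p> in the whole document.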

An easier (and more robust) way would probably be to parse the HTML with an HTML parser, and do a search on the DOM tree instead.
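
As a rough sketch of that approach, assuming PHP's bundled DOM extension (DOMDocument and DOMXPath) is available: the function name removeVidwrapperParagraphs and the fragment-root wrapper id are hypothetical, chosen only for this example. One caveat: HTML does not allow a div inside a p, so the parser may close the paragraph early and turn the div into a sibling. The sketch therefore removes the parent p when there is one, and otherwise just the div.

```php
<?php
// Hypothetical helper: removes every div.vidwrapper, together with its
// wrapping <p> if the parser kept the div nested inside one.
function removeVidwrapperParagraphs(string $html): string
{
    $doc = new DOMDocument();

    // Sloppy markup triggers libxml warnings; collect them silently.
    $previous = libxml_use_internal_errors(true);
    // Wrap the fragment in a known container so it can be extracted again.
    $doc->loadHTML('<div id="fragment-root">' . $html . '</div>');
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    // Match "vidwrapper" as a whole class token, not as a substring.
    $query = '//div[contains(concat(" ", normalize-space(@class), " "), " vidwrapper ")]';

    // Copy the result into an array before removing nodes from the tree.
    foreach (iterator_to_array($xpath->query($query)) as $div) {
        $parent = $div->parentNode;
        $target = ($parent instanceof DOMElement && $parent->tagName === 'p')
            ? $parent
            : $div;
        if ($target->parentNode !== null) {
            $target->parentNode->removeChild($target);
        }
    }

    // Serialize only the children of the container, not the html/body shell.
    $root = $xpath->query('//div[@id="fragment-root"]')->item(0);
    $out = '';
    foreach ($root->childNodes as $child) {
        $out .= $doc->saveHTML($child);
    }
    return $out;
}

$cleaned = removeVidwrapperParagraphs(
    '<p>intro</p><p><div class="vidwrapper"> Hello there. <p>Wee.</p> Yoink. </div></p><p>outro</p>'
);
```

Depending on how the parser repaired the invalid nesting, an empty paragraph may be left where the wrapper used to be; a second pass that drops empty p elements would clean that up.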
