Regular expression to match anything but 2 consecutive curly braces

2023-02-01 04:27 问答作者：

What would be the regular expression to match anything but 2 consecutive curly braces ({) ?

An example string:

{{some text}} string I want {{another set {{and inner}} }}

I want to get only string i want.

Using stack to do the stuff had crossed my mind, but I wanted to know if this can be done开发者_开发百科 using regex.

I'm using PHP's PCRE

Thanks in advance

Use a lookahead assertion (?!{{|}}) to verify that you don't have a nested set of braces inside of your outer set.

{{((?!{{|}}).)*}}

Test program

<?php
$string = '{{lot {{of}} characters}}';

for (;;)
{
    var_dump($string);
    $replacement = preg_replace('/{{((?!{{|}}).)*}}/', '', $string);

    if ($string == $replacement)
        break;

    $string = $replacement;
}

Output

string(25) "{{lot {{of}} characters}}"
string(19) "{{lot  characters}}"
string(0) ""

It appears to handle various edge cases reasonably, as well:

# Unbalanced braces.
string(23) "{{lot {{of}} characters"
string(17) "{{lot  characters"

string(23) "lot {{of}} characters}}"
string(17) "lot  characters}}"

# Multiple sets of braces.
string(25) "{{lot }}of{{ characters}}"
string(2) "of"

# Lone curlies.
string(41) "{{lot {{of {single curly} }} characters}}"
string(19) "{{lot  characters}}"
string(0) ""

If you need to do something more complicated with the contents, such as processing the contents or the variables, then you can use a recursive regexp, making use of the (?R) operator.

$data = "{{abcde{{fg{{hi}}jk}}lm}}";
$regexp = "#\{\{((?:[^(\{\{)(\}\})]+|(?R))+)\}\}#";
$count = 0;

function revMatch($matches) {
  global $regexp, $count;

  if (is_array($matches)) {
    // Match detected, process for nested components
    $subData = preg_replace_callback($regexp, 'revMatch', $matches[1]);
  } else {
    // No match, leave text alone
    $subData = $matches;
  }

  // This numbers each match, to demonstrate call order
  return "(" . $count++ . ":<" . $subData . ">)";
}

echo preg_replace_callback($regexp, 'revMatch', $data);

This converts: {{abcde{{fg{{hi}}jk}}lm}} to (2:<abcde(1:<fg(0:<hi>)jk>)lm>)

A bit of explanation on the regexp: #\{\{((?:[^(\{\{)(\}\})]+|(?R))+)\}\}#

The double braces at the front and back match any target component, the contents of the braces are to be one or more of the two defined options:

a string with no double braces [^(\{\{)(\}\})]+
the whole regexp repeated. The (?:) bracket is a non-capturing group.

NB. The #s are the pattern delimiters, I thought extra slashes would decrease readability further.

继续阅读：pcre php regex

Regular expression to match anything but 2 consecutive curly braces

Test program

Output

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？

Test program

Output

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集 河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？