开发者

Regular Expression to check multi-dimensional array syntax

I am trying to write a regular expression to match ruby's list and hash syntaxes, e.g.:

[:a, "b", c, 3]

{:a => [
    1,2,3
]}

[1, {
    a => "t", :b => "w",
    :c => :o
}, 3]

The issue, of course, is with the nested/recursive nature of the things. I have a sneaking suspicion that such nested structures cannot actually be expressed as a regular expression as that 'language' is not regular. I expect the solution would have to involve subroutines and re开发者_如何学Pythoncursion, however I'm struggling to get my head around it. Can anyone confirm/deny my suspicions or offer a solution?

Any help appreciated.

Edit: as a note, I'm using PHP's preg_* methods mainly

Edit: as another note, I've created a routine, <ruby_value> to match keys and scalar values.

Edit: I should specify that I'm more interested in this "out of interest". I have already wrote a mini-parser for these things in PHP however I am interested to see if a not-necessarily-pure-regex solution exists.

E.g. equal nested brackets:

/^(?<paren_expr>
    \( (?: (?&paren_expr) | ) \)
)$/x

Which is a valid PHP regex and will match "(())", "()" and "((((((()))))))" but not "(" or "(()" etc.


Correct, it's not regular so you can't match it with 1 regular non-recursive expression.

You can, however, make a loop which replaces every match till there are no more matches available.

So...

[[[ foo ]]]

[[PLACEHOLDER_001]]

[PLACEHOLDER_002]

PLACEHOLDER_003

That way you can make it work without a problem. Can't say it's such a pretty solution though. A stack based solution would be better.


You are correct that nested structures are not a regular language, and thus cannot be expressed via pure regular expressions.

PCRE has the ability to specify recursive regexes, although I'm not sure whether PHP's implementation of them includes that support.

Really though, what you want to do is write a state machine yourself (with memory of nesting).

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜