What does this regular expression mean?

2023-02-03 23:59 问答作者：

I have 开发者_开发百科got a question on regular expression. Though simple, I got some contracting answers from my professor. I just wanted to clarify it here.

(a+bc)* - What are the 4 smallest distinct pattern which this regex can give ?

I was expecting it to be epsilon (empty string), abc, aabc, aaabc.

But , his explanation was (a+bc) results in either a or bc. So his answer was epsilon (empty string), a, bc && aa(because of the star)

Which one is correct ? Is there any link which explains these kind of regex. I checked out wikipedia but they dont have these kind of things. Could you point me to some resource which actaully deals with the above kind ? Thanks in advance !

It sounds like your professor confused + for |.

For (a+bc)*, the answer would be ε, abc, aabc, aaabc as you said, while for (a|bc)*, the answer would be ε, a, aa, bc as he said.

You're correct, your professor is wrong (assuming there wasn't a misunderstanding between the two of you).

Note that there isn't one single regex language (there is a common definition for a Regular Language, but they aren't the same thing), though many share common features, including those used in your example. It's conceivable that someone could have regexes where '+' means alternation, but typically '+' is "one or more of the preceding" and '|' is for alternation.

As for a regular expression resource, check Regular-Expressions.info. It lists the features of various regular expression implementations. Each implementation often has their own page (such as perlre), which may have more or better information.

I think for regexes '+' and '|' means same thing in reg expression. Only the context make the difference especially the Kleene star.

(a)* +(bc)* means -- ε, a, aa, bc

but both (a+bc)* and (a|bc) means same thing as - ε, a, aa, abc etc (converting to NFA will clear the doubt. Here in NFA you have 2 alternativies either a or bc but the *means you can go back using an ε and choose whichever path you want.)

eg from wiki page of RE Examples:

a|b* denotes {ε, "a", "b", "bb", "bbb", ...} (a|b)* denotes the set of all strings with no symbols other than "a" and "b", including the empty string: {ε, "a", "b", "aa", "ab", "ba", "bb", "aaa", ...} ab*(c|ε) denotes the set of strings starting with "a", then zero or more "b"s and finally optionally a "c": {"a", "ac", "ab", "abc", "abb", "abbc", ...}

继续阅读：regex

What does this regular expression mean?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？