Why does my JavaCC parser not parse tokens smaller than 2 characters?

2023-01-17 17:57 问答作者：

I'm working on a JavaCC parser that should parse BBcodes.

My Javacc source code: patebin.com (Junit test: here)

The source code kind off works, but it does not want to accept t开发者_如何学Pythonokens with a single character, only multi character strings are recognized.

It does parse this string:

"test[b]bold[/b]nothing[b]bold[/b]after"

But not:

"t[b]bold[/b]nothing[b]bold[/b]after"

I’m kind of lost here, any tips welcome here.

I figured it out. Downloaded JavaCC and compiled everything. With single character input, the output is:

String: t
Length: 1
Call:   parse
  Call:   body
  Return: body
Return: parse
Exception in thread "main" ParseException: Encountered " <LETTER> "t "" at line
1, column 1.
Was expecting one of:
    <EOF>
    "[b]" ...
    "[i]" ...
    "[u]" ...
    "[s]" ...
    "[url]" ...
    "[url=" ...
    "[img]" ...
    "[quote]" ...
    "[code]" ...
    "[color=" ...
    "[br]" ...
    <EOL> ...
    <TEXT> ...
    <TAGCHAR> ...

I noticed that it found a <LETTER> token but didn't recognize it as <TEXT>.

That's where the problem lies. You've declared everything as tokens and based on the ordering of the token definitions, the string "t" is a <LETTER> first, not <TEXT>. Move the <LETTER> token after <TEXT> and it should work now. You'll want to apply the same changes for <DIGIT>s and other such tokens.

继续阅读：bbcode javacc parsing

Why does my JavaCC parser not parse tokens smaller than 2 characters?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？