How to have unstructured sections in a file parsed using Antlr

2023-03-19 07:38

I am creating a translator from my language into many (all?) other object-oriented languages. As part of the language, I want to support inserting sections of target-language code into the file. This is rather similar to how Antlr supports actions in rules.

So I would like to be able to have the sections begin and end with curlies like this:

{ ...target lang code... }

The issue is that { ... } can quite possibly show up in the target language code, so I need to be able to match pairs of curlies.

What I want to be able to do is something like this fragment that I've pulled into its own grammar:

grammar target_lang_block;

options
{
    output = AST;
}

entry   
    :   target_lang_block;

target_lang_block
    :   '{' target_lang_code* '}'
    ;


target_lang_code
    :   target_lang_block
    |   NO_CURLIES 
    ;       

WS
    :  (' ' | '\r' | '\t' | '\n')+ {$channel = HIDDEN;}
    ;

NO_CURLIES  
    :   ~('{'|'}')+
    ;

This grammar works by itself (at least to the extent I have tested it).

However, when I put these rules into the larger language, NO_CURLIES seems to eat everything and cause MismatchedTokenExceptions.

I'm not sure how to deal with this situation. It seems that what I want is to turn NO_CURLIES on and off depending on whether I'm inside target_lang_block, but that does not seem to be possible.

Is it possible? Is there another way?

Thanks

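The lexer tokenizes the input independently of the parser, so NO_CURLIES is tried everywhere, not just inside a block. Handle the target_lang_block inside the lexer instead, as a single recursive token: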

Target_lang_block
  :  '{' (~('{' | '}') | Target_lang_block)* '}'
  ;

And remove NO_CURLIES, of course.
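
To make the behaviour of that recursive lexer rule concrete, here is a minimal hand-written sketch in Python of the same matching logic. It is only an illustration, not code that Antlr generates: the function name match_block and the sample input are hypothetical.

```python
def match_block(text: str, pos: int = 0) -> int:
    """Return the index just past the block that starts at text[pos].

    Mirrors the lexer rule:  '{' ( ~('{'|'}') | Target_lang_block )* '}'
    (match_block is a hypothetical helper, not part of Antlr.)
    """
    if pos >= len(text) or text[pos] != "{":
        raise ValueError(f"expected '{{' at offset {pos}")
    pos += 1  # consume the opening '{'
    while pos < len(text):
        ch = text[pos]
        if ch == "}":
            return pos + 1  # consume the closing '}' and finish this block
        if ch == "{":
            pos = match_block(text, pos)  # nested block: recurse, as the rule does
        else:
            pos += 1  # any character that is not a curly
    raise ValueError("unterminated block: missing '}'")


# Hypothetical input: one block with nested curlies, followed by other text.
source = "{ if (x) { y(); } else { z(); } } trailing"
end = match_block(source)
block = source[:end]
print(block)
```

Like the lexer rule, this sketch counts every curly, so an unbalanced { or } inside a string literal or comment of the target code would still break the match.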
