Add a language in the Stanford parser

2023-04-09 02:41 问答作者：

I would like to use the Stanford parser in another language not already implemented.

I looked on the website but found nothing that could help开发者_开发问答 me with that.

I guess what I have to do is "just" create a new languagePCFG.ser but to do that?

Also, if anyone knows if French and Spanish are supposed to be released?

Several things are needed:

You need a treebank (set of hand-parsed trees) from which the probabilities used in the parser are calculated
You need language-specific files (like xLanguagePack, xTreebankParserParams, which specify things about the language, treebank encoding, and parsing options
You then train the parser on the treebank to produce the grammar file (see makeSerialized.csh in the distribution)
You might need a language-specific tokenizer to divide text into tokens
If you want Stanford Dependencies output, then there is also a rule-based layer that defines the dependencies

Starting in 2011, we did start distributing a French model with the Stanford Parser. And starting in 2015, we have begun distributing a Spanish model.

继续阅读：parsing stanford-nlp

Add a language in the Stanford parser

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？