Implementing Lucene without an Analyzer for the used language of content?
Does it make sense?
For my client it's to开发者_如何学编程o expensive to develop the Analyzer for Croatian language, I did not find any existing ones...so my question is...do I tell them to drop the idea of Lucene for Croatian content?
Thanks!
Sematext's Morphological Analyzer claims to support Croatian.
Robert Muir, Chris Male and others built a Lucene Morphological Analyzer based on Hunspell. The code is here. Croatian is one of the supported languages in the list. There may be licensing issues as hunspell is GPL, I think, but it is well worth checking.
精彩评论