Solr Tokenizer Injection
As an example I have a text field that might contain the following string:
"d7199^^==^^81^^==^^A sentence or two!!"
I want to tokenize this data but have each token contain the first part of the string. So, I'd like the tokens to look like this for the example above:
"d7199^^==^^81^^==^^a"
"d7199^^==^开发者_如何转开发^81^^==^^sentence"
"d7199^^==^^81^^==^^or"
"d7199^^==^^81^^==^^two"
How would I go about doing this?
You can implement your own custom Tokenizer and add it to the Solr classpath. Then use it in your Solr schema.xml and solrconfig.xml
精彩评论