OK, I have the following code to train the NER identifier from OpenNLP: FileReader fileReader = new FileReader("train.txt");
My application requires that I identify the head of a phrase (noun or verb). I have this kind of information in my Portuguese corpus:
I need to generate a vector of unigrams, i.e. a vector of all the unique words which appear in a specific text field that I have stored as part of a broader JSON object in MongoDB.
What are the statistical engines that yield better results than the OpenNLP suite of tools, if any? What I'm looking for is an engine that picks keywords from texts and provides stemmi
Here is an online programming contest we are planning to have. What could be possible approaches to solving the same?
I'm trying to train a parser for a new model using the OpenNLP tutorial http://sourceforge.net/apps/mediawiki/opennlp/index.php?title=Parser#Training . The only problem is that is requ
I have tried using OpenNLP Tools 1.5 from SourceForge for getting the Named Entities from a text. I did find a JWNL file in the OpenNLP download. Does that imply that OpenNLP in turn is using WordNet f
Is there a way to get the subject of a sentence using OpenNLP? I'm trying to identify the most important part of a user's sentence. Generally, users will be submitting sentences to our "engine" and
OpenNLP (in Java) is unable to identify dates of the format "January 10th, 2010" or "January 10, 2010". I replaced all ','s in the text with an empty string "" before using OpenN
Currently I am using the OpenNLP tools to PoS-tag German sentences, with the maxent model listed on their download site: