I'm using a Python library called Guess Language: http://pypi.python.org/pypi/guess-language/0.1 "justwords" is a string with Unicode text. I stick it in the package, but it always returns English
I'm designing the architecture of a text parser. Example sentence: Content here, content here. The whole sentence is a... sentence, that's obvious. The, quick, etc. are words; , and . are punctuation marks. B
I'm writing a text tag parser and I'm currently using this recursive method to create tags of n words. Is there a way that it can be done non-recursively or at least be optimized? Assume that $this-
I am hand-tagging Twitter messages as Positive, Negative, or Neutral. I am trying to understand whether there is some logic one can use to identify what proportion of messages in the training set should
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Hello Stack Overflow people. I'd like some suggestions regarding the following problem. I am using Java.
How does something like Statistically Improbable Phrases work? According to Amazon:
Closed. This question needs to be more focused. It is not currently accepting answers.
I am currently parsing a bunch of mails and want to get words and other interesting tokens out of them (even with spelling errors or combinations of characters and letters, like "zeb
I'm using JAWS to access WordNet. Given a word, is there any way to detect if it is a proper noun? It looks like the synsets have pretty coarse lexical categories.