I am trying to process various texts by regex and NLTK of python -which is at http://www.nltk.org/book-. I am trying to create a random text generator and I am having a slight problem. Firstly, here i
I am trying to process various texts by regex and NLTK of python -which is at http://www.nltk.org/book-. I am trying to create a random text generator and I am having a hard time with a problem. First
How to config nltk data directory开发者_如何学JAVA from code?Just change items of nltk.data.path, it\'s a simple list.From the code, http://www.nltk.org/_modules/nltk/data.html:
I have: from __future__ import division import nltk, re, pprint f = open(\'/home/a/Desktop/Projects/FinnegansWake/JamesJoyce-FinnegansWake.txt\')
Can you show me a simple example using http://www开发者_Go百科.nltk.org/code to determine if a string about a happy or upset mood?NLTK cannot out of the box, but if you are looking for some related re
I am looking for a way given an English text count verb phrases in it in past, present and future tenses. For now I am usi开发者_StackOverflow社区ng NLTK, do a POS (Part-Of-Speech) tagging, and then c
I have written python scripts that use scrapy,nltk and simplejson in my project but i need to run them from java as my mentor wants to deploy them on a server and i have very less time to do this.I to
How do I detect what language a text is written in using NLTK? The examples I\'ve seen use nltk.detect开发者_Go百科, but when I\'ve installed it on my mac, I cannot find this package.Have you come ac
I\'m trying to use TF-IDF to sort documents into categories.I\'ve calculated the tf_idf for some documents, but now when I try to calculate the Cosine Similarity between two of these documents I get a
I\'ve got about 300k documents stored in a Postgres database that are tagged with topic categories (there are about 150 categories in total).I have another 150k documents that don\'t yet have categori