I have a list of English words (approx. 10,000) and I'd like to sort them by their usage as they occur in literature, newspapers, blogs, etc. Can I sort them in Python or other languag
I have text stored in a Python string. I want to identify keywords in that text.
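One simple way to approach this is to count word frequencies after dropping common function words. The sketch below is only an illustration under stated assumptions: `top_keywords`, the tiny `STOPWORDS` set, and the sample sentence are all hypothetical, and raw frequency is a crude stand-in for more careful keyword extraction.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real list would be far longer).
STOPWORDS = {"a", "an", "and", "the", "is", "in", "of", "to", "that", "it", "on", "for"}

def top_keywords(text, n=3):
    """Return the n most frequent non-stopword tokens in text."""
    # Lowercase the text and keep runs of letters/apostrophes as tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS)
    return [word for word, _ in counts.most_common(n)]

sample = ("The parser reads the grammar and the parser builds a tree "
          "for the grammar of the parser.")
keywords = top_keywords(sample, n=2)
print(keywords)
```

Swapping in a fuller stopword list or a weighting scheme such as TF-IDF would likely give better results on real text.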
I'm a Python noob, so bear with me. I'm trying to work with the NLTK library, and in particular the 'generate' function. It looks like from the documentation this function simply prints its resul
import nltk
from nltk.book import *
from nltk.corpus import brown
corpus_text = brown.words()
word_freq = FreqDist(corpus_text)
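As a rough sketch of how a frequency table like this can be used to rank a word list by usage, the example below uses `collections.Counter` (which NLTK's `FreqDist` subclasses) so that it runs without downloading the Brown corpus. The function `sort_by_usage` and the toy token list are hypothetical; in practice something like `brown.words()` could be passed as `corpus_tokens`.

```python
from collections import Counter

def sort_by_usage(words, corpus_tokens):
    """Sort words from most to least frequent in corpus_tokens."""
    # Case-insensitive counts over the reference corpus.
    freq = Counter(t.lower() for t in corpus_tokens)
    # Words missing from the corpus count as 0 and sort last.
    return sorted(words, key=lambda w: freq[w.lower()], reverse=True)

# Toy stand-in for a real corpus (an assumption for illustration only).
corpus_tokens = "the cat sat on the mat and the dog sat too".split()
ranked = sort_by_usage(["dog", "sat", "the", "zebra"], corpus_tokens)
print(ranked)
```

The ranking is only as representative as the reference corpus, so the choice of corpus matters more than the sorting code.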
I'm using NLTK to search for n-grams in a corpus but it's taking a very long time in some cases. I've noticed calculating n-grams isn't an uncommon feature in other packages (apparently Haystack h
I need to train a naive Bayes classifier on two corpora consisting of approx. 15,000 tokens each. I'm using a basic bag-of-words feature extractor with binary labeling and I'm wondering if NLTK is
I'm trying to get NLTK working with IronPython 2.7.1. The installation works so far, but I tried some sample code and the expat module is missing. Any hint how to get that up and run
I just read a fascinating article about how MALLET could be used for topic modelling, but I couldn't find anything online comparing MALLET to NLTK, which I've already had some experience with.
Which similarity function in nltk.corpus.wordnet is appropriate for finding the similarity of two words? path_similarity()?
This is probably a very trivial question. I am trying to use the Stanford POS tagger through NLTK, given here. The problem is that my NLTK lib doesn't contain the stanford module. So I copied the same in