nltk
Sort words by their usage
I have a list of english words (approx 10000) and I\'d like to sort them by their usage as they occur in literature, newspaper, blogs etc. Can I sort them in Py开发者_如何学Pythonthon or other languag[详细]
2023-04-13 03:19 分类:问答Identifying important words and phrases in text
I have text stored in 开发者_开发问答a python string. What I Want To identify key words in that text.[详细]
2023-04-11 00:25 分类:问答NLTK "generate" function: How to get back returned text?
I\'m a Python noob, so bear with me. I\'m trying to work with the NLTK library, and in particular the \'generate\' function. It looks like from the documentation this function simply prints its resul[详细]
2023-04-10 08:16 分类:问答Need to create a histogram in Python for a corpus
import nltk from nltk.book import * from nltk.corpus import brown corpus_text = brown.words() word_freq = FreqDist(corpus_text)[详细]
2023-04-09 00:05 分类:问答Fast n-gram calculation
I\'m using NLTK to search for n-grams in a corpus but it\'s taking a very long time in some cases. I\'ve noticed calculating n-grams isn\'t an uncommon feature in other packages (apparently Haystack h[详细]
2023-04-08 22:48 分类:问答Is NLTK's naive Bayes Classifier suitable for commercial applications?
I need to train a naive Bayes classifier on two corpuses consisting of approx. 15,000 tokens each. I\'m using a basic bag of words feature extractor with binary labeling and I\'m wondering if NLTK is[详细]
2023-04-08 12:41 分类:问答expat for IronPython 2.7.1 beta 2
I\'m trying t开发者_开发知识库o get NLTK working with IronPython 2.7.1. The installation works so far, but I tried some sample code and the expat module is missing. Any hint how to get that up and run[详细]
2023-04-06 10:51 分类:问答Topic Modelling in MALLET vs NLTK
I just read a fascinating article about how MALLET could be used for topic modelling, but I couldn\'t find anything online comparing MALLET to NLTK, which I\'ve already had some experience with.[详细]
2023-04-06 03:35 分类:问答which similarity function of nltk.corpus.wordnet is Appropriate for find similarity of two words?
which similarity function in nltk.corpus.wordnet is Appropriate for find similarity of two words? path_similarity()?[详细]
2023-04-04 07:07 分类:问答trouble importing stanford pos tagger into nltk
This is probably a very trivial question. I am trying to use the stanford pos tagger through nltk given here The problem is that my nltk lib doesnt contain the stanford module. So I copied the same in[详细]
2023-04-03 05:54 分类:问答