NLP
Large scale naïve Bayes classifier with top-k output
I need a library for naïve Bayes large scale, with millions of training examples and +100k b开发者_JAVA百科inary features. It must be an online version (updatable after training). I also need top-k o[详细]
2023-03-19 10:20 分类:问答Most efficient way to generate a list of Unigrams from a text field in MongoDB
I need to generate a vector of u开发者_JAVA百科nigrams, i.e. a vector of all the unique words which appear in a specific text field that I have stored as part of a broader JSON object in MongoDB.[详细]
2023-03-19 09:07 分类:问答algorithm to find out if two sentence parts can be joined together
I\'m looking to build a开发者_JS百科n algorithm that can join together sentence parts. So, for example, it would know that[详细]
2023-03-19 03:45 分类:问答Pure statistical, or Natural Language Processing engine?
What are the statistical engines that yield bette开发者_JAVA技巧r results than the OpenNLP suite of tools, if any? What I\'m looking for is an engine that picks keywords from texts and provides stemmi[详细]
2023-03-18 20:40 分类:问答What are some good ways of estimating 'approximate' semantic similarity between sentences?
I have been looking at the nlp tag on SO for the past couple of hours and am confident I did not miss anything but if I did, please do point me to the question.[详细]
2023-03-18 12:27 分类:问答The relationship between latent Dirichlet allocation and documents clustering
I would like to clarify the relationship between latent Dirichlet allocation (LDA) and the generic task of document clustering.[详细]
2023-03-18 12:05 分类:问答Ease of use: Stanford CoreNLP vs. OpenNLP [closed]
Closed. This question is opinion-based. It is not currently accepting answers. Want to improve 开发者_运维技巧this question? Update the question so it can be answered with facts and citation[详细]
2023-03-18 07:46 分类:问答How to recognise a particular user in a long multi-user internet chat log?
Here is an online programming contest we are planning to have. What could be possible approaches to solving the same?[详细]
2023-03-17 15:27 分类:问答Are there published generative grammars for natural languages?
I have some ideas to do with natural language processing. I will need some grammars of the S -> NP VP[详细]
2023-03-17 04:19 分类:问答Stanford Core NLP - understanding coreference resolution
I\'m having some trouble understanding the changes made to the coref resolver in the last version of the Stanford NLP tools.[详细]
2023-03-16 23:34 分类:问答