Closed. This question is opinion-based. It is not currently accepting answers. Want to improve this question? Update the question so it can be answered with facts and citations by editing
As a School assignment i\'m required to implement Naïve Bayes algorithm which i am intending to do in Java.
In the field of Data Mining,开发者_如何学编程 is there a specific sub-discipline called \'Similarity\'? If yes, what does it deal with. Any examples, links, references will be helpful.
I have a data set with multiple layers of annotation over the underlying text, such as part-of-tags, chunks from a shallow parser, name entities, and others from variousnatural language processing (NL
开发者_JAVA百科I have over 1000 surveys, many of which contains open-ended replies. I would like to be able to \'parse\' in all the words and get a ranking of the most used words (disregarding commo
EDIT: I the size of the wordlist is 10-20 times bigger than I wrote down. I simply forgot a zero. EDIT2: I will have a look into SVDLIBC and also see how to reduce a matrix to its dense version so t
Trying to install rattle on a windows server 2008 R2 64bit machine, using 64-bit R ver2.11, I got the following message:
I am trying to implement a naive bayseian approach to find the topic of a given document or stream of words. Is there are Naive Bayesian approach that i might be able to look up for this ?
We are looking at acquiring Data Mining software to primarily run predictive analysis processes. How does SQL Server Data Mining solution compares to other solutions like SPSS from IBM?
This question already has answers here: 开发者_StackOverflow A better similarity ranking algorithm for variable length strings