开发者

Large scale naïve Bayes classifier with top-k output

I need a library for naïve Bayes large scale, with millions of training examples and +100k b开发者_JAVA百科inary features. It must be an online version (updatable after training). I also need top-k output, that is multiple classifications for a single instance. Accuracy is not very important.

The purpose is an automatic text categorization application.

Any suggestions for a good library is very appreciated.

EDIT: The library should preferably be in Java.


If a learning algorithm other than naïve Bayes is also acceptable, then check out Vowpal Wabbit (C++), which has the reputation of being one of the best scalable text classification algorithms (online stochastic gradient descent + LDA). I'm not sure if it does top-K output.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜