how to perform word clustering using k-means algorithm in java

2023-01-16 00:42 问答作者：

Please help me how to perform word clustering using k-means algorithm in java. From the set of documents, I get word and its frequency count. Then i dont know how to start for clustering.I already search google. But no idea. 开发者_开发百科Please tell me steps to perform word clustering. Very needful now. Thanks in advance.

"Programming Collective Intelligence" by Toby Segaran has a wonderful chapter on how to do this. The examples are in Python, but they should be easy to port to Java.

In clustering most important thing is to build a method, which check how to things (for example) are "close" together. E.g. is you are interested in string with same lang, this could be like:

int calculateDistance(String s1, String s2) {
     return Math.abs(s1.length() - s2.length());
}

Then I'm not so sure, but in can be like this: 1. choose (can be randomly) first k string, 2. iterate for all string, and relate them to their "nearest" string.

Then can be something, like choosing from every "cluster" middle of it, and start it again. I don't remember it for 100% but I thing it is good way to start.

And remember, that most important is the method calculateDistance()!

how to perform word clustering using k-means algorithm in java

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？