开发者

Datasets for Apache Mahout

I am looking for datasets that can be used for implemen开发者_如何学Goting recommendation system usecase of Apache Mahout. I know of only MovieLens Data Sets from GroupLens Research group.

Anyone knows any other datasets that can be used for recommendation system implementation? I am particularly interested in item-based data sets though other datasets are most welcome.


this is Sebastian from Mahout.

There is a dataset from a czech dating website available that might be of interest to you: http://www.occamslab.com/petricek/data/

Btw the term item-based refers to a special collaborative filtering approach not to the dataset itself, which is usually in the common form of user-item-rating tripels that most collaborative filtering approaches work with.

We would love to hear from your experimentation results and experiences (if you wanna share them) on our user mailinglist at user@mahout.apache.org


While searching for data sets, I found few sites that list publicly available data sets which can used for data mining. Some of these can be used for Mahout too.

Bixo Labs

UCI Datasets

KDnuggets


You can look at iPinYou RTB Bidding Data Set Quora : http://qr.ae/OrqgM http://contest.ipinyou.com/data-release.html

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜