开发者

Hadoop jobs that read from Cassandra seem to run only on master (slaves completely idle)

But when I run the hadoop included wordcount example (dfs version), I see the load getting distributed to all the slaves. What is special about C开发者_运维问答olumnFamilyInputFormat in Cassandra? Do I need to set any other parameters in hadoop config?

Thanks, Vivek


I would recommend starting with Brisk (http://www.datastax.com/brisk) instead of trying to set up hadoop-on-Cassandra from scratch, since fairly deep knowledge of both systems is required if you take that approach.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜