开发者

Variants of Hadoop

A project of mine is to compare different varian开发者_C百科ts of Hadoop, it is said that there are many of them out there, but googling didn't work well for me :(

Does anyone know any different variants of Hadoop? The only one I found was Haloop.


I think the more generic term is "map reduce":

http://www.google.com/search?gcx=c&sourceid=chrome&ie=UTF-8&q=map+reduce&safe=active


Not exactly sure what you mean by different variants for Hadoop.

But, there are a lot of companies providing commercial support or providing their own versions of Hadoop (open-source and proprietary). You can find more details here.

For ex., MapR has it's own proprietary implementation of Hadoop, but they claim it's compatible with Apache Hadoop, which is a bit vague because Apache Hadoop is evolving and there are no standards around Hadoop API. Cloudera has it's own version of Hadoop CDH which is based on the Apache Hadoop. HortonWorks has been spun from Yahoo, which provides commercial support for Hadoop.

You can find more information here. Hadoop is evolving very fast, so this might be a bit stale.


This can refer to - hadoops file system, - or its effective support for map reduce... - or even more generally, to the idea of cloud / distributed storage systems.

Best to clarify what aspects of hadoop you are interested In.

Of course when comparing hadoop academically, you must first start looking at GFS- since that is the origin of hadoop.


Taking aside HBase we can see hadoop as two layers - storage layer and map-reduce layer. Storage layer has the following really different implementation which would be interesting to compare: standard hadoop file system, HDFS over Cassandra (Brisk), HDFS over S3, MapR hadoop implementation. MapR also have changed Map-reduce implementation.


This site http://www.nosql-database.org/ has a list of a lot of NoSql DBs out there. Maybe it can help you.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜