mapreduce_开发者

开发者

mapreduce

相关标签：javascript jquery android 多少钱 iPhone

What is the maximum number of records that a hadoop reducer's reduce() call can take?
I have a mapper whose output i开发者_如何学运维s mapped to multiple different reducer instances by using my own Partitioner. My partitioner makes sure that a given is sent always to a given reducer in
问答阅读(1)
Hadoop for processing very large binary files
I have a system I wish to distribute where I have a number of very large non-splittable binary files I wish to process in a distributed fashion. These are of the order of a couple of hundreds of Gb. F
问答阅读(3)
Get the input path in a Hadoop Mapper Class
I have implemented a simple MapReduce project in Hadoop for processing logs. The input path is the directory where the logs are.
问答阅读(3)
How to use a binary executable which takes filenames as arguments in hadoop streaming?
Say I have a binary executable which takes filenames as arguments, like \'myprog file1 file2\', it reads from file1 and writes t开发者_如何学Goo file2. The binary executable does not take stdin and do
问答阅读(2)
Dimension Reduction with Map reduce, using distributed computing?
Do you know an application or algorithm to reduce dimensionality of big data, maybe using Map-Reduce, or other ap开发者_如何学Pythoni, also:
问答阅读(7)
MongoDB, return recent document for each user_id in collection
Looking for similar functionality to Postgres\' Dist开发者_JS百科inct On. Have a collection of documents {user_id, current_status, date}, where status is just text and date is a Date.Still in the ear
问答阅读(0)
Are these the ways to write a query without joins in a NOSQL scalable website architecture?
I keep hearing that one of the ways to architect a scalable website is to not use joins. How is the world do you do that since most data is relational?
问答阅读(2)
Sort and shuffle optimization in Hadoop MapReduce
I\'m looking for a research/implementation based project on Hadoop and I came across the list posted on the wiki page - http://wiki.apache.org/hadoop/ProjectSuggestions. But, this page was last update
问答阅读(0)
Hadoop, hardware and bioinformatics
We\'re about to buy new hardware to run our analyses and are wondering if we\'re making the right decisions.
问答阅读(3)
MapReduce - how do I calculate relative values (average, top k and so)?
I\'m looking for a way to calculate \"global\" or \"relative\" values during a MapReduce process - an average, sum, top etc. Say I have a list of workers, with their IDs associated with their salaries
问答阅读(2)

首页上一页第21页下一页共39页