mapreduce_开发者

开发者

mapreduce

相关标签：javascript jquery android 多少钱 iPhone

"Child Error" in Executing stream Job on multi node Hadoop cluster (cloudera distribution CDH3u0 Hadoop 0.20.2)
I am working on 8 node Hadoop cluster, and I am trying to execute a simple streaming Job with the specified configuration.
问答阅读(6)
CouchDB for Fixed Categories Queries
I have documents like this in my CouchDB: { \"_id\": \"0cb35be3cc73d6859c303fa3200011d2\", \"_rev\": \"1-f6e356bbf6ab09290aae11132af50d66\",
问答阅读(2)
MapReduce or a batch job?
I have a function which needs to be called on a lot of files (1000\'s). Each is independent of another, and can be run in parallel. The output of the function for each of the files does not need to be
问答阅读(5)
How to take the average of big data in MongoDB vs CouchDB?
I\'m looking at this chart... http://www.mongodb.org/display/DOCS/MongoDB,+CouchDB,+MySQL+Compare+Grid
问答阅读(5)
What does Disco's "Could not parse worker event:" error mean?
I\'m trying to run a Disco job using map and reduce functions that are deserialized after being passed over a TCP socket using the mar开发者_如何学Goshal library. Specifically, I\'m unpacking them wit
问答阅读(2)
How to sort (order by) big data with hive efficiently?
I want to sort a big dataset efficiently (i.e. with a custom partitioner, like described here: How does the MapReduce sort algorithm work?)开发者_开发技巧, but I want to do it with hive.
问答阅读(4)
MapReduce pairwise comparison of all lines in multiple files
I\'m getting started with using python\'s mrjob to convert some of my long running python programs into MapReduce hadoop jobs. I\'ve gotten the simple word count examples to work and I conceptually un
问答阅读(1)
Most efficient way to generate a list of Unigrams from a text field in MongoDB
I need to generate a vector of u开发者_JAVA百科nigrams, i.e. a vector of all the unique words which appear in a specific text field that I have stored as part of a broader JSON object in MongoDB.
问答阅读(4)
Doubling each number a number of times as specify by the user
I am new to hadoop and I am learning by using few examples. I am currently trying to pass a file with random integers on it. For each and every number i w开发者_运维知识库ant it to be double base on t
问答阅读(2)
How could I programmatically get all the job tracker and tasktracker information that is displayed by Hadoop in the web interface?
I\'m using Cloudera\'s Hadoop distribution CDH-0.20.2CDH3u0. Is there any way I could the information such as 开发者_如何学Cjobtracker status, tasktracker status, counters using a JAVA program running
问答阅读(3)

首页上一页第11页下一页共39页