I am new to Hadoop and trying to process a Wikipedia dump. It's a 6.7 GB gzip-compressed XML file. I read that Hadoop supports gzip-compressed files, but a gzip file can only be processed by a single mapper
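Gzip is not a splittable codec, so Hadoop hands the whole 6.7 GB file to one mapper. A common workaround is to recompress the dump as bzip2, which Hadoop can split across mappers. A minimal sketch using only the Python standard library (the file names are placeholders, not from the question):

```python
import bz2
import gzip


def gzip_to_bz2(src_path, dst_path, chunk_size=1 << 20):
    """Stream-recompress a .gz file to .bz2 without loading it into memory."""
    with gzip.open(src_path, "rb") as src, bz2.open(dst_path, "wb") as dst:
        while True:
            chunk = src.read(chunk_size)
            if not chunk:
                break
            dst.write(chunk)


# Example (hypothetical file names):
# gzip_to_bz2("enwiki-pages-articles.xml.gz", "enwiki-pages-articles.xml.bz2")
```

Recompressing a file this size takes a while, but it is a one-time cost that lets every subsequent job parallelize its map phase.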
I have a simple Python script (moo.py) that I am trying to stream through: import sys, os
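The question does not show what moo.py actually does, but a Hadoop Streaming mapper follows a fixed contract: read raw lines on stdin, emit tab-separated key/value pairs on stdout. A minimal stand-in sketch (word count is an assumption, not the asker's logic):

```python
import sys


def map_stream(lines, out):
    """Emit (word, 1) pairs in the tab-separated format Hadoop Streaming expects."""
    for line in lines:
        for word in line.split():
            out.write("%s\t1\n" % word)


if __name__ == "__main__":
    map_stream(sys.stdin, sys.stdout)
```

A script like this would typically be launched with the streaming jar, roughly `hadoop jar hadoop-streaming.jar -mapper moo.py -file moo.py -input ... -output ...` (paths here are placeholders).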
Can someone tell me how to install a MapReduce (Hadoop) plugin in eclipse-cpp-helios-SR2-linux? Thanks in advance.
I'm trying to run the MapReduce implementation of the quadratic sieve algorithm on Hadoop. For this purpose I'm using the Karmasphere Hadoop community plugin with NetBeans. The program works fine using the
Dear hadooper: I'm new to Hadoop, and recently tried to implement an algorithm. This algorithm needs to calculate a matrix, which represents the different ratings of every pair of songs
I have a map-reduce Java program in which I try to compress only the mapper output but not the reducer output. I thought that this would be possible by setting the following properties in the Configuration
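The properties the question is cut off before listing are likely the intermediate-compression ones. In the old (pre-0.21) API the relevant keys are shown below; newer Hadoop renames the first to `mapreduce.map.output.compress`. A sketch of the intended settings (codec choice is an assumption):

```
mapred.compress.map.output=true
mapred.map.output.compression.codec=org.apache.hadoop.io.compress.GzipCodec
mapred.output.compress=false
```

Map output compression only affects the intermediate shuffle data; final reducer output stays uncompressed unless `mapred.output.compress` is set to true.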
I'm running into a strange issue. When I run my Hadoop job over a large dataset (>1 TB of compressed text files), several of the reduce tasks fail, with stack traces like these:
I've been studying Hadoop's scheduler mechanism recently, using 0.20.2 (fair & capacity schedulers included)
I am trying to find out the progress rate of the map tasks. If someone can help me out it will be great! Thanks!

There are two ways we monitor the progress of the Map and
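One way to watch map progress from outside the job is the CLI: `hadoop job -status <job_id>`, which in 0.20-era Hadoop prints lines such as `map() completion: 0.66`. A small parsing sketch, assuming that output format:

```python
import re


def parse_completion(status_text):
    """Extract map/reduce completion fractions from `hadoop job -status` output."""
    progress = {}
    for phase in ("map", "reduce"):
        m = re.search(r"%s\(\) completion:\s*([\d.]+)" % phase, status_text)
        if m:
            progress[phase] = float(m.group(1))
    return progress
```

The same numbers are also visible in the JobTracker web UI, which is usually the quicker option for a one-off check.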
I am trying to figure out a solution for managing a set of Linux machines (OS: Ubuntu, ~40 nodes, same hardware). These machines are supposed to be images of each other; software installed on one needs to