I'm working on a JsonStorage for Pig. Everything works fine, but I still need to get the names of the fields (i.e. crdate, name, positions) from the Pig schema.
I have an S3 bucket containing about 300 GB of log files in no particular order. I want to partition this data for use in Hadoop/Hive using a date-time stamp so that log lines related to a particular
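One common approach is a preprocessing pass that buckets each raw log line into a Hive-style `dt=YYYY-MM-DD` partition directory, which Hive can then register as partitions. A minimal sketch, assuming each line begins with an ISO date (the timestamp format, directory layout, and file name are assumptions, not from the question):

```python
import os
import re

def partition_log_lines(lines, out_dir):
    """Bucket log lines into Hive-style dt=YYYY-MM-DD partition
    directories, keyed on a leading ISO date in each line."""
    # Assumption: lines look like "2011-06-01 12:34:56 GET /index ..."
    date_re = re.compile(r"^(\d{4}-\d{2}-\d{2})")
    handles = {}
    try:
        for line in lines:
            m = date_re.match(line)
            if not m:
                continue  # skip lines without a recognizable date prefix
            part_dir = os.path.join(out_dir, "dt=" + m.group(1))
            if part_dir not in handles:
                os.makedirs(part_dir, exist_ok=True)
                handles[part_dir] = open(
                    os.path.join(part_dir, "part-00000"), "a")
            handles[part_dir].write(line + "\n")
    finally:
        for h in handles.values():
            h.close()
```

At 300 GB you would normally run this logic as a MapReduce job (date as the map output key) rather than a single-machine script, but the bucketing idea is the same.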
I'm collecting logs with Flume to HDFS. For the test case I have small files (~300 kB) because the log collecting process was scaled down from real usage.
How do I define an ArrayWritable for a custom Hadoop type? I am trying to implement an inverted index in Hadoop, with custom Hadoop types to store the data
How do I create the hadoop-0.21.0-core.jar from the source code? I have checked out the source code from SVN. Now I have three directories: common, hdfs, and mapred
I am trying to debug the WordCount example of Cloudera Hadoop, but I can't. I've added logging to the mapper and the reducer class, but the log doesn't appear in the console.
First of all, I am new to Hadoop. I have a small Hadoop Pipes program that throws java.io.EOFException. The program takes
I want to run a chain of MapReduce jobs, so the easiest solution seems to be JobControl. Say I have two jobs, job1 and job2, and I want to run job2 after job1. Well, I ran into some problems. After
I haven't found an answer to this even after a bit of googling. My input files are generated by a process which chunks them out when a file reaches, say, 1 GB. Now, if I were to run a MapReduce jo
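For context on why pre-chunked 1 GB files are usually fine as input: Hadoop's FileInputFormat divides each file into fixed-size splits (roughly one per HDFS block) and runs one map task per split, so a large file does not go to a single mapper. A toy illustration of the split arithmetic only (the split size here is a made-up value, and the real implementation also handles record boundaries and non-splittable compression):

```python
def compute_splits(file_length, split_size):
    """Toy version of FileInputFormat-style split planning: divide a
    file into (offset, length) ranges of at most split_size bytes."""
    splits = []
    offset = 0
    while offset < file_length:
        length = min(split_size, file_length - offset)
        splits.append((offset, length))
        offset += length
    return splits

GB = 1024 ** 3
MB = 1024 ** 2
# A 1 GB file with a 256 MB split size yields four map tasks.
print(len(compute_splits(GB, 256 * MB)))  # → 4
```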
Just wondering if anybody has done, or is aware of, encoding/compressing large images into JPEG 2000 format using Hadoop?