I have a use case where I want to upload big gzipped text data files (~60 GB) to HDFS. My code below takes about 2 hours to upload these files in chunks of 500 MB. Following is the pseudo code.
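A minimal sketch of one way to do such a chunked upload, assuming the Hadoop FileSystem API, a hypothetical local source path, and one HDFS part file per 500 MB chunk (not the original code):

```java
import java.io.BufferedInputStream;
import java.io.FileInputStream;
import java.io.InputStream;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ChunkedHdfsUpload {
    private static final long CHUNK_SIZE = 500L * 1024 * 1024; // 500 MB per part file

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);

        // Hypothetical local source file; the bytes are copied as-is.
        InputStream in = new BufferedInputStream(
                new FileInputStream("/data/local/events.txt.gz"));
        byte[] buffer = new byte[64 * 1024];
        int part = 0;
        long writtenInPart = 0;
        int read;
        FSDataOutputStream out = fs.create(new Path("/user/data/events/part-" + part));

        while ((read = in.read(buffer)) != -1) {
            // Roll over to a new part file once the current one reaches the chunk size.
            if (writtenInPart + read > CHUNK_SIZE) {
                out.close();
                part++;
                writtenInPart = 0;
                out = fs.create(new Path("/user/data/events/part-" + part));
            }
            out.write(buffer, 0, read);
            writtenInPart += read;
        }
        out.close();
        in.close();
        fs.close();
    }
}
```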
Trying to install HBase, but the word on the street is that if I don't use a Hadoop build from the 0.20-append branch, I'll lose data. This tutorial says that it will work with 0.90.2, but doesn't discuss…
I used to think that Hive was just a SQL-like programming language used to make writing MapReduce-type jobs easier (i.e., a SQL-like version of Pig/Pig Latin). I'm reading more about it now, though,…
I am considering various technologies for data warehousing and business intelligence, and have come upon this radical tool called Hadoop. Hadoop doesn't seem to be exactly built for BI…
How can I do sub-selections in Hive? I think I might be making a really obvious mistake that's not so obvious to me...
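Older Hive versions only support subqueries in the FROM clause, and the subquery must be given an alias. A minimal sketch, assuming access through the Hive JDBC driver (driver class and connection URL vary by Hive version) and a hypothetical `purchases` table:

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class HiveSubSelect {
    public static void main(String[] args) throws Exception {
        Class.forName("org.apache.hive.jdbc.HiveDriver"); // HiveServer2 driver
        Connection con = DriverManager.getConnection(
                "jdbc:hive2://localhost:10000/default", "", "");
        Statement stmt = con.createStatement();

        // Sub-selection: aggregate per user in the inner query, filter in the outer one.
        String sql =
            "SELECT t.user_id, t.total " +
            "FROM (SELECT user_id, SUM(amount) AS total FROM purchases GROUP BY user_id) t " +
            "WHERE t.total > 100";

        ResultSet rs = stmt.executeQuery(sql);
        while (rs.next()) {
            System.out.println(rs.getString(1) + "\t" + rs.getDouble(2));
        }
        con.close();
    }
}
```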
I have a bunch of zip files of CSVs that I want to create a Hive table from. I'm trying to figure out the best way to do so.
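A minimal sketch of one approach, assuming the CSVs are streamed out of each zip archive into a single HDFS directory that an external Hive table can then point at; all paths, column names, and the DDL in the comment are hypothetical:

```java
import java.io.File;
import java.io.InputStream;
import java.util.Enumeration;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

public class ZipCsvToHdfs {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        Path target = new Path("/user/hive/warehouse/raw_csv"); // hypothetical HDFS dir

        for (File zip : new File("/data/local/zips").listFiles()) {
            ZipFile archive = new ZipFile(zip);
            Enumeration<? extends ZipEntry> entries = archive.entries();
            while (entries.hasMoreElements()) {
                ZipEntry entry = entries.nextElement();
                if (entry.isDirectory()) {
                    continue;
                }
                // Copy each CSV entry straight from the zip into HDFS.
                InputStream in = archive.getInputStream(entry);
                FSDataOutputStream out = fs.create(new Path(target, entry.getName()));
                IOUtils.copyBytes(in, out, conf); // closes both streams
            }
            archive.close();
        }
        // An external Hive table can then be created over the directory, e.g.:
        //   CREATE EXTERNAL TABLE raw_csv (col1 STRING, col2 STRING)
        //   ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
        //   LOCATION '/user/hive/warehouse/raw_csv';
        fs.close();
    }
}
```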
I am studying how to use Apache Mahout, and got the following message after running one of its examples: Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path…
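That exception usually means the input path the example expects does not exist on the filesystem Hadoop resolves (HDFS when fs.default.name points at a cluster, the local filesystem otherwise). A minimal sketch for checking how a path resolves, with a hypothetical path:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class CheckMahoutInput {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        Path input = new Path("testdata"); // hypothetical: path the example expects

        // Relative paths resolve against the user's home directory on that filesystem,
        // so the data usually has to be copied there before running the example.
        System.out.println("Resolved to: " + fs.makeQualified(input));
        System.out.println("Exists: " + fs.exists(input));
    }
}
```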
If a data block is replicated, which data nodes will it be replicated to? Is there any tool to show where the replicated blocks are present? If you know the filename, you can look this…
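One way to see where the replicas of a known file live is the FileSystem block-location API (the `hadoop fsck <path> -files -blocks -locations` command reports the same information from the command line). A minimal sketch, with a hypothetical file path:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.BlockLocation;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ShowBlockLocations {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        FileStatus status = fs.getFileStatus(new Path("/user/data/events/part-0"));

        // One BlockLocation per block; getHosts() names the datanodes holding replicas.
        BlockLocation[] blocks = fs.getFileBlockLocations(status, 0, status.getLen());
        for (BlockLocation block : blocks) {
            System.out.println("offset=" + block.getOffset()
                    + " length=" + block.getLength()
                    + " hosts=" + String.join(",", block.getHosts()));
        }
        fs.close();
    }
}
```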
Basically the whole question is in the title. I'm wondering if it's possible to append to a file located on HDFS from multiple computers simultaneously? Something like storing a stream of events constantly…
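HDFS grants a single write lease per file, so two clients cannot safely append to the same file at the same time; the usual workaround is to funnel all events through one writer, or to write one file per producer. A minimal sketch of a single appender using FileSystem.append, with a hypothetical path (append support also has to be enabled on the cluster):

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class SingleWriterAppend {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        Path log = new Path("/user/data/events.log");

        if (!fs.exists(log)) {
            fs.create(log).close();
        }

        // Only one client may hold the append lease on this file at a time.
        FSDataOutputStream out = fs.append(log);
        out.writeBytes("event-from-one-writer\n");
        out.close();
        fs.close();
    }
}
```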