I am working on a solution where I will have a Hadoop cluster with Hive running, and I want to send jobs and Hive queries from a .NET application to be processed and get notified when they complete.
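If the cluster runs HiveServer2, the usual route is its JDBC/Thrift endpoint; below is a minimal Java sketch of submitting a query and blocking until it completes (a .NET client would use the equivalent ODBC/Thrift bindings; the host, port, and table name are hypothetical):

    import java.sql.Connection;
    import java.sql.DriverManager;
    import java.sql.ResultSet;
    import java.sql.Statement;

    public class RemoteHiveQuery {
        public static void main(String[] args) throws Exception {
            // The Hive JDBC driver ships with the Hive distribution.
            Class.forName("org.apache.hive.jdbc.HiveDriver");
            // "hive-host:10000" is a placeholder for your HiveServer2 endpoint.
            Connection conn = DriverManager.getConnection(
                    "jdbc:hive2://hive-host:10000/default", "user", "");
            Statement stmt = conn.createStatement();
            ResultSet rs = stmt.executeQuery("SELECT COUNT(*) FROM my_table");
            while (rs.next()) {
                System.out.println("rows: " + rs.getLong(1));
            }
            rs.close();
            stmt.close();
            conn.close();
        }
    }

Since executeQuery blocks until the query finishes, returning from it is a natural "job done" notification point.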
I have a weird problem: DistributedCache appears to change the names of my files; it uses the original name as the parent folder and adds the file as a child.
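For reference, a typical way to register and read back cache files on the 0.20-era API looks like the sketch below; the "#lookup.dat" fragment (hypothetical name) controls the local symlink name, which is one place the localized layout can surprise you:

    import java.net.URI;
    import org.apache.hadoop.filecache.DistributedCache;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapred.JobConf;

    public class CacheSetup {
        public static void configure(JobConf conf) throws Exception {
            // Ship an HDFS file to every task; the "#lookup.dat" fragment
            // names the local symlink created in the task's working directory.
            DistributedCache.addCacheFile(new URI("/user/me/lookup.dat#lookup.dat"), conf);
            DistributedCache.createSymlink(conf);
        }

        // Inside configure() of a mapper/reducer, resolve the localized copies:
        public static Path[] localCopies(JobConf conf) throws Exception {
            return DistributedCache.getLocalCacheFiles(conf);
        }
    }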
I'm trying to use JIT compilation in Clojure to generate mapper and reducer classes on the fly. However, these classes aren't being recognized by the JobClient (it's the usual ClassNotFoundException).
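The likely cause is that task JVMs resolve mapper/reducer classes from the job jar rather than from the submitting JVM's classloader, so classes generated at runtime have to be compiled to classfiles and bundled into a jar the job ships with itself; a rough sketch of the driver side, with all jar and class names hypothetical:

    import org.apache.hadoop.mapred.JobConf;

    public class GeneratedJobSetup {
        public static JobConf build() {
            JobConf conf = new JobConf();
            // Point the job at a jar built from the runtime-emitted classfiles,
            // so the child JVMs on the cluster can actually load them.
            conf.setJar("generated-job.jar");                  // hypothetical jar
            conf.set("mapred.mapper.class", "gen.MyMapper");   // hypothetical names
            conf.set("mapred.reducer.class", "gen.MyReducer");
            return conf;
        }
    }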
I am running Hadoop 0.20.1 under SLES 10 (SUSE). My Map task takes a file and generates a few more; I then generate my results from these files. I would like to know where I should place these files.
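On 0.20, the conventional place for such intermediate side files is the task attempt's work directory, which is promoted to the job's real output directory only if the attempt succeeds; a minimal sketch using the old mapred API, with a hypothetical file name:

    import java.io.IOException;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapred.FileOutputFormat;
    import org.apache.hadoop.mapred.JobConf;

    public class SideFiles {
        public static FSDataOutputStream open(JobConf job) throws IOException {
            // Side files written here are safe under task retries and
            // speculative execution, since only the winning attempt's
            // work directory gets promoted.
            Path workDir = FileOutputFormat.getWorkOutputPath(job);
            FileSystem fs = workDir.getFileSystem(job);
            return fs.create(new Path(workDir, "extra-output.txt"));
        }
    }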
I have a simple text file containing two columns, both integers:

    1 5
    1 12
    2 5
    2 341
    2 12

and so on. I need to group the dataset by the second value.
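In MapReduce terms, grouping by the second column just means emitting it as the map output key; a minimal mapper sketch (the shuffle then delivers all first-column values for each second-column value to a single reduce call):

    import java.io.IOException;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Mapper;

    // Emit the second column as the key so the shuffle groups rows by it.
    public class SecondColumnMapper
            extends Mapper<LongWritable, Text, IntWritable, IntWritable> {
        @Override
        protected void map(LongWritable offset, Text line, Context ctx)
                throws IOException, InterruptedException {
            String[] cols = line.toString().trim().split("\\s+");
            ctx.write(new IntWritable(Integer.parseInt(cols[1])),  // key: second column
                      new IntWritable(Integer.parseInt(cols[0]))); // value: first column
        }
    }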
I love Hadoop streaming for its ability to quickly pump out quick-and-dirty, one-off MapReduce jobs. I also love Groovy for making all my carefully coded Java accessible to a scripting language. Now
I'm running a Hadoop job over 1.5 TB of data, doing a lot of pattern matching. I have several machines with 16 GB of RAM each, and I always get an OutOfMemoryException on this job with this data (I'm using
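Whatever the version, the usual first suspect is the child task JVM's heap, which defaults to roughly 200 MB regardless of how much RAM the machine has; a minimal sketch of raising it (the -Xmx value is an example, not a recommendation):

    import org.apache.hadoop.mapred.JobConf;

    public class HeapSetup {
        public static void raiseTaskHeap(JobConf conf) {
            // Each map/reduce task runs in a separate child JVM, so its heap
            // setting is what matters, not the machine's total 16 GB.
            conf.set("mapred.child.java.opts", "-Xmx2048m");
        }
    }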
I have a large set of text files in an S3 directory. For each text file, I want to apply a function (an executable loaded through bootstrapping) and then write the results to another text file with the
I have a 'large' set of line-delimited full sentences that I'm processing with Hadoop. I've developed a mapper that applies some of my favorite NLP techniques to it. There are several different techniques
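For per-sentence work like this, a map-only job (zero reducers) keeps things simple; a minimal sketch, where annotate() is a hypothetical stand-in for whichever NLP technique is applied:

    import java.io.IOException;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.NullWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Mapper;

    public class SentenceMapper
            extends Mapper<LongWritable, Text, Text, NullWritable> {
        @Override
        protected void map(LongWritable offset, Text sentence, Context ctx)
                throws IOException, InterruptedException {
            // annotate() stands in for the real NLP step.
            ctx.write(new Text(annotate(sentence.toString())), NullWritable.get());
        }

        private String annotate(String s) {
            return s; // placeholder
        }
    }

Pairing this with job.setNumReduceTasks(0) writes map output directly and skips the shuffle entirely.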
I'm using Dumbo for some Hadoop Streaming jobs. I have a bunch of JSON dictionaries, each containing an article (multiline text) and some metadata. I know Hadoop performs best when given large files, so
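One common answer is to pack the small JSON documents into a SequenceFile up front, one record per article, so the multiline text survives intact and Hadoop sees a few large files; a rough Java sketch with hypothetical paths and content:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.SequenceFile;
    import org.apache.hadoop.io.Text;

    public class PackArticles {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            SequenceFile.Writer writer = SequenceFile.createWriter(
                    fs, conf, new Path("/data/articles.seq"), Text.class, Text.class);
            // One record per document: the multiline JSON body is the value,
            // so record boundaries no longer depend on newlines.
            writer.append(new Text("article-001"), new Text("{ ...json... }"));
            writer.close();
        }
    }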