Hadoop - Developers


Sorting key-value pairs after map function in mapreduce
I have a file that contains IP packet headers in text format. After the map function, each reduce method is called for a particular IP address. I want the values in a sorted order,
Q&A views (5)
Who is the author of Hadoop?
位阳阳 2021-04-21 21:37 Editor-in-chief 李屹之 位阳阳 2021-04-21 21:38
Q&A views (0)
XML Processing in hadoop
I have more than 200 XML files in HDFS. I use the XmlInputFormat (from Mahout) to stream the elements. The mapper is able to get the XML contents and process them. But the problem is only the first xml
Q&A views (7)
Understanding SQL joins within WHERE clause
I have a query in SQL that I'm trying to translate into Pig Latin (for use on a Hadoop cluster). Most of the time I have no problem moving the queries over to Pig, but I've encountered something I ca
Q&A views (2)
Hive - How can I write a create statement for a variable length, existing, hdfs file?
So, I have an existing HDFS directory containing a bunch of files. These files are all tab-delimited.
Q&A views (6)
hadoop single node setup
I am trying to do a single-node setup for Hadoop as described at the following link: http://hadoop.apache.org/common/docs/current/single_node_setup.html
Q&A views (2)
Processing large set of small files with Hadoop
I am using the Hadoop example program WordCount to process a large set of small files/web pages (approx. 2-3 kB each). Since this is far from the optimal file size for Hadoop, the program is very slow. I gue
Q&A views (6)
Cassandra with Hive
I am new to Cassandra and Hive. Now I want to integrate Cassandra with Hadoop-Hive, but how can I integrate Cassandra with Hive? You're in luck: DataStax just released Brisk, a Cass
Q&A views (2)
Equivalent of linux 'diff' in Apache Pig
I want to be able to do a standard diff on two large files. I've got something that will work, but it's not nearly as quick as diff on the command line.
Q&A views (1)
FileInputFormat where filename is KEY and text contents are VALUE
I'd like to use an entire file as a single record for MAP processing, with the filename as the key. I've read the following post: How to get Filename/File Contents as key/value input for MAP when ru
Q&A views (1)
