Package org.htuple

In MapReduce, using compound map output keys and customizing which fields are used for partitioning, sorting and grouping can be tedious, especially when doing this across multiple jobs.


Interface Summary
SerializationUtils.TupleElementSerializer  
 

Class Summary
SerializationUtils  
ShuffleUtils Utilities to help with custom sorting, grouping and partitioning of Tuple instances.
ShuffleUtils.ConfigBuilder A builder that allows you to tune how partitioning, sorting and grouping should work for a given MapReduce job using your Tuple instances.
Tuple A tuple for use as intermediary map outputs.
TupleComparator A Comparator for Tuple instances that can compare a subset of the elements in the tuple. This is useful for sorting and grouping when secondary-sort behavior is desired.
TupleGroupingComparator  
TupleMapRedPartitioner A partitioner for Tuple instances to support secondary sorting.
TupleMapReducePartitioner A partitioner for Tuple instances to support secondary sorting.
TuplePartitioner A partitioner for Tuple instances to support secondary sorting.
TupleSortComparator  
 

Package org.htuple Description

In MapReduce, using compound map output keys and customizing which fields are used for partitioning, sorting and grouping can be tedious, especially when doing this across multiple jobs. The goal of this library is to provide a Tuple class, which can contain multiple elements, together with a ShuffleUtils class that gives you an easy-to-use way to tune which tuple elements should be used for partitioning, sorting and grouping. The two classes that you as an end user care about are Tuple and ShuffleUtils.
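
To make the idea of partitioning, sorting and grouping on different subsets of a compound key concrete, the following is a minimal, library-independent sketch. It does not use the org.htuple API: the class SecondarySortSketch and the methods byIndices and partition are hypothetical names invented for illustration, and a plain List<String> stands in for a Tuple. It assumes a key laid out as (last name, first name), sorted on both elements but partitioned and grouped on the last name only.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Illustrative only: these names are hypothetical and are not the org.htuple API.
class SecondarySortSketch {

    // Compares two keys on the given element positions only, in the order given.
    static Comparator<List<String>> byIndices(int... indices) {
        return (a, b) -> {
            for (int i : indices) {
                int c = a.get(i).compareTo(b.get(i));
                if (c != 0) {
                    return c;
                }
            }
            return 0;
        };
    }

    // Chooses a partition from the hash of the selected elements only.
    static int partition(List<String> key, int numPartitions, int... indices) {
        int hash = 1;
        for (int i : indices) {
            hash = 31 * hash + key.get(i).hashCode();
        }
        // Mask the sign bit so the result is never negative.
        return (hash & Integer.MAX_VALUE) % numPartitions;
    }

    public static void main(String[] args) {
        // Key layout (an assumption for this sketch): 0 = last name, 1 = first name.
        List<List<String>> keys = new ArrayList<>(Arrays.asList(
                Arrays.asList("Smith", "John"),
                Arrays.asList("Jones", "Zoe"),
                Arrays.asList("Smith", "Anne")));

        // Sort on both elements.
        keys.sort(byIndices(0, 1));
        System.out.println(keys);

        // Group and partition on the last name only.
        Comparator<List<String>> grouping = byIndices(0);
        System.out.println(grouping.compare(keys.get(1), keys.get(2)) == 0);
        System.out.println(partition(keys.get(1), 4, 0) == partition(keys.get(2), 4, 0));
    }
}
```

In this sketch both "Smith" keys land in the same partition and compare as equal for grouping, while the full sort still orders them by first name. That combination is the secondary-sort behavior the class summary refers to.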



Copyright © 2013. All Rights Reserved.