软件包 ml.shifu.guagua.mapreduce

This package contains all the guagua-mapreduce implementation details based one only mapper hadoop job.

请参见:
          描述

类摘要
GuaguaInputFormat GuaguaInputFormat is used to determine how many mappers in guagua MapReduce job.
GuaguaInputSplit InputSplit implementation in guagua for Hadoop MapReduce job.
GuaguaLineRecordReader Copy from LineRecordReader but to support GuaguaRecordReader interface.
GuaguaMapper<MASTER_RESULT extends Bytable,WORKER_RESULT extends Bytable> GuaguaMapper is the Hadoop Mapper implementation for both guagua master and guagua workers.
GuaguaMapReduceClient GuaguaMapReduceClient is the entry point for guagua mapreduce implementation application.
GuaguaMapReduceConstants Constants in guagua mapreduce.
GuaguaMRRecordReader GuaguaMRRecordReader is used as a mock for mapreduce reader interface, not real reading data.
GuaguaMRUnitDriver<MASTER_RESULT extends Bytable,WORKER_RESULT extends Bytable> GuaguaMRUnitDriver is used to run in-memory guagua application by using hadoop MapReduce splits.
GuaguaOptionsParser Copy from hadoop GenericOptionsParser and extend it.
GuaguaOutputCommitter This output committer doesn't do anything, meant for the case where output isn't desired, or as a base for not using FileOutputCommitter.
GuaguaOutputFormat This is for internal use only.
GuaguaRecordWriter We don't use GuaguaRecordWriter but hadoop MapReduce needs it.
GuaguaWritableAdapter<W extends Writable> GuaguaWritableAdapter is used to adapt hadoop io to Bytable interface.
GuaguaWritableSerializer<RESULT extends Bytable> GuaguaWritableSerializer is to provide the functionality to support use hadoop writable interface.
 

软件包 ml.shifu.guagua.mapreduce 的描述

This package contains all the guagua-mapreduce implementation details based one only mapper hadoop job.

com.paypal.guagua.mapreduce.GuaguaMapReduceClient is the entry point to trigger a guagua job by invoking its main method.

IO related classes including com.paypal.guagua.mapreduce.GuaguaWritableAdapter and com.paypal.guagua.mapreduce.GuaguaWritableSerializer can be used to combine hadoop Writable with guagua com.paypal.guagua.io.Bytable.

com.paypal.guagua.mapreduce.GuaguaMRUnitDriver is a class for unit test, it will run guagua master-workers in one jvm instance. See SumTest in guagua-mapreduce-examples project for details.

Hadoop input format and output format customization for guagua is in com.paypal.guagua.mapreduce.GuaguaInputFormat and com.paypal.guagua.mapreduce.GuaguaOutputFormat and other related classes.



Copyright © 2014. All Rights Reserved.