Class GuaguaMRUnitDriver<MASTER_RESULT extends ml.shifu.guagua.io.Bytable,​WORKER_RESULT extends ml.shifu.guagua.io.Bytable>

  • Type Parameters:
    MASTER_RESULT - master result for computation in each iteration.
    WORKER_RESULT - worker result for computation in each iteration.

    public class GuaguaMRUnitDriver<MASTER_RESULT extends ml.shifu.guagua.io.Bytable,​WORKER_RESULT extends ml.shifu.guagua.io.Bytable>
    extends ml.shifu.guagua.unit.GuaguaUnitDriver<MASTER_RESULT,​WORKER_RESULT>
    GuaguaMRUnitDriver is used to run in-memory guagua application by using hadoop MapReduce splits. Please check ml.shifu.guagua.mapreduce.example.sum.SumTest in guagua-mapreduce-examples project.
    • Constructor Detail

      • GuaguaMRUnitDriver

        public GuaguaMRUnitDriver​(Properties props)
        A only constructor here for local in-memory guagua job. Please check ml.shifu.guagua.mapreduce.example.sum.SumTest in guagua-mapreduce-examples project.
        Parameters:
        props - set all the configurations like input, output and ..
    • Method Detail

      • generateWorkerSplits

        public List<ml.shifu.guagua.io.GuaguaFileSplit[]> generateWorkerSplits​(String inputs)
                                                                        throws IOException
        Specified by:
        generateWorkerSplits in class ml.shifu.guagua.unit.GuaguaUnitDriver<MASTER_RESULT extends ml.shifu.guagua.io.Bytable,​WORKER_RESULT extends ml.shifu.guagua.io.Bytable>
        Throws:
        IOException
      • listStatus

        protected List<org.apache.hadoop.fs.FileStatus> listStatus​(org.apache.hadoop.conf.Configuration conf,
                                                                   String input)
                                                            throws IOException
        List input directories. Subclasses may override to, e.g., select only files matching a regular expression.
        Parameters:
        conf - the job configuration
        input - the input file path
        Returns:
        array of FileStatus objects
        Throws:
        IOException - if zero items.
        org.apache.hadoop.mapreduce.lib.input.InvalidInputException - If any IOException for input files.