Class FormatConverter


  • public class FormatConverter
    extends java.lang.Object
    • Constructor Summary

      Constructors 
      Constructor Description
      FormatConverter()  
    • Method Summary

      All Methods Static Methods Concrete Methods 
      Modifier and Type Method Description
      static void main​(java.lang.String[] args)  
      static void makeDataSplit​(double fractionGold, int initSize, java.util.ArrayList<java.lang.String> pipedData, java.util.ArrayList<java.lang.String> initOut, java.util.ArrayList<java.lang.String> poolOut, java.util.ArrayList<java.lang.String> goldOut)
      split data in piped format into pool data (corpus) and gold data and a inital trainingset
      static void makeDataSplit​(double fractionGold, java.util.ArrayList<java.lang.String> pipedData, java.util.ArrayList<java.lang.String> poolOut, java.util.ArrayList<java.lang.String> goldOut)
      split data in piped format into pool data (corpus) and gold data
      static java.util.ArrayList<java.lang.String> makePipedFormat​(java.util.ArrayList<java.lang.String> iobData, java.util.ArrayList<java.util.ArrayList<java.lang.String>> metaData, java.util.TreeSet<java.lang.String> tags)
      converts a IOB and a POS file into one file in piped format
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • FormatConverter

        public FormatConverter()
    • Method Detail

      • main

        public static void main​(java.lang.String[] args)
      • makePipedFormat

        public static java.util.ArrayList<java.lang.String> makePipedFormat​(java.util.ArrayList<java.lang.String> iobData,
                                                                            java.util.ArrayList<java.util.ArrayList<java.lang.String>> metaData,
                                                                            java.util.TreeSet<java.lang.String> tags)
        converts a IOB and a POS file into one file in piped format
      • makeDataSplit

        public static void makeDataSplit​(double fractionGold,
                                         java.util.ArrayList<java.lang.String> pipedData,
                                         java.util.ArrayList<java.lang.String> poolOut,
                                         java.util.ArrayList<java.lang.String> goldOut)
        split data in piped format into pool data (corpus) and gold data
        Parameters:
        fractionGold - the fraction of the gold input
        poolOut - output of pooldata
        goldOut - output of golddata
      • makeDataSplit

        public static void makeDataSplit​(double fractionGold,
                                         int initSize,
                                         java.util.ArrayList<java.lang.String> pipedData,
                                         java.util.ArrayList<java.lang.String> initOut,
                                         java.util.ArrayList<java.lang.String> poolOut,
                                         java.util.ArrayList<java.lang.String> goldOut)
        split data in piped format into pool data (corpus) and gold data and a inital trainingset
        Parameters:
        fractionGold - the fraction of the gold
        initSize - size of initial trainingset
        initOut - output of initial trainingset
        poolOut - output of pooldata
        goldOut - output of golddata