Package de.julielab.jcore.ae.jtbd
Class TokenizerApplication
- java.lang.Object
-
- de.julielab.jcore.ae.jtbd.TokenizerApplication
-
public class TokenizerApplication extends Object
-
-
Constructor Summary
Constructors Constructor Description TokenizerApplication()
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static de.julielab.jcore.ae.jtbd.TokenizerApplication.EvalResultdoEvaluation(ArrayList<String> trainOrgSentences, ArrayList<String> trainTokSentences, ArrayList<String> predictOrgSentences, ArrayList<String> predictTokSentences, ArrayList<String> errors, ArrayList<String> predictions)static voiddoPrediction(File inDir, File outDir, String modelFilename)tokenize documentsstatic voiddoTraining(File orgSentencesFile, File tokSentencesFile, String modelFilename)train a modelstatic voidmain(String[] args)
-
-
-
Method Detail
-
doEvaluation
public static de.julielab.jcore.ae.jtbd.TokenizerApplication.EvalResult doEvaluation(ArrayList<String> trainOrgSentences, ArrayList<String> trainTokSentences, ArrayList<String> predictOrgSentences, ArrayList<String> predictTokSentences, ArrayList<String> errors, ArrayList<String> predictions)
-
doPrediction
public static void doPrediction(File inDir, File outDir, String modelFilename) throws IOException
tokenize documents- Parameters:
inDir- the directory with the documents to be tokenizedoutDir- the directory where the tokenized documents should be written tomodelFilename- the model to use for tokenization- Throws:
IOException
-
doTraining
public static void doTraining(File orgSentencesFile, File tokSentencesFile, String modelFilename)
train a model- Parameters:
orgSentencesFile-tokSentencesFile-modelFilename-
-
main
public static void main(String[] args) throws IOException
- Throws:
IOException
-
-