public class TokenizerApplication extends Object
| Constructor and Description |
|---|
| TokenizerApplication() |

| Modifier and Type | Method and Description |
|---|---|
| static de.julielab.jcore.ae.jtbd.TokenizerApplication.EvalResult | doEvaluation(ArrayList<String> trainOrgSentences, ArrayList<String> trainTokSentences, ArrayList<String> predictOrgSentences, ArrayList<String> predictTokSentences, ArrayList<String> errors, ArrayList<String> predictions): General evaluation function, called from doCrossEvaluation or do9010Evaluation. |
| static void | doPrediction(File inDir, File outDir, String modelFilename): Tokenizes documents. |
| static void | doTraining(File orgSentencesFile, File tokSentencesFile, String modelFilename): Trains a model. |
| static void | main(String[] args) |

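
The doEvaluation signature takes four parallel lists: original and tokenized sentences for training, and original and tokenized sentences for prediction. The sketch below shows one way such lists could be prepared with a 90/10 split. It is an assumption based only on the name do9010Evaluation, not the library's actual splitting logic. The class `NinetyTenSplitSketch`, its `Split` holder, and the `split` method are hypothetical names introduced for illustration. The call to doEvaluation is left as a comment so the example compiles without the JTBD jar.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper, not part of JTBD: prepares parallel lists for doEvaluation. */
class NinetyTenSplitSketch {

    /** The four parallel lists, named after the doEvaluation parameters. */
    static final class Split {
        final ArrayList<String> trainOrgSentences = new ArrayList<>();
        final ArrayList<String> trainTokSentences = new ArrayList<>();
        final ArrayList<String> predictOrgSentences = new ArrayList<>();
        final ArrayList<String> predictTokSentences = new ArrayList<>();
    }

    /**
     * Puts the first trainPercent percent of the sentence pairs into the
     * training lists and the remainder into the prediction lists.
     * Integer arithmetic avoids floating-point rounding at the cut point.
     */
    static Split split(List<String> orgSentences, List<String> tokSentences, int trainPercent) {
        if (orgSentences.size() != tokSentences.size()) {
            throw new IllegalArgumentException("original and tokenized lists must have the same length");
        }
        if (trainPercent < 0 || trainPercent > 100) {
            throw new IllegalArgumentException("trainPercent must be between 0 and 100");
        }
        int size = orgSentences.size();
        int cut = size * trainPercent / 100;

        Split s = new Split();
        s.trainOrgSentences.addAll(orgSentences.subList(0, cut));
        s.trainTokSentences.addAll(tokSentences.subList(0, cut));
        s.predictOrgSentences.addAll(orgSentences.subList(cut, size));
        s.predictTokSentences.addAll(tokSentences.subList(cut, size));
        return s;
    }

    public static void main(String[] args) {
        // Illustrative data only: ten sentence pairs, original and tokenized.
        List<String> org = new ArrayList<>();
        List<String> tok = new ArrayList<>();
        for (int i = 0; i < 10; i++) {
            org.add("Sentence " + i + ".");
            tok.add("Sentence " + i + " .");
        }

        Split s = split(org, tok, 90);
        System.out.println("train=" + s.trainOrgSentences.size()
                + " predict=" + s.predictOrgSentences.size());
        // prints "train=9 predict=1"

        // With the JTBD jar on the classpath, the lists could be passed on
        // following the documented signature (errors and predictions are
        // assumed here to be output lists filled by the call):
        // ArrayList<String> errors = new ArrayList<>();
        // ArrayList<String> predictions = new ArrayList<>();
        // TokenizerApplication.doEvaluation(s.trainOrgSentences, s.trainTokSentences,
        //         s.predictOrgSentences, s.predictTokSentences, errors, predictions);
    }
}
```

The holder uses ArrayList rather than List because the documented doEvaluation parameters are declared as ArrayList<String>.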
public static de.julielab.jcore.ae.jtbd.TokenizerApplication.EvalResult doEvaluation(ArrayList<String> trainOrgSentences, ArrayList<String> trainTokSentences, ArrayList<String> predictOrgSentences, ArrayList<String> predictTokSentences, ArrayList<String> errors, ArrayList<String> predictions)

Parameters:
crf - the crf model
predictOrgSentences -
predictTokSentences -
errors -
predictions -

public static void doPrediction(File inDir, File outDir, String modelFilename) throws IOException

Parameters:
inDir - the directory with the documents to be tokenized
outDir - the directory where the tokenized documents should be written to
modelFile - the model to use for tokenization
Throws:
IOException

public static void doTraining(File orgSentencesFile, File tokSentencesFile, String modelFilename)

Parameters:
orgSentencesFile -
tokSentencesFile -
modelFilename -

public static void main(String[] args) throws IOException

Throws:
IOException

Copyright © 2019 JULIE Lab Jena, Germany. All rights reserved.