@Beta
public abstract class Tokenizer_ImplBase<TOKEN_TYPE extends org.apache.uima.jcas.tcas.Annotation,SENTENCE_TYPE extends org.apache.uima.jcas.tcas.Annotation>
extends org.apache.uima.fit.component.JCasAnnotator_ImplBase
This class provides a wrapper for the ClearNLP part of speech tokenizer for UIMA and/or ClearTK type systems. Subclasses should override the abstract methods to produce the annotations relevant for the target type system. This tagger is available here:
http://clearnlp.googlecode.com
| Modifier and Type | Field and Description |
|---|---|
static String |
PARAM_LANGUAGE_CODE |
static String |
PARAM_SEGMENT_SENTENCES |
static String |
PARAM_WINDOW_CLASS |
| Constructor and Description |
|---|
Tokenizer_ImplBase() |
| Modifier and Type | Method and Description |
|---|---|
protected abstract SentenceOps<SENTENCE_TYPE> |
getSentenceOps() |
protected abstract TokenOps<TOKEN_TYPE> |
getTokenOps() |
void |
initialize(org.apache.uima.UimaContext context) |
void |
process(org.apache.uima.jcas.JCas jCas) |
getRequiredCasInterface, processgetCasInstancesRequired, hasNext, nextpublic static final String PARAM_SEGMENT_SENTENCES
public static final String PARAM_LANGUAGE_CODE
public static final String PARAM_WINDOW_CLASS
protected abstract TokenOps<TOKEN_TYPE> getTokenOps()
protected abstract SentenceOps<SENTENCE_TYPE> getSentenceOps()
public void initialize(org.apache.uima.UimaContext context)
throws org.apache.uima.resource.ResourceInitializationException
initialize in interface org.apache.uima.analysis_component.AnalysisComponentinitialize in class org.apache.uima.fit.component.JCasAnnotator_ImplBaseorg.apache.uima.resource.ResourceInitializationExceptionpublic void process(org.apache.uima.jcas.JCas jCas)
throws org.apache.uima.analysis_engine.AnalysisEngineProcessException
process in class org.apache.uima.analysis_component.JCasAnnotator_ImplBaseorg.apache.uima.analysis_engine.AnalysisEngineProcessExceptionCopyright © 2023. All rights reserved.