public class Annotate extends Object
| Constructor and Description |
|---|
Annotate(Properties properties)
It manages the use of the three different name finders:
StatisticalNameFinder,
DictionariesNameFinder and NumericNameFinder. |
| Modifier and Type | Method and Description |
|---|---|
void |
annotateNEs(ixa.kaflib.KAFDocument kaf)
Classify Named Entities creating the entities layer in the
KAFDocument using statistical models, post-processing and/or
dictionaries only. |
String |
annotateNEsToCoNLL2002(ixa.kaflib.KAFDocument kaf)
Output Conll2002 format.
|
String |
annotateNEsToCoNLL2003(ixa.kaflib.KAFDocument kaf)
Output Conll2003 format.
|
String |
annotateNEsToKAF(ixa.kaflib.KAFDocument kaf)
Output annotation as NAF.
|
String |
annotateNEsToOpenNLP(ixa.kaflib.KAFDocument kaf)
Output annotation in OpenNLP format.
|
String |
convertToConLLTypes(String neType)
Convert Entity class annotation to CoNLL formats.
|
StatisticalNameFinder |
getStatisticalNameFinder()
Get the statistical namefinder.
|
public Annotate(Properties properties) throws IOException
StatisticalNameFinder,
DictionariesNameFinder and NumericNameFinder. In particular, if --dictTag
option in CLI is off, statistical models are used (this is the default). If --dictTag is
activated, it has two options, "tag" and "post": tag only tags with a gazetteer and "post"
post-processes the probabilistic annotation giving priority to the gazetteer.
Obviously, this option depends on the --dictPath parameter being correctly specified. The
--lexer numeric option annotates numeric entities (dates, percentages, and so on) via rules.
Finally, the oepc option post-process the output using the one entity per class hypothesis.properties - the propertiesIOException - the io thrownpublic StatisticalNameFinder getStatisticalNameFinder()
public final void annotateNEs(ixa.kaflib.KAFDocument kaf)
throws IOException
KAFDocument using statistical models, post-processing and/or
dictionaries only.kaf - the kaf document to be used for annotationIOException - throws exception if problems with the kaf documentpublic final String annotateNEsToKAF(ixa.kaflib.KAFDocument kaf)
kaf - the naf documentpublic final String annotateNEsToOpenNLP(ixa.kaflib.KAFDocument kaf)
kaf - the naf documentpublic String annotateNEsToCoNLL2003(ixa.kaflib.KAFDocument kaf)
kaf - the kaf documentpublic String annotateNEsToCoNLL2002(ixa.kaflib.KAFDocument kaf)
kaf - the kaf documentCopyright © 2015 IXA pipes. All rights reserved.