| Package | Description |
|---|---|
| eus.ixa.ixa.pipe.ml |
Package containing common code for training and evaluating IXA pipes models.
|
| eus.ixa.ixa.pipe.ml.formats |
Classes to directly train NERC models from a variety of corpus formats.
|
| eus.ixa.ixa.pipe.ml.lemma |
Package containing classes related to dictionary lemmatizers.
|
| eus.ixa.ixa.pipe.ml.nerc |
Package containing Named Entity Recognition and Classification classes.
|
| eus.ixa.ixa.pipe.ml.parse |
Package containing the Parser classes.
|
| eus.ixa.ixa.pipe.ml.pos |
Package containing the POS tagging classes.
|
| eus.ixa.ixa.pipe.ml.resources |
Package containing the dictionaries, clustering lexicons and other resources.
|
| eus.ixa.ixa.pipe.ml.sequence |
The package containing all the infrastructure for Sequence objects.
|
| eus.ixa.ixa.pipe.ml.utils |
Utility classes.
|
| Modifier and Type | Method and Description |
|---|---|
Span[][] |
StatisticalSequenceLabeler.getAllTags(String[] tokens)
Produces a multidimensional array containing all the taggings
possible for a given sentence.
|
Span[] |
StatisticalSequenceLabeler.lemmatizeToSpans(String[] tokens) |
Span[] |
StatisticalSequenceLabeler.seqToSpans(String[] tokens)
Get array of Spans from a list of tokens.
|
| Modifier and Type | Method and Description |
|---|---|
List<SequenceLabel> |
StatisticalSequenceLabeler.getLemmaSequencesFromSpans(String[] tokens,
Span[] seqSpans) |
com.google.common.collect.ListMultimap<String,String> |
StatisticalSequenceLabeler.getMultipleLemmas(String[] tokens,
Span[][] posTags)
Takes a sentence with multiple tags alternatives for each word and produces
a lemma for each of the word-tag combinations.
|
List<SequenceLabel> |
StatisticalSequenceLabeler.getSequencesFromSpans(String[] tokens,
Span[] seqSpans)
Creates a list of
SequenceLabel objects from spans and tokens. |
| Modifier and Type | Method and Description |
|---|---|
static Span |
CoNLL02Format.extract(int begin,
int end,
String beginTag) |
| Modifier and Type | Method and Description |
|---|---|
List<String> |
DictionaryLemmatizer.lemmatize(String[] tokens,
Span[] postags) |
| Modifier and Type | Method and Description |
|---|---|
Span[] |
NumericNERTagger.nercToSpans(String[] tokens) |
Span[] |
DictionariesNERTagger.nercToSpans(String[] tokens)
Detects Named Entities in a
Dictionaries by NE type ignoring case. |
Span[] |
DictionariesNERTagger.nercToSpansExact(String[] tokens)
Detects Named Entities in a
Dictionaries by NE type This method is
case sensitive. |
| Modifier and Type | Method and Description |
|---|---|
List<SequenceLabel> |
NumericNERTagger.getNamesFromSpans(Span[] neSpans,
String[] tokens) |
List<SequenceLabel> |
DictionariesNERTagger.getNamesFromSpans(Span[] neSpans,
String[] tokens)
Creates a list of
SequenceLabel objects from spans and tokens. |
| Modifier and Type | Method and Description |
|---|---|
Span |
Parse.getSpan()
Returns the character offsets for this constituent.
|
Span |
Constituent.getSpan()
Returns the span of the constituent.
|
| Modifier and Type | Method and Description |
|---|---|
static void |
Parse.addNames(String tag,
Span[] names,
Parse[] tokens)
Utility method to inserts named entities.
|
| Constructor and Description |
|---|
Constituent(String label,
Span span) |
Parse(String text,
Span span,
String type,
double p,
int index)
Creates a new parse node for this specified text and span of the specified type with the specified probability
and the specified head index.
|
Parse(String text,
Span span,
String type,
double p,
Parse h)
Creates a new parse node for this specified text and span of the specified type with the specified probability
and the specified head and head index.
|
| Modifier and Type | Method and Description |
|---|---|
Span[] |
MultiWordMatcher.multiWordsToSpans(String[] tokens)
Detects multiword expressions ignoring case.
|
| Modifier and Type | Method and Description |
|---|---|
Span[] |
SequenceModelResource.seqToSpans(String[] tokens)
Tag the current sentence.
|
| Modifier and Type | Method and Description |
|---|---|
List<String> |
MFSResource.getFirstSenseBilou(List<String> lemmas,
Span[] posTags)
Extract most frequent sense baseline from WordNet data, using Ciaramita and
Altun's (2006) approach for bilou encoding.
|
List<String> |
MFSResource.getFirstSenseBio(List<String> lemmas,
Span[] posTags)
Extract most frequent sense baseline from WordNet data, using Ciaramita and
Altun's (2006) approach for a bio encoding.
|
| Modifier and Type | Method and Description |
|---|---|
protected Span[] |
SequenceLabelerDetailedFMeasureListener.asSpanArray(SequenceLabelSample sample) |
protected abstract Span[] |
DetailedFMeasureListener.asSpanArray(T sample) |
Span[] |
BilouCodec.decode(List<String> c) |
Span[] |
BioCodec.decode(List<String> c) |
Span[] |
SequenceLabelerCodec.decode(List<T> c)
Decodes a sequence T objects into Span objects.
|
static Span[] |
SequenceLabelerME.dropOverlappingSpans(Span[] spans)
Removes spans with are intersecting or crossing in anyway.
|
Span[] |
SequenceLabelSample.getSequences() |
Span |
SequenceLabel.getSpan()
Return the Span (or null if none).
|
Span[][] |
SequenceLabelerME.tag(int numTaggings,
String[] tokens)
Returns at most the specified number of taggings for the specified sentence.
|
Span[] |
SequenceLabeler.tag(String[] tokens)
Generates tags for the given sequence, typically a sentence, returning token spans for any identified sequences.
|
Span[] |
SequenceLabelerME.tag(String[] tokens) |
Span[] |
SequenceLabelerME.tag(String[] tokens,
String[][] additionalContext)
Generates sequence tags for the given sequence, returning
spans for any identified sequences.
|
| Modifier and Type | Method and Description |
|---|---|
SequenceLabel |
SequenceLabelFactory.createSequence(String seqString,
String seqType,
Span seqSpan)
Constructs a
SequenceLabel as a String with a type and a Span
specified in terms of the number of tokens it contains. |
SequenceLabel |
SequenceLabelFactory.createSequence(String seqString,
String seqType,
Span seqSpan,
int offset,
int length)
Constructs a Sequence as a String with corresponding offsets and length from
which to calculate start and end position of the Sequence.
|
static Span[] |
SequenceLabelerME.dropOverlappingSpans(Span[] spans)
Removes spans with are intersecting or crossing in anyway.
|
String[] |
BilouCodec.encode(Span[] sequences,
int length) |
String[] |
BioCodec.encode(Span[] sequences,
int length) |
T[] |
SequenceLabelerCodec.encode(Span[] sequences,
int length)
Encodes Span objects into a sequence of T objects.
|
static String[] |
SequenceLabelerEventStream.generateOutcomes(Span[] names,
String type,
int length)
Deprecated.
use the BioCodec implementation of the SequenceValidator instead!
|
protected void |
EvaluationErrorPrinter.printError(Span[] references,
Span[] predictions,
T referenceSample,
T predictedSample,
String sentence) |
protected void |
EvaluationErrorPrinter.printError(Span[] references,
Span[] predictions,
T referenceSample,
T predictedSample,
String sentence) |
protected void |
EvaluationErrorPrinter.printError(Span[] references,
Span[] predictions,
T referenceSample,
T predictedSample,
String[] sentenceTokens) |
protected void |
EvaluationErrorPrinter.printError(Span[] references,
Span[] predictions,
T referenceSample,
T predictedSample,
String[] sentenceTokens) |
protected void |
EvaluationErrorPrinter.printError(String id,
Span[] references,
Span[] predictions,
T referenceSample,
T predictedSample,
String[] sentenceTokens) |
protected void |
EvaluationErrorPrinter.printError(String id,
Span[] references,
Span[] predictions,
T referenceSample,
T predictedSample,
String[] sentenceTokens) |
double[] |
SequenceLabelerME.probs(Span[] spans)
Returns an array of probabilities for each of the specified spans which is
the arithmetic mean of the probabilities for each of the outcomes which
make up the span.
|
void |
SequenceLabel.setSpan(Span span)
Set the Span for the Name.
|
| Constructor and Description |
|---|
SequenceLabel(String aStr,
String aType,
Span aSeqSpan)
Create a new
Sequence with the given string and Span. |
SequenceLabel(String aStr,
String aType,
Span aSeqSpan,
int aStartOffset,
int aSeqLength)
Creates a new
Sequence with the given content. |
SequenceLabelSample(String[] tokens,
Span[] sequences,
boolean clearAdaptiveData) |
SequenceLabelSample(String[] tokens,
Span[] sequences,
String[][] additionalContext,
boolean clearAdaptiveData)
Initializes the current instance.
|
SequenceLabelSample(String id,
String[] tokens,
Span[] sequences,
String[][] additionalContext,
boolean clearAdaptiveData) |
| Modifier and Type | Method and Description |
|---|---|
Span |
Span.trim(CharSequence text)
Return a copy of this span with leading and trailing white spaces removed.
|
| Modifier and Type | Method and Description |
|---|---|
int |
Span.compareTo(Span s)
Compares the specified span to the current span.
|
static void |
Span.concatenateSpans(List<Span> allSpans,
Span[] neSpans)
Concatenates two span lists adding the spans of the second parameter to the
list in first parameter.
|
boolean |
Span.contains(Span s)
Returns true if the specified span is contained by this span.
|
boolean |
Span.crosses(Span s)
Returns true is the specified span crosses this span.
|
static String[] |
StringUtils.decodeLemmas(String[] tokens,
Span[] preds)
Decodes the lemma from the word and the induced lemma class.
|
static void |
StringUtils.decodeLemmasToSpans(String[] tokens,
Span[] preds)
Decodes the lemma induced type into the lemma and sets it as
value of the Span type.
|
static String[] |
Span.getTypesFromSpans(Span[] spans,
String[] tokens)
Get an array of Spans and their associated tokens and obtains an array of
Strings containing the type for each Span.
|
boolean |
Span.intersects(Span s)
Returns true if the specified span intersects with this span.
|
static void |
Span.postProcessDuplicatedSpans(List<Span> preList,
Span[] postList)
Removes spans from the preList if the span is contained in the postList.
|
static String[] |
Span.spansToStrings(Span[] spans,
CharSequence s)
|
static String[] |
Span.spansToStrings(Span[] spans,
String[] tokens) |
boolean |
Span.startsWith(Span s)
Returns true if the specified span is the begin of this span and the
specified span is contained in this span.
|
| Modifier and Type | Method and Description |
|---|---|
static void |
Span.concatenateSpans(List<Span> allSpans,
Span[] neSpans)
Concatenates two span lists adding the spans of the second parameter to the
list in first parameter.
|
static void |
Span.postProcessDuplicatedSpans(List<Span> preList,
Span[] postList)
Removes spans from the preList if the span is contained in the postList.
|
| Constructor and Description |
|---|
Span(Span span,
double prob)
Creates a new immutable span based on an existing span, where the existing
span did not include the prob
|
Span(Span span,
int offset)
Initializes a new Span object with an existing Span which is shifted by an
offset.
|
Copyright © 2016 IXA pipes. All rights reserved.