public class CentroidTfidfSimilarityExtractor<OUTCOME_T,FOCUS_T extends Annotation> extends TfidfExtractor<OUTCOME_T,FOCUS_T>
TfidfExtractor.IDFMapidfMap, isTrained, subExtractorname| Constructor and Description |
|---|
CentroidTfidfSimilarityExtractor(String name) |
CentroidTfidfSimilarityExtractor(String name,
FeatureExtractor1<FOCUS_T> extractor) |
| Modifier and Type | Method and Description |
|---|---|
protected Map<String,Double> |
computeCentroid(Iterable<Instance<OUTCOME_T>> instances,
TfidfExtractor.IDFMap idfs) |
List<Feature> |
extract(JCas view,
FOCUS_T focusAnnotation)
Some feature extractors will require some specific number of annotations that is greater than
one.
|
Map<String,Double> |
featuresToFeatureMap(List<Feature> features) |
static URI |
getCentroidDataURI(String name,
URI baseURI) |
static URI |
getDocumentFrequencyDataURI(String name,
URI baseURI) |
void |
load(URI baseURI)
Loads statistics from location URI
|
void |
save(URI baseURI)
Saves statistics from train in location URI
|
void |
train(Iterable<Instance<OUTCOME_T>> instances)
In the prototypical case, train takes a collection of instances and computes statistics over
the values such as computing mean, standard deviation, TF*IDF, etc...
|
Instance<OUTCOME_T> |
transform(Instance<OUTCOME_T> instance)
Transforms all features handled by this extractor.
|
createIdfMap, transformisTransformablepublic CentroidTfidfSimilarityExtractor(String name)
public CentroidTfidfSimilarityExtractor(String name, FeatureExtractor1<FOCUS_T> extractor)
protected Map<String,Double> computeCentroid(Iterable<Instance<OUTCOME_T>> instances, TfidfExtractor.IDFMap idfs)
public List<Feature> extract(JCas view, FOCUS_T focusAnnotation) throws CleartkExtractorException
FeatureExtractor1extract in interface FeatureExtractor1<FOCUS_T extends Annotation>extract in class TfidfExtractor<OUTCOME_T,FOCUS_T extends Annotation>view - the current view of the document.focusAnnotation - the annotation for which to collect features.CleartkExtractorExceptionpublic static URI getCentroidDataURI(String name, URI baseURI) throws MalformedURLException, URISyntaxException
public static URI getDocumentFrequencyDataURI(String name, URI baseURI) throws MalformedURLException, URISyntaxException
public void load(URI baseURI) throws IOException
TrainableExtractorload in interface TrainableExtractor<OUTCOME_T>load in class TfidfExtractor<OUTCOME_T,FOCUS_T extends Annotation>IOExceptionpublic void save(URI baseURI) throws IOException
TrainableExtractorsave in interface TrainableExtractor<OUTCOME_T>save in class TfidfExtractor<OUTCOME_T,FOCUS_T extends Annotation>IOExceptionpublic void train(Iterable<Instance<OUTCOME_T>> instances)
TrainableExtractortrain in interface TrainableExtractor<OUTCOME_T>train in class TfidfExtractor<OUTCOME_T,FOCUS_T extends Annotation>instances - - URI pointing to the output location for saving statisticspublic Instance<OUTCOME_T> transform(Instance<OUTCOME_T> instance)
TrainableExtractorTrainableExtractor.train(Iterable) was called, to complete the processing of the Instance.transform in interface TrainableExtractor<OUTCOME_T>transform in class OneToOneTrainableExtractor_ImplBase<OUTCOME_T>instance - An instance that was created before TrainableExtractor.train(Iterable) was called.Copyright © 2014. All rights reserved.