A B C E F G L M O P R S T

A

addUnitOnValues(List<String>, String) - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
Adds the unit to each value in the list.
annotate(ComposedUnit, Map<String, List<String>>) - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
Annotates cu with the predicates and literals contained in toAnnot.
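
The index gives only a signature and a one-line summary for addUnitOnValues(List<String>, String). As a rough illustration of what "adding a unit to each value" could look like, here is a minimal sketch. The class name AddUnitSketch, the single-space separator, and the choice to return a new list rather than modify the argument are all assumptions; the actual method in TikaExtractorService may behave differently.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class AddUnitSketch {

    // Hypothetical re-implementation, not the service's code:
    // returns a new list in which every value is followed by the unit.
    // The single-space separator is an assumption.
    public static List<String> addUnitOnValues(List<String> values, String unit) {
        List<String> result = new ArrayList<>(values.size());
        for (String value : values) {
            result.add(value + " " + unit);
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> sizes = Arrays.asList("1024", "2048");
        System.out.println(addUnitOnValues(sizes, "bytes")); // prints [1024 bytes, 2048 bytes]
    }
}
```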

B

BASE_URI_PROPERTY_NAME - Static variable in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 

C

characters(char[], int, int) - Method in class org.ow2.weblab.services.normaliser.tika.MediaUnitContentHandler
 
checkArgs(ProcessArgs) - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 
cleanMap(Map<String, List<String>>) - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
Modifies the Map passed as a parameter.
CONFIG_FILE - Static variable in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
Properties file
convertToISO8601Date(String) - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 
CustomOfficeParser - Class in org.apache.tika.parser.microsoft
Defines a Microsoft document content extractor.
CustomOfficeParser() - Constructor for class org.apache.tika.parser.microsoft.CustomOfficeParser
 
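
convertToISO8601Date(String) is listed above without a description. Judging by its name alone, it normalises a date string to ISO 8601. The sketch below shows one way such a conversion might be written with java.time. The accepted input formats (ISO 8601 with offset, RFC 1123, and a bare yyyy-MM-dd date), the normalisation to UTC, and the class name Iso8601Sketch are assumptions made for illustration, not documented behaviour of the service.

```java
import java.time.LocalDate;
import java.time.OffsetDateTime;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;

class Iso8601Sketch {

    // Hypothetical conversion: tries a few input formats in turn and
    // returns the same instant as an ISO 8601 date-time in UTC.
    public static String convertToISO8601Date(String raw) {
        String value = raw.trim();
        DateTimeFormatter[] withOffset = {
            DateTimeFormatter.ISO_OFFSET_DATE_TIME, // e.g. 2010-03-15T10:30:00+01:00
            DateTimeFormatter.RFC_1123_DATE_TIME    // e.g. Mon, 15 Mar 2010 10:30:00 GMT
        };
        for (DateTimeFormatter formatter : withOffset) {
            try {
                OffsetDateTime parsed = OffsetDateTime.parse(value, formatter);
                return parsed.withOffsetSameInstant(ZoneOffset.UTC)
                             .format(DateTimeFormatter.ISO_OFFSET_DATE_TIME);
            } catch (DateTimeParseException ignored) {
                // fall through to the next candidate format
            }
        }
        try {
            // Date only: midnight UTC is assumed because no time or zone is given.
            return LocalDate.parse(value).atStartOfDay().atOffset(ZoneOffset.UTC)
                            .format(DateTimeFormatter.ISO_OFFSET_DATE_TIME);
        } catch (DateTimeParseException e) {
            throw new IllegalArgumentException("Unrecognised date: " + raw, e);
        }
    }

    public static void main(String[] args) {
        System.out.println(convertToISO8601Date("2010-03-15"));
        System.out.println(convertToISO8601Date("Mon, 15 Mar 2010 10:30:00 GMT"));
    }
}
```

Throwing on unrecognised input is a design choice of this sketch; a service that must keep processing a document might prefer to log and skip the value instead.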

E

EmlParser - Class in org.apache.tika.parser.microsoft
Defines an EML document content extractor.
EmlParser() - Constructor for class org.apache.tika.parser.microsoft.EmlParser
 
endElement(String, String, String) - Method in class org.ow2.weblab.services.normaliser.tika.MediaUnitContentHandler
 
extractTextAndMetadata(ComposedUnit, File, Map<String, List<String>>, boolean) - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 

F

fillMapWithMetadata(Map<String, List<String>>, Metadata) - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
The method converts the metadata extracted by Tika into a Map of predicates with their values that can be annotated.
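
The conversion described for fillMapWithMetadata can be pictured roughly as follows. To keep the sketch free of external dependencies, a plain Map<String, String[]> stands in for Tika's Metadata object (the real class exposes names() and getValues(String)). The property-to-predicate table, the trimming of values, and the class name FillMapSketch are assumptions; the index does not say which predicates the service actually uses.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class FillMapSketch {

    // Hypothetical mapping from extracted property names to predicate URIs.
    static final Map<String, String> PREDICATES = new HashMap<>();
    static {
        PREDICATES.put("title", "http://purl.org/dc/elements/1.1/title");
        PREDICATES.put("Author", "http://purl.org/dc/elements/1.1/creator");
    }

    // Copies every mapped, non-blank metadata value into toAnnot,
    // keyed by predicate URI. Unmapped properties are ignored.
    public static void fillMapWithMetadata(Map<String, List<String>> toAnnot,
                                           Map<String, String[]> metadata) {
        for (Map.Entry<String, String[]> entry : metadata.entrySet()) {
            String predicate = PREDICATES.get(entry.getKey());
            if (predicate == null) {
                continue;
            }
            List<String> cleaned = new ArrayList<>();
            for (String value : entry.getValue()) {
                if (value != null && !value.trim().isEmpty()) {
                    cleaned.add(value.trim());
                }
            }
            if (!cleaned.isEmpty()) {
                toAnnot.computeIfAbsent(predicate, k -> new ArrayList<>()).addAll(cleaned);
            }
        }
    }

    public static void main(String[] args) {
        Map<String, String[]> metadata = new LinkedHashMap<>();
        metadata.put("title", new String[] {" Annual report "});
        metadata.put("Author", new String[] {"A. Writer", ""});
        Map<String, List<String>> toAnnot = new LinkedHashMap<>();
        fillMapWithMetadata(toAnnot, metadata);
        System.out.println(toAnnot);
    }
}
```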

G

getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.microsoft.CustomOfficeParser
 
getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.microsoft.EmlParser
 
getTikaConfig() - Static method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 

L

loadTikaServiceProps() - Method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 

M

MediaUnitContentHandler - Class in org.ow2.weblab.services.normaliser.tika
 
MediaUnitContentHandler(ContentHandler, ComposedUnit) - Constructor for class org.ow2.weblab.services.normaliser.tika.MediaUnitContentHandler
 
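
The methods listed for MediaUnitContentHandler elsewhere in this index (startElement, characters, endElement) are the standard SAX ContentHandler callbacks. The index does not describe how the class turns those events into media units, so the following is only a sketch of the general pattern: a handler that collects the text of each <p> element it sees. The class name ParagraphHandlerSketch, the choice of <p> as the unit boundary, and the extract helper are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

class ParagraphHandlerSketch extends DefaultHandler {

    private final List<String> paragraphs = new ArrayList<>();
    private StringBuilder current; // non-null only while inside a <p> element

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        if ("p".equals(qName)) {
            current = new StringBuilder();
        }
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // SAX may deliver one text node in several chunks, so append each one.
        if (current != null) {
            current.append(ch, start, length);
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if ("p".equals(qName) && current != null) {
            String text = current.toString().trim();
            if (!text.isEmpty()) {
                paragraphs.add(text);
            }
            current = null;
        }
    }

    public List<String> getParagraphs() {
        return paragraphs;
    }

    // Convenience helper for the example: parses a string with the JDK's SAX parser.
    public static List<String> extract(String xml) {
        try {
            ParagraphHandlerSketch handler = new ParagraphHandlerSketch();
            SAXParserFactory.newInstance().newSAXParser().parse(
                    new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), handler);
            return handler.getParagraphs();
        } catch (Exception e) {
            throw new IllegalStateException("Parsing failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(extract("<body><p>First</p><div>skipped</div><p>Second</p></body>"));
        // prints [First, Second]
    }
}
```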

O

org.apache.tika.parser.microsoft - package org.apache.tika.parser.microsoft
 
org.ow2.weblab.services.normaliser.tika - package org.ow2.weblab.services.normaliser.tika
 
OVERRIDE_METADATA_PROPERTY_NAME - Static variable in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 

P

parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.microsoft.CustomOfficeParser
Extracts properties and text from an MS Document input stream
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.microsoft.CustomOfficeParser
Deprecated. This method will be removed in Apache Tika 1.0.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.microsoft.EmlParser
Extracts properties and text from an EML Document input stream
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.microsoft.EmlParser
Deprecated. This method will be removed in Apache Tika 1.0.
process(ProcessArgs) - Method in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 

R

REMOVE_COTNENT_PROPERTY_NAME - Static variable in class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
 

S

startElement(String, String, String, Attributes) - Method in class org.ow2.weblab.services.normaliser.tika.MediaUnitContentHandler
 

T

TikaExtractorService - Class in org.ow2.weblab.services.normaliser.tika
The Tika extractor is quite simple, since it does not handle the structure of documents (sheets in Excel, paragraphs in Word, etc.).
TikaExtractorService() - Constructor for class org.ow2.weblab.services.normaliser.tika.TikaExtractorService
The default and only constructor.

Copyright © 2004-2010. All Rights Reserved.