Skip navigation links
A B C D E F G H I J L M N O P Q R S T U V W 

A

AbstractIdentifier - Class in org.icij.extract.document
 
AbstractIdentifier(String, Charset) - Constructor for class org.icij.extract.document.AbstractIdentifier
 
AbstractIdentifier() - Constructor for class org.icij.extract.document.AbstractIdentifier
 
accept(TikaDocument) - Method in class org.icij.extract.extractor.DocumentConsumer
Consume a file.
accept(String, String[]) - Method in interface org.icij.spewer.MetadataTransformer.ValueArrayConsumer
 
accept(String, String) - Method in interface org.icij.spewer.MetadataTransformer.ValueConsumer
 
add(Option<T>) - Method in class org.icij.task.Options
 
add(String, Function<Option<T>, OptionParser<T>>) - Method in class org.icij.task.Options
 
add(Option, Function<Option<T>, OptionParser<T>>) - Method in class org.icij.task.Options
 
add(OptionsClass, Function<Option<T>, OptionParser<T>>) - Method in class org.icij.task.Options
 
addEmbed(Metadata) - Method in class org.icij.extract.document.TikaDocument
 
addEmbed(String, Identifier, Path, Metadata) - Method in class org.icij.extract.document.TikaDocument
 
addListener(Listener) - Method in class org.icij.event.DefaultMonitor
 
addListener(Listener) - Method in interface org.icij.event.Monitorable
 
addListener(Listener) - Method in class org.icij.task.MonitorableTask
 
addTask(String, Class<? extends DefaultTask>) - Method in class org.icij.task.DefaultTaskFactory
 
afterExecute(Runnable, Throwable) - Method in class org.icij.concurrent.BlockingThreadPoolExecutor
After calling super's implementation of this method, the number of acquired permits is decremented.
ArrayDocumentQueue - Class in org.icij.extract.queue
A DocumentQueue using an array as a backend.
ArrayDocumentQueue(int) - Constructor for class org.icij.extract.queue.ArrayDocumentQueue
Instantiate a new ArrayPathQueue with the given capacity.
asBoolean() - Method in interface org.icij.task.OptionParser
 
asBoolean() - Method in class org.icij.task.StringOptionParser
 
asCharset() - Method in interface org.icij.task.OptionParser
 
asCharset() - Method in class org.icij.task.StringOptionParser
 
asDuration() - Method in interface org.icij.task.OptionParser
 
asDuration() - Method in class org.icij.task.StringOptionParser
 
asEnum(Function<V, E>) - Method in interface org.icij.task.OptionParser
 
asEnum(Function<String, E>) - Method in class org.icij.task.StringOptionParser
 
asInteger() - Method in interface org.icij.task.OptionParser
 
asInteger() - Method in class org.icij.task.StringOptionParser
 
asPath() - Method in interface org.icij.task.OptionParser
 
asPath() - Method in class org.icij.task.StringOptionParser
 
asURI() - Method in interface org.icij.task.OptionParser
 
asURI() - Method in class org.icij.task.StringOptionParser
 
available() - Method in class org.icij.extract.io.DataURIEncodingInputStream
 
await() - Method in class org.icij.concurrent.BlockingThreadPoolExecutor
A blocking wait for this ThreadPoolExecutor to be in idle state, which means that there are no more tasks in the Queue or currently executed by one of the threads.
await(long, TimeUnit) - Method in class org.icij.concurrent.BlockingThreadPoolExecutor
A blocking wait for this ThreadPool to be in idle state or a certain timeout to elapse.
await() - Method in class org.icij.concurrent.BooleanSealableLatch
 
await() - Method in interface org.icij.concurrent.SealableLatch
 
awaitTermination(long, TimeUnit) - Method in class org.icij.concurrent.ExecutorProxy
Blocks until all the queued tasks have finished and the thread pool is empty, or the timeout is reached (whichever first).

B

BlockingThreadPoolExecutor - Class in org.icij.concurrent
This class is a specialized extension of the ThreadPoolExecutor class.
BlockingThreadPoolExecutor(int, int, long, TimeUnit, BlockingQueue<Runnable>, ThreadFactory, RejectedExecutionHandler, long, TimeUnit) - Constructor for class org.icij.concurrent.BlockingThreadPoolExecutor
This constructor is used in order to maintain the first functionality specified above.
BlockingThreadPoolExecutor(int, int, long, TimeUnit, BlockingQueue<Runnable>) - Constructor for class org.icij.concurrent.BlockingThreadPoolExecutor
 
BlockingThreadPoolExecutor(int, int, long, TimeUnit) - Constructor for class org.icij.concurrent.BlockingThreadPoolExecutor
 
BlockingThreadPoolExecutor(int) - Constructor for class org.icij.concurrent.BlockingThreadPoolExecutor
 
BodgeHostnameVerifier(String) - Constructor for class org.icij.spewer.http.PinnedHttpClientBuilder.BodgeHostnameVerifier
 
BooleanSealableLatch - Class in org.icij.concurrent
A latch class that is like a CountDownLatch except that it only requires a single signal to fire.
BooleanSealableLatch() - Constructor for class org.icij.concurrent.BooleanSealableLatch
 
build() - Method in class org.icij.spewer.http.PinnedHttpClientBuilder
 

C

cacheHit() - Method in class org.icij.extract.parser.CachingTesseractOCRParser
 
cacheMiss() - Method in class org.icij.extract.parser.CachingTesseractOCRParser
 
CachingTesseractOCRParser - Class in org.icij.extract.parser
 
CachingTesseractOCRParser(Path) - Constructor for class org.icij.extract.parser.CachingTesseractOCRParser
 
call() - Method in class org.icij.extract.ScannerVisitor
Recursively walks the file tree of a directory.
call(String[]) - Method in class org.icij.task.DefaultTask
 
call(U) - Method in interface org.icij.task.Task
 
characters(char[], int, int) - Method in class org.icij.extract.parser.HTML5Serializer
Writes out characters.
check(TikaDocument, ExtractionStatus) - Method in class org.icij.extract.report.Reporter
Check an extraction result.
clearLatch() - Method in class org.icij.extract.queue.DocumentQueueDrainer
Clear the latch.
clearPollTimeout() - Method in class org.icij.extract.queue.DocumentQueueDrainer
Causes the consumer to wait until a new file is available, without any timeout.
clearReader() - Method in class org.icij.extract.document.TikaDocument
 
close() - Method in class org.icij.extract.io.DataURIEncodingInputStream
 
close() - Method in class org.icij.extract.io.TokenReplacingReader
 
close() - Method in class org.icij.extract.parser.EmbeddingHTMLParsingReader
 
close() - Method in class org.icij.extract.parser.ParsingReader
Closes the read end of the pipe.
close() - Method in class org.icij.extract.queue.ArrayDocumentQueue
 
close() - Method in class org.icij.extract.redis.RedisDocumentQueue
 
close() - Method in class org.icij.extract.report.HashMapReportMap
 
close() - Method in class org.icij.extract.report.Reporter
 
close() - Method in class org.icij.spewer.FileSpewer
 
close() - Method in class org.icij.spewer.PrintStreamSpewer
 
close() - Method in class org.icij.spewer.RESTSpewer
 
code(Character) - Method in class org.icij.task.Option
 
code(String) - Method in class org.icij.task.Option
 
code() - Method in class org.icij.task.Option
 
configure(Options<String>) - Method in class org.icij.extract.document.DocumentFactory
 
configure(Options<String>) - Method in class org.icij.extract.extractor.Extractor
 
configure(Options<String>) - Method in class org.icij.extract.queue.DocumentQueueDrainer
 
configure(Options<String>) - Method in class org.icij.extract.Scanner
Configure the scanner with the given options.
configure(Options<String>) - Method in class org.icij.spewer.FieldNames
 
configure(Options<String>) - Method in class org.icij.spewer.FileSpewer
 
configure(Options<String>) - Method in class org.icij.spewer.Spewer
 
ConsoleProgressListener - Class in org.icij.event.listeners
 
ConsoleProgressListener(ProgressBar) - Constructor for class org.icij.event.listeners.ConsoleProgressListener
 
CONTENT_DISPOSITION - Static variable in class org.icij.extract.document.TikaDocument
 
CONTENT_ENCODING - Static variable in class org.icij.extract.document.TikaDocument
 
CONTENT_LANGUAGE - Static variable in class org.icij.extract.document.TikaDocument
 
CONTENT_LENGTH - Static variable in class org.icij.extract.document.TikaDocument
 
CONTENT_LOCATION - Static variable in class org.icij.extract.document.TikaDocument
 
CONTENT_MD5 - Static variable in class org.icij.extract.document.TikaDocument
 
CONTENT_TYPE - Static variable in class org.icij.extract.document.TikaDocument
 
context - Variable in class org.icij.extract.extractor.EmbedParser
 
context - Variable in class org.icij.extract.parser.ParsingReader
The parse context.
copy(Reader, OutputStream) - Method in class org.icij.spewer.Spewer
 
copy(Reader, Writer) - Static method in class org.icij.spewer.Spewer
 
CountdownHttpRequestRetryHandler - Class in org.icij.spewer.http
Implements HttpRequestRetryHandler for retrying HTTP requests up to the given number of maximum attempts.
CountdownHttpRequestRetryHandler() - Constructor for class org.icij.spewer.http.CountdownHttpRequestRetryHandler
 
CountdownHttpRequestRetryHandler(int) - Constructor for class org.icij.spewer.http.CountdownHttpRequestRetryHandler
 
countListeners() - Method in class org.icij.event.DefaultMonitor
 
countListeners() - Method in interface org.icij.event.Monitorable
 
countListeners() - Method in class org.icij.task.MonitorableTask
 
create(String, Path) - Method in class org.icij.extract.document.DocumentFactory
 
create(String, Path, long) - Method in class org.icij.extract.document.DocumentFactory
 
create(String, Path, Metadata) - Method in class org.icij.extract.document.DocumentFactory
 
create(Path) - Method in class org.icij.extract.document.DocumentFactory
 
create(Path, BasicFileAttributes) - Method in class org.icij.extract.document.DocumentFactory
 
create(Path, long) - Method in class org.icij.extract.document.DocumentFactory
 
create(String) - Method in class org.icij.extract.document.DocumentFactory
 
create(String, String) - Method in class org.icij.extract.document.DocumentFactory
 
create(Path, Metadata) - Method in class org.icij.extract.document.DocumentFactory
 
create(URL) - Method in class org.icij.extract.document.DocumentFactory
 
create() - Method in class org.icij.extract.redis.RedissonClientFactory
Create a new connection manager for a single server using the supplied address.
createFrom(Options<T>) - Method in class org.icij.task.Options
 
createReader(Path, Metadata) - Static method in class org.icij.extract.io.DataURIEncodingInputStream
 
createScannerVisitor(Path) - Method in class org.icij.extract.Scanner
 
createTrustStore(String, String) - Static method in class org.icij.spewer.http.PinnedHttpClientBuilder
 
createWithDefaults() - Static method in class org.icij.spewer.http.PinnedHttpClientBuilder
 

D

DataURIEncodingInputStream - Class in org.icij.extract.io
An InputStream that encodes arbitrary binary data from an input stream in data URI format.
DataURIEncodingInputStream(Path, Metadata) - Constructor for class org.icij.extract.io.DataURIEncodingInputStream
 
DataURIEncodingInputStream(InputStream, MediaType) - Constructor for class org.icij.extract.io.DataURIEncodingInputStream
 
decode(ByteBuf, State) - Method in class org.icij.extract.redis.DocumentDecoder
 
decode(ByteBuf, State) - Method in class org.icij.extract.redis.ResultDecoder
 
DEFAULT_BASE_TYPE_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_ID_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_LEVEL_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_METADATA_FIELD_PREFIX - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_METADATA_ISO_DATE_POSTFIX - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_PARENT_ID_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_PARENT_PATH_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_PATH_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_RETRIES - Static variable in class org.icij.spewer.http.CountdownHttpRequestRetryHandler
 
DEFAULT_ROOT_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_TAG_FIELD_PREFIX - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_TEXT_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DEFAULT_VERSION_FIELD - Static variable in class org.icij.spewer.FieldNames
 
DefaultMonitor - Class in org.icij.event
 
DefaultMonitor() - Constructor for class org.icij.event.DefaultMonitor
 
defaultPoolSize() - Static method in class org.icij.extract.extractor.DocumentConsumer
Returns the default thread pool size, which is equivalent to the number of available processors minus 1, or 1 - whichever is greater.
DefaultTask<R> - Class in org.icij.task
 
DefaultTask() - Constructor for class org.icij.task.DefaultTask
 
DefaultTaskFactory - Class in org.icij.task
 
DefaultTaskFactory() - Constructor for class org.icij.task.DefaultTaskFactory
 
deleteListener(Listener) - Method in class org.icij.event.DefaultMonitor
 
deleteListener(Listener) - Method in interface org.icij.event.Monitorable
 
deleteListener(Listener) - Method in class org.icij.task.MonitorableTask
 
deleteListeners() - Method in class org.icij.event.DefaultMonitor
 
deleteListeners() - Method in interface org.icij.event.Monitorable
 
deleteListeners() - Method in class org.icij.task.MonitorableTask
 
describe(String) - Method in class org.icij.task.Option
 
description() - Method in class org.icij.task.DefaultTask
 
description() - Method in class org.icij.task.Option
 
description() - Method in interface org.icij.task.Task
 
DigestIdentifier - Class in org.icij.extract.document
 
disableOcr() - Method in class org.icij.extract.extractor.Extractor
Disable OCR.
DocumentConsumer - Class in org.icij.extract.extractor
Base consumer for documents.
DocumentConsumer(Spewer, Extractor, ExecutorService) - Constructor for class org.icij.extract.extractor.DocumentConsumer
Create a new consumer that submits tasks to the given Executor.
DocumentConsumer(Spewer, Extractor, int) - Constructor for class org.icij.extract.extractor.DocumentConsumer
Create a new consumer with the given pool size.
DocumentConsumer(Spewer, Extractor) - Constructor for class org.icij.extract.extractor.DocumentConsumer
Create a new consumer with the default pool size, which is the number of available processors.
DocumentDecoder - Class in org.icij.extract.redis
Decoder for converting a string to Path.
DocumentDecoder(DocumentFactory, Charset) - Constructor for class org.icij.extract.redis.DocumentDecoder
 
DocumentEncoder - Class in org.icij.extract.redis
 
DocumentEncoder(Charset) - Constructor for class org.icij.extract.redis.DocumentEncoder
 
DocumentFactory - Class in org.icij.extract.document
A factory class for creating TikaDocument objects with default parameters.
DocumentFactory(Options<String>) - Constructor for class org.icij.extract.document.DocumentFactory
 
DocumentFactory() - Constructor for class org.icij.extract.document.DocumentFactory
 
DocumentQueue - Interface in org.icij.extract.queue
The interface for a queue of TikaDocument objects.
DocumentQueueDrainer - Class in org.icij.extract.queue
Drains a queue by polling for documents to consume.
DocumentQueueDrainer(DocumentQueue, Consumer<TikaDocument>) - Constructor for class org.icij.extract.queue.DocumentQueueDrainer
Create a new drainer that will drain documents from the given queue to the given consumer on a single thread.
DocumentQueueType - Enum in org.icij.extract.queue
An enumerated list of implemented queue types.
DosHiddenFileMatcher - Class in org.icij.extract.io.file
Create a PathMatcher that matches hidden files by checking the DOS hidden file attribute.
DosHiddenFileMatcher() - Constructor for class org.icij.extract.io.file.DosHiddenFileMatcher
 
drain() - Method in class org.icij.extract.queue.DocumentQueueDrainer
Drain the queue in a non-blocking way until the draining thread is interrupted, the task is cancelled or the given timeout is reached (if any is set).
drain(TikaDocument) - Method in class org.icij.extract.queue.DocumentQueueDrainer
Like DocumentQueueDrainer.drain() except that draining will stop when the given poison pill is returned from the queue.

E

EmbedBlocker - Class in org.icij.extract.extractor
A custom extractor that prevents Tika from parsing any embedded documents.
EmbedBlocker() - Constructor for class org.icij.extract.extractor.EmbedBlocker
 
EmbeddedTikaDocument - Class in org.icij.extract.document
 
EmbeddingHTMLParsingReader - Class in org.icij.extract.parser
Example: final String uuid = UUID.randomUUID().toString(); final String open = uuid + "/"; final String close = "/" + uuid; context.set(Parser.class, EmptyParser.INSTANCE); context.set(EmbeddedDocumentExtractor.class, new EmbedLinker(document, tmp, open, close)); reader = new EmbeddingHTMLParsingReader(document, open, close, parser, input, metadata, context);
EmbeddingHTMLParsingReader(TikaDocument, String, String, Parser, TikaInputStream, Metadata, ParseContext) - Constructor for class org.icij.extract.parser.EmbeddingHTMLParsingReader
 
EmbedLinker - Class in org.icij.extract.extractor
A custom extractor that saves all embeds to temporary files and records the new paths.
EmbedParser - Class in org.icij.extract.extractor
A custom extractor that is an almost exact copy of Tika's default extractor for embedded documents.
EmbedSpawner - Class in org.icij.extract.extractor
 
encode(Object) - Method in class org.icij.extract.redis.DocumentEncoder
 
encode(Object) - Method in class org.icij.extract.redis.ResultEncoder
 
endDocument() - Method in class org.icij.extract.parser.HTML5Serializer
Must be called last.
endElement(String, String, String) - Method in class org.icij.extract.parser.HTML5Serializer
Writes an end tag if the element is an XHTML element and is not an empty element in HTML 4.01 Strict.
endPrefixMapping(String) - Method in class org.icij.extract.parser.HTML5Serializer
This method does nothing.
equals(Object) - Method in class org.icij.extract.document.TikaDocument
 
equals(Object) - Method in class org.icij.task.Option
 
equals(Object) - Method in class org.icij.task.Options
 
exclude(String) - Method in class org.icij.extract.Scanner
Add a glob pattern for excluding files and directories.
execute(Runnable) - Method in class org.icij.concurrent.BlockingThreadPoolExecutor
Before calling super's version of this method, a permit is acquired in order to queue the task for execution.
executor - Variable in class org.icij.concurrent.ExecutorProxy
The executor proxied by the implementing class.
ExecutorProxy - Class in org.icij.concurrent
A class of traits used by implementing classes that proxy an executor.
ExecutorProxy(ExecutorService) - Constructor for class org.icij.concurrent.ExecutorProxy
Instantiate a proxy for the given executor.
extract(TikaDocument) - Method in class org.icij.extract.extractor.Extractor
This method will wrap the given TikaDocument in a TikaInputStream and return a Reader which can be used to initiate extraction on demand.
extract(TikaDocument, Spewer) - Method in class org.icij.extract.extractor.Extractor
Extract and spew content from a document.
extract(TikaDocument, Spewer, Reporter) - Method in class org.icij.extract.extractor.Extractor
Extract and spew content from a document.
extract(TikaDocument, TikaInputStream) - Method in class org.icij.extract.extractor.Extractor
Create a pull-parser from the given TikaInputStream.
ExtractionStatus - Enum in org.icij.extract.extractor
Status for the extraction result of a file.
extractor - Variable in class org.icij.extract.extractor.DocumentConsumer
 
Extractor - Class in org.icij.extract.extractor
A reusable class that sets up Tika parsers based on runtime options.
Extractor() - Constructor for class org.icij.extract.extractor.Extractor
Create a new extractor, which will OCR images by default if Tesseract is available locally, extract inline images from PDF files and OCR them and use PDFBox's non-sequential PDF parser.
Extractor.EmbedHandling - Enum in org.icij.extract.extractor
 
Extractor.OutputFormat - Enum in org.icij.extract.extractor
 

F

FallbackParser - Class in org.icij.extract.parser
 
FallbackParser() - Constructor for class org.icij.extract.parser.FallbackParser
 
fastPut(TikaDocument, Report) - Method in class org.icij.extract.report.HashMapReportMap
 
fastPut(TikaDocument, Report) - Method in interface org.icij.extract.report.ReportMap
Allow implementations to define a faster method for putting values into the map that doesn't require the previous value to be returned.
FieldNames - Class in org.icij.spewer
Defaults for use with spewers.
FieldNames() - Constructor for class org.icij.spewer.FieldNames
 
fields - Variable in class org.icij.spewer.Spewer
 
FileSpewer - Class in org.icij.spewer
Writes the text or HTML output from a Reader to the filesystem.
FileSpewer(FieldNames) - Constructor for class org.icij.spewer.FileSpewer
 
followSymLinks(boolean) - Method in class org.icij.extract.Scanner
Set whether symlinks should be followed.
followSymLinks() - Method in class org.icij.extract.Scanner
Check whether symlinks will be followed.
forBaseType() - Method in class org.icij.spewer.FieldNames
 
forId() - Method in class org.icij.spewer.FieldNames
 
forLevel() - Method in class org.icij.spewer.FieldNames
 
format(Duration) - Static method in class org.icij.time.HumanDuration
Convert the duration to a string of the same format that is accepted by HumanDuration.parse(String).
forMetadata(String) - Method in class org.icij.spewer.FieldNames
 
forMetadataISODate(String) - Method in class org.icij.spewer.FieldNames
 
forParentId() - Method in class org.icij.spewer.FieldNames
 
forParentPath() - Method in class org.icij.spewer.FieldNames
 
forPath() - Method in class org.icij.spewer.FieldNames
 
forRoot() - Method in class org.icij.spewer.FieldNames
 
forTag(String) - Method in class org.icij.spewer.FieldNames
 
forText() - Method in class org.icij.spewer.FieldNames
 
forVersion() - Method in class org.icij.spewer.FieldNames
 
from(Map<String, String>) - Static method in class org.icij.task.Options
 
from(Properties) - Static method in class org.icij.task.Options
 

G

generate(TikaDocument) - Method in class org.icij.extract.document.DigestIdentifier
 
generate(TikaDocument) - Method in interface org.icij.extract.document.Identifier
Generate an identifier for a root tikaDocument.
generate(TikaDocument) - Method in class org.icij.extract.document.PathDigestIdentifier
 
generate(TikaDocument) - Method in class org.icij.extract.document.PathIdentifier
 
generate() - Method in interface org.icij.extract.document.TikaDocument.ReaderGenerator
 
generateForEmbed(EmbeddedTikaDocument) - Method in class org.icij.extract.document.DigestIdentifier
 
generateForEmbed(EmbeddedTikaDocument) - Method in interface org.icij.extract.document.Identifier
Generate an identifier for an embedded document.
generateForEmbed(EmbeddedTikaDocument) - Method in class org.icij.extract.document.PathDigestIdentifier
 
generateForEmbed(EmbeddedTikaDocument) - Method in class org.icij.extract.document.PathIdentifier
 
get(String) - Method in class org.icij.task.Options
 
get(Option<T>) - Method in class org.icij.task.Options
 
getCode() - Method in enum org.icij.extract.extractor.ExtractionStatus
 
getDefault() - Static method in enum org.icij.extract.extractor.Extractor.EmbedHandling
 
getEmbed(String) - Method in class org.icij.extract.document.TikaDocument
 
getEmbedHandling() - Method in class org.icij.extract.extractor.Extractor
Get the embed handling mode.
getEmbedOutputPath() - Method in class org.icij.extract.extractor.Extractor
Get the output directory path for embed files.
getEmbeds() - Method in class org.icij.extract.document.TikaDocument
 
getException() - Method in class org.icij.extract.report.Report
 
getFields() - Method in class org.icij.spewer.Spewer
 
getForeignId() - Method in class org.icij.extract.document.TikaDocument
 
getHash() - Method in class org.icij.extract.document.TikaDocument
 
getId() - Method in class org.icij.extract.document.TikaDocument
 
getLatch() - Method in class org.icij.extract.queue.DocumentQueueDrainer
Get the poll latch.
getLatch() - Method in class org.icij.extract.Scanner
Get the latch.
getMaxDepth() - Method in class org.icij.extract.Scanner
Get the currently set maximum depth to recurse when scanning.
getMetadata(String) - Method in class org.icij.extract.document.TikaDocument
 
getMetadata() - Method in class org.icij.extract.document.TikaDocument
 
getNumberOfFiles(Path) - Method in class org.icij.extract.Scanner
 
getOutputDirectory() - Method in class org.icij.spewer.FileSpewer
 
getOutputEncoding() - Method in class org.icij.spewer.Spewer
 
getOutputExtension() - Method in class org.icij.spewer.FileSpewer
 
getOutputFormat() - Method in class org.icij.extract.extractor.Extractor
Get the extraction output format.
getPath() - Method in class org.icij.extract.document.TikaDocument
 
getPollTimeout() - Method in class org.icij.extract.queue.DocumentQueueDrainer
Get the poll timeout.
getReader() - Method in class org.icij.extract.document.TikaDocument
 
getReporter() - Method in class org.icij.extract.extractor.DocumentConsumer
Get the reporter.
getStatus() - Method in class org.icij.extract.report.Report
 
getSupportedTypes(ParseContext) - Method in class org.icij.extract.parser.FallbackParser
 
getTask(String) - Method in class org.icij.task.DefaultTaskFactory
 

H

handler - Variable in class org.icij.extract.parser.ParsingReader
Receives SAX events.
hasEmbeds() - Method in class org.icij.extract.document.TikaDocument
 
hash(TikaDocument) - Method in class org.icij.extract.document.AbstractIdentifier
 
hash(TikaDocument) - Method in interface org.icij.extract.document.Identifier
Generate or retrieve (from metadata) a hash digest of the tikaDocument's underlying file data.
hashCode() - Method in class org.icij.extract.document.TikaDocument
 
hashCode() - Method in class org.icij.task.Option
 
hashCode() - Method in class org.icij.task.Options
 
HashMapReportMap - Class in org.icij.extract.report
A ReportMap using a ConcurrentHashMap as a backend.
HashMapReportMap() - Constructor for class org.icij.extract.report.HashMapReportMap
Instantiate a new report with the default ConcurrentHashMap capacity (16).
hasNext() - Method in class org.icij.task.OptionsIterator
 
hintRemaining(int) - Method in class org.icij.event.DefaultMonitor
 
hintRemaining(int) - Method in interface org.icij.event.Listener
 
hintRemaining(int) - Method in class org.icij.event.listeners.ConsoleProgressListener
 
hintRemaining(int) - Method in interface org.icij.event.Notifiable
 
HTML5Serializer - Class in org.icij.extract.parser
Serializes a sequence of SAX events representing an XHTML 1.0 Strict document to a Writer as a UTF-8-encoded HTML 5 document.
HTML5Serializer(Writer) - Constructor for class org.icij.extract.parser.HTML5Serializer
Creates a new instance of HtmlSerializer in the HTML 4.01 doctype mode with the UTF-8 encoding and no charset meta.
HumanDuration - Class in org.icij.time
Parses a "human" time duration string like 1m or 500ms to a Duration.
HumanDuration() - Constructor for class org.icij.time.HumanDuration
 

I

Identifier - Interface in org.icij.extract.document
An Identifier holds logic for generating both unique identifiers for documents as well as digest hashes of the the underlying file data.
ifPresent(String, Function<Option<T>, Optional<R>>) - Method in class org.icij.task.Options
 
ignorableWhitespace(char[], int, int) - Method in class org.icij.extract.parser.HTML5Serializer
This method does nothing.
ignoreHiddenFiles(boolean) - Method in class org.icij.extract.Scanner
Set whether hidden files should be ignored.
ignoreHiddenFiles() - Method in class org.icij.extract.Scanner
Check whether hidden files will be ignored.
ignoreSystemFiles(boolean) - Method in class org.icij.extract.Scanner
Set whether system files should be ignored.
ignoreSystemFiles() - Method in class org.icij.extract.Scanner
Check whether system files will be ignored.
include(String) - Method in class org.icij.extract.Scanner
Add a glob pattern for including files.
IndexType - Enum in org.icij.extract
An enumerated list of implemented index types.
input - Variable in class org.icij.extract.parser.ParsingReader
The binary stream being parsed.
INSTANCE - Static variable in class org.icij.extract.parser.FallbackParser
 
isOff() - Method in interface org.icij.task.OptionParser
 
isOff() - Method in class org.icij.task.StringOptionParser
 
isOn() - Method in interface org.icij.task.OptionParser
 
isOn() - Method in class org.icij.task.StringOptionParser
 
isSealed() - Method in class org.icij.concurrent.BooleanSealableLatch
 
isSealed() - Method in interface org.icij.concurrent.SealableLatch
 
iterator() - Method in class org.icij.task.Options
 

J

journalableExceptions() - Method in interface org.icij.extract.report.ReportMap
Allow implementations to define a list of exception classes that when caught, would indicate to the caller that arguments should be journaled and flushed later.

L

LAST_MODIFIED - Static variable in class org.icij.extract.document.TikaDocument
 
Listener - Interface in org.icij.event
 
listNames() - Method in class org.icij.task.DefaultTaskFactory
 
LOCATION - Static variable in class org.icij.extract.document.TikaDocument
 

M

map - Variable in class org.icij.task.Options
 
mark(int) - Method in class org.icij.extract.io.TokenReplacingReader
 
markSupported() - Method in class org.icij.extract.io.TokenReplacingReader
 
matches(Path) - Method in class org.icij.extract.io.file.DosHiddenFileMatcher
 
matches(Path) - Method in class org.icij.extract.io.file.PosixHiddenFileMatcher
 
matches(Path) - Method in class org.icij.extract.io.file.SystemFileMatcher
 
metadata - Variable in class org.icij.extract.parser.ParsingReader
Metadata associated with the document being parsed.
MetadataTransformer - Class in org.icij.spewer
 
MetadataTransformer(Metadata, FieldNames) - Constructor for class org.icij.spewer.MetadataTransformer
 
MetadataTransformer.ValueArrayConsumer - Interface in org.icij.spewer
 
MetadataTransformer.ValueConsumer - Interface in org.icij.spewer
 
Monitor - Interface in org.icij.event
 
monitor - Variable in class org.icij.task.MonitorableTask
 
Monitorable - Interface in org.icij.event
 
MonitorableTask<R> - Class in org.icij.task
 
MonitorableTask() - Constructor for class org.icij.task.MonitorableTask
 
MonitorableTask(Monitor) - Constructor for class org.icij.task.MonitorableTask
 

N

name() - Method in class org.icij.task.Option
 
next() - Method in class org.icij.task.OptionsIterator
 
Notifiable - Interface in org.icij.event
 
notify(Monitorable, Object) - Method in interface org.icij.event.Listener
 
notify(Object) - Method in interface org.icij.event.Listener
 
notify(Monitorable, Object) - Method in class org.icij.event.listeners.ConsoleProgressListener
 
notify(Object) - Method in class org.icij.event.listeners.ConsoleProgressListener
 
notifyListeners() - Method in class org.icij.event.DefaultMonitor
 
notifyListeners(Object) - Method in class org.icij.event.DefaultMonitor
 
notifyListeners() - Method in interface org.icij.event.Notifiable
 
notifyListeners(Object) - Method in interface org.icij.event.Notifiable
 

O

Option - Annotation Type in org.icij.task.annotation
 
option(String) - Method in class org.icij.task.DefaultTask
 
Option<V> - Class in org.icij.task
 
Option(String, Function<Option<V>, OptionParser<V>>) - Constructor for class org.icij.task.Option
 
option(String) - Method in interface org.icij.task.Task
 
OptionParser<V> - Interface in org.icij.task
 
Options - Annotation Type in org.icij.task.annotation
 
options - Variable in class org.icij.task.DefaultTask
 
options(Class<? extends DefaultTask>) - Static method in class org.icij.task.DefaultTask
 
options() - Method in class org.icij.task.DefaultTask
 
Options<T> - Class in org.icij.task
 
Options() - Constructor for class org.icij.task.Options
 
options() - Method in interface org.icij.task.Task
 
OptionsClass - Annotation Type in org.icij.task.annotation
 
OptionsClasses - Annotation Type in org.icij.task.annotation
 
OptionsIterator<T> - Class in org.icij.task
 
org.icij.concurrent - package org.icij.concurrent
 
org.icij.event - package org.icij.event
 
org.icij.event.listeners - package org.icij.event.listeners
 
org.icij.extract - package org.icij.extract
 
org.icij.extract.document - package org.icij.extract.document
 
org.icij.extract.extractor - package org.icij.extract.extractor
 
org.icij.extract.io - package org.icij.extract.io
 
org.icij.extract.io.file - package org.icij.extract.io.file
 
org.icij.extract.parser - package org.icij.extract.parser
 
org.icij.extract.queue - package org.icij.extract.queue
 
org.icij.extract.redis - package org.icij.extract.redis
 
org.icij.extract.report - package org.icij.extract.report
 
org.icij.spewer - package org.icij.spewer
 
org.icij.spewer.http - package org.icij.spewer.http
 
org.icij.task - package org.icij.task
 
org.icij.task.annotation - package org.icij.task.annotation
 
org.icij.time - package org.icij.time
 
outputMetadata - Variable in class org.icij.spewer.Spewer
 
outputMetadata(boolean) - Method in class org.icij.spewer.Spewer
 
outputMetadata() - Method in class org.icij.spewer.Spewer
 
OutputType - Enum in org.icij.extract
An enumerated list of implemented output types.

P

parameter() - Method in class org.icij.task.Option
 
parameter(String) - Method in class org.icij.task.Option
 
parse(Number) - Static method in enum org.icij.extract.extractor.ExtractionStatus
 
parse(Integer) - Static method in enum org.icij.extract.extractor.ExtractionStatus
 
parse(String) - Static method in enum org.icij.extract.extractor.ExtractionStatus
 
parse(String) - Static method in enum org.icij.extract.extractor.Extractor.EmbedHandling
 
parse(String) - Static method in enum org.icij.extract.extractor.Extractor.OutputFormat
 
parse(String) - Static method in enum org.icij.extract.IndexType
Create an instance from a string.
parse(String) - Static method in enum org.icij.extract.OutputType
Create an instance from a string.
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.icij.extract.parser.CachingTesseractOCRParser
 
parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.icij.extract.parser.FallbackParser
 
parse() - Method in class org.icij.extract.parser.ParsingReader
Parses the given binary stream and writes the text content to the write end of the pipe.
parse(String) - Static method in enum org.icij.extract.queue.DocumentQueueType
Parse the given string representation of the type into an instance.
parse(String) - Static method in enum org.icij.extract.report.ReportMapType
Parse the given string representation of the type into an instance.
parse() - Method in class org.icij.task.Option
 
parse(String) - Static method in class org.icij.time.HumanDuration
Creates a new Duration by parsing the given string.
parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in class org.icij.extract.extractor.EmbedBlocker
 
parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in class org.icij.extract.extractor.EmbedLinker
 
parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in class org.icij.extract.extractor.EmbedParser
 
parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in class org.icij.extract.extractor.EmbedSpawner
 
parseInline(InputStream, XHTMLContentHandler, ParseContext, TesseractOCRConfig) - Method in class org.icij.extract.parser.CachingTesseractOCRParser
 
parser - Variable in class org.icij.extract.parser.ParsingReader
Parser instance used for parsing the given binary stream.
parser - Variable in class org.icij.task.Option
 
ParsingReader - Class in org.icij.extract.parser
Reader for the content from a given binary stream.
ParsingReader(InputStream) - Constructor for class org.icij.extract.parser.ParsingReader
Creates a reader for the content of the given binary stream.
ParsingReader(InputStream, String) - Constructor for class org.icij.extract.parser.ParsingReader
Creates a reader for the content of the given binary stream with the given name.
ParsingReader(Parser, InputStream, Metadata, ParseContext) - Constructor for class org.icij.extract.parser.ParsingReader
 
ParsingReader(Parser, InputStream, Metadata, ParseContext, Function<Writer, ContentHandler>) - Constructor for class org.icij.extract.parser.ParsingReader
Creates a reader for the content of the given binary stream with the given document metadata.
PathDigestIdentifier - Class in org.icij.extract.document
 
PathIdentifier - Class in org.icij.extract.document
 
PathIdentifier() - Constructor for class org.icij.extract.document.PathIdentifier
 
pinCertificate(String) - Method in class org.icij.spewer.http.PinnedHttpClientBuilder
 
pinCertificate(String, String) - Method in class org.icij.spewer.http.PinnedHttpClientBuilder
 
PinnedHttpClientBuilder - Class in org.icij.spewer.http
Extends HttpClientBuilder with the ability to pin a certificate and a hostname.
PinnedHttpClientBuilder() - Constructor for class org.icij.spewer.http.PinnedHttpClientBuilder
 
PinnedHttpClientBuilder.BodgeHostnameVerifier - Class in org.icij.spewer.http
 
PosixHiddenFileMatcher - Class in org.icij.extract.io.file
Create a PathMatcher that matches hidden files by checking if the last element in the path starts with a dot.
PosixHiddenFileMatcher() - Constructor for class org.icij.extract.io.file.PosixHiddenFileMatcher
 
preVisitDirectory(Path, BasicFileAttributes) - Method in class org.icij.extract.ScannerVisitor
 
PrintStreamSpewer - Class in org.icij.spewer
Writes the text output from a ParsingReader, and metadata, to the given PrintStream.
PrintStreamSpewer(PrintStream, FieldNames) - Constructor for class org.icij.spewer.PrintStreamSpewer
 
processingInstruction(String, String) - Method in class org.icij.extract.parser.HTML5Serializer
This method does nothing.

Q

queue - Variable in class org.icij.extract.Scanner
 
queued() - Method in class org.icij.extract.Scanner
 

R

read() - Method in class org.icij.extract.io.DataURIEncodingInputStream
 
read() - Method in class org.icij.extract.io.TokenReplacingReader
 
read(char[]) - Method in class org.icij.extract.io.TokenReplacingReader
 
read(char[], int, int) - Method in class org.icij.extract.io.TokenReplacingReader
 
read() - Method in class org.icij.extract.io.URLEncodingInputStream
 
read(byte[]) - Method in class org.icij.extract.io.URLEncodingInputStream
 
read(byte[], int, int) - Method in class org.icij.extract.io.URLEncodingInputStream
 
read(char[], int, int) - Method in class org.icij.extract.parser.EmbeddingHTMLParsingReader
 
read(char[], int, int) - Method in class org.icij.extract.parser.ParsingReader
Reads parsed text from the pipe connected to the parsing thread.
reader(InputStream) - Static method in class org.icij.extract.io.URLEncodingInputStream
 
reader - Variable in class org.icij.extract.parser.ParsingReader
Buffered read end of the pipe.
ready() - Method in class org.icij.extract.io.TokenReplacingReader
 
RedisDocumentQueue - Class in org.icij.extract.redis
A DocumentQueue using Redis as a backend.
RedisDocumentQueue(DocumentFactory, Options<String>) - Constructor for class org.icij.extract.redis.RedisDocumentQueue
Create a Redis-backed queue using the provided configuration.
RedissonClientFactory - Class in org.icij.extract.redis
Factory for creating a Redis client.
RedissonClientFactory() - Constructor for class org.icij.extract.redis.RedissonClientFactory
 
remove() - Method in class org.icij.task.OptionsIterator
 
removeEmbed(EmbeddedTikaDocument) - Method in class org.icij.extract.document.TikaDocument
 
Report - Class in org.icij.extract.report
 
Report(ExtractionStatus, Exception) - Constructor for class org.icij.extract.report.Report
 
Report(ExtractionStatus) - Constructor for class org.icij.extract.report.Report
 
report(TikaDocument) - Method in class org.icij.extract.report.Reporter
Check the extraction result of a given tikaDocument.
Reporter - Class in org.icij.extract.report
Records the extraction result of a file to the given ReportMap.
Reporter(ReportMap) - Constructor for class org.icij.extract.report.Reporter
Create a new reporter that will record results to the given ReportMap.
ReportMap - Interface in org.icij.extract.report
The interface for a report map.
ReportMapType - Enum in org.icij.extract.report
An enumerated list of implemented report map types.
reset() - Method in class org.icij.extract.io.TokenReplacingReader
 
resolveToken(String) - Method in interface org.icij.extract.io.TokenResolver
 
RESTSpewer - Class in org.icij.spewer
 
RESTSpewer(FieldNames, CloseableHttpClient, URI) - Constructor for class org.icij.spewer.RESTSpewer
 
ResultDecoder - Class in org.icij.extract.redis
Decoder for converting a string to a ExtractionStatus.
ResultDecoder() - Constructor for class org.icij.extract.redis.ResultDecoder
 
ResultEncoder - Class in org.icij.extract.redis
 
ResultEncoder() - Constructor for class org.icij.extract.redis.ResultEncoder
 
retrieveHash(Metadata) - Method in class org.icij.extract.document.AbstractIdentifier
 
retrieveHash(Metadata) - Method in interface org.icij.extract.document.Identifier
Retrieve a hash digest of the document's underlying file data.
retryRequest(IOException, int, HttpContext) - Method in class org.icij.spewer.http.CountdownHttpRequestRetryHandler
 

S

save(TikaDocument, Report) - Method in class org.icij.extract.report.Reporter
Save the extraction report for the given tikaDocument.
save(TikaDocument, ExtractionStatus, Exception) - Method in class org.icij.extract.report.Reporter
Save the extraction status and optional exception for the given tikaDocument.
save(TikaDocument, ExtractionStatus) - Method in class org.icij.extract.report.Reporter
Save the extraction status for the given tikaDocument.
scan(Path) - Method in class org.icij.extract.Scanner
Queue a scanning job.
scan(Path[]) - Method in class org.icij.extract.Scanner
Submit all of the given paths to the scanner for execution, returning a list of Future objects representing those tasks.
scan(String[]) - Method in class org.icij.extract.Scanner
 
Scanner - Class in org.icij.extract
Scanner for scanning the directory tree starting at a given path.
Scanner(DocumentFactory, BlockingQueue<TikaDocument>) - Constructor for class org.icij.extract.Scanner
 
Scanner(DocumentFactory, BlockingQueue<TikaDocument>, SealableLatch) - Constructor for class org.icij.extract.Scanner
 
Scanner(DocumentFactory, BlockingQueue<TikaDocument>, SealableLatch, Notifiable) - Constructor for class org.icij.extract.Scanner
Creates a Scanner that sends all results straight to the underlying BlockingQueue on a single thread.
ScannerVisitor - Class in org.icij.extract
 
ScannerVisitor(Path, BlockingQueue<TikaDocument>, DocumentFactory, Options<String>) - Constructor for class org.icij.extract.ScannerVisitor
Instantiate a new task for scanning the given path.
seal() - Method in class org.icij.concurrent.BooleanSealableLatch
 
seal() - Method in interface org.icij.concurrent.SealableLatch
 
SealableLatch - Interface in org.icij.concurrent
 
setDigestAlgorithms(CommonsDigester.DigestAlgorithm...) - Method in class org.icij.extract.extractor.Extractor
 
setDocumentLocator(Locator) - Method in class org.icij.extract.parser.HTML5Serializer
This method does nothing.
setEmbedHandling(Extractor.EmbedHandling) - Method in class org.icij.extract.extractor.Extractor
Set the embed handling mode.
setEmbedOutputPath(Path) - Method in class org.icij.extract.extractor.Extractor
Set the output directory path for embed files.
setForeignId(String) - Method in class org.icij.extract.document.TikaDocument
 
setLatch(SealableLatch) - Method in class org.icij.extract.queue.DocumentQueueDrainer
If given, the latch should be used to signal that the queue should be polled.
setMaxDepth(int) - Method in class org.icij.extract.Scanner
Set the maximum depth to recurse when scanning.
setMaximumPoolSize(int) - Method in class org.icij.concurrent.BlockingThreadPoolExecutor
Increase or decreases the maximum pool size by adjusting the number of permits accordingly.
setOcrLanguage(String) - Method in class org.icij.extract.extractor.Extractor
Set the languages used by Tesseract.
setOcrTimeout(Duration) - Method in class org.icij.extract.extractor.Extractor
Instructs Tesseract to attempt OCR for no longer than the given duration.
setOutputDirectory(Path) - Method in class org.icij.spewer.FileSpewer
 
setOutputEncoding(Charset) - Method in class org.icij.spewer.Spewer
 
setOutputFormat(Extractor.OutputFormat) - Method in class org.icij.extract.extractor.Extractor
Set the output format.
setPollTimeout(Duration) - Method in class org.icij.extract.queue.DocumentQueueDrainer
Set the amount of time to wait until an item becomes available.
setReader(Reader) - Method in class org.icij.extract.document.TikaDocument
 
setReader(TikaDocument.ReaderGenerator) - Method in class org.icij.extract.document.TikaDocument
 
setReporter(Reporter) - Method in class org.icij.extract.extractor.DocumentConsumer
Set the reporter.
setTags(Map<String, String>) - Method in class org.icij.spewer.Spewer
 
setVerifyHostname(String) - Method in class org.icij.spewer.http.PinnedHttpClientBuilder
 
shouldParseEmbedded(Metadata) - Method in class org.icij.extract.extractor.EmbedBlocker
 
shouldParseEmbedded(Metadata) - Method in class org.icij.extract.extractor.EmbedLinker
Always returns true.
shutdown() - Method in class org.icij.concurrent.ExecutorProxy
Shuts down the executor.
shutdown() - Method in interface org.icij.concurrent.Shutdownable
 
Shutdownable - Interface in org.icij.concurrent
 
shutdownNow() - Method in class org.icij.concurrent.ExecutorProxy
Shut down the executor immediately, halting running tasks and discarding waiting tasks.
signal() - Method in class org.icij.concurrent.BooleanSealableLatch
 
signal() - Method in interface org.icij.concurrent.SealableLatch
 
skip(long) - Method in class org.icij.extract.io.TokenReplacingReader
 
skip(TikaDocument) - Method in class org.icij.extract.report.Reporter
Check whether a path should be skipped.
skippedEntity(String) - Method in class org.icij.extract.parser.HTML5Serializer
This method does nothing.
spewer - Variable in class org.icij.extract.extractor.DocumentConsumer
 
Spewer - Class in org.icij.spewer
Base class for Spewer superclasses that write text output from a ParsingReader to specific endpoints.
Spewer(FieldNames) - Constructor for class org.icij.spewer.Spewer
 
startDocument() - Method in class org.icij.extract.parser.HTML5Serializer
Must be called first.
startElement(String, String, String, Attributes) - Method in class org.icij.extract.parser.HTML5Serializer
Writes a start tag if the element is an XHTML element.
startPrefixMapping(String, String) - Method in class org.icij.extract.parser.HTML5Serializer
This method does nothing.
StringOptionParser - Class in org.icij.task
 
StringOptionParser(Option<String>) - Constructor for class org.icij.task.StringOptionParser
 
SystemFileMatcher - Class in org.icij.extract.io.file
Create a PathMatcher that matches operating-system-generated files.
SystemFileMatcher() - Constructor for class org.icij.extract.io.file.SystemFileMatcher
 

T

tags - Variable in class org.icij.spewer.Spewer
 
Task - Annotation Type in org.icij.task.annotation
 
Task<T,U,R> - Interface in org.icij.task
 
TikaDocument - Class in org.icij.extract.document
 
TikaDocument(String, Identifier, Path, Metadata) - Constructor for class org.icij.extract.document.TikaDocument
Instantiate a document with a pre-generated ID.
TikaDocument(String, Identifier, Path) - Constructor for class org.icij.extract.document.TikaDocument
 
TikaDocument(Identifier, Path, Metadata) - Constructor for class org.icij.extract.document.TikaDocument
Instantiate a document when the ID has not yet been generated.
TikaDocument(Identifier, Path) - Constructor for class org.icij.extract.document.TikaDocument
 
TikaDocument.ReaderGenerator - Interface in org.icij.extract.document
 
TokenReplacingReader - Class in org.icij.extract.io
Reader for in-place token replacements.
TokenReplacingReader(TokenResolver<Reader>, Reader, String, String) - Constructor for class org.icij.extract.io.TokenReplacingReader
 
TokenResolver<R> - Interface in org.icij.extract.io
Interface for resolving of tokens found via TokenReplacingReader.
toString() - Method in class org.icij.extract.document.TikaDocument
 
toString() - Method in enum org.icij.extract.IndexType
Print a friendly name for the index type.
toString() - Method in enum org.icij.extract.OutputType
Print a friendly name for the output type.
toString() - Method in enum org.icij.extract.queue.DocumentQueueType
Return the name of the queue type.
toString() - Method in enum org.icij.extract.report.ReportMapType
Return the name of the report type.
toString(Reader) - Static method in class org.icij.spewer.Spewer
 
toString() - Method in class org.icij.task.Option
 
toString() - Method in class org.icij.task.Options
 
transform(MetadataTransformer.ValueConsumer, MetadataTransformer.ValueArrayConsumer) - Method in class org.icij.spewer.MetadataTransformer
 

U

update(V) - Method in class org.icij.task.Option
 
update(List<V>) - Method in class org.icij.task.Option
 
update(Supplier<List<V>>) - Method in class org.icij.task.Option
 
URLEncodingInputStream - Class in org.icij.extract.io
A FilterInputStream that URL-encodes arbitrary binary data from an input stream.
URLEncodingInputStream(InputStream) - Constructor for class org.icij.extract.io.URLEncodingInputStream
 

V

value() - Method in class org.icij.task.Option
 
value(Function<V, R>) - Method in class org.icij.task.Option
 
valueIfPresent(String) - Method in class org.icij.task.Options
 
valueOf(String) - Static method in enum org.icij.extract.extractor.ExtractionStatus
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.icij.extract.extractor.Extractor.EmbedHandling
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.icij.extract.extractor.Extractor.OutputFormat
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.icij.extract.IndexType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.icij.extract.OutputType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.icij.extract.queue.DocumentQueueType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.icij.extract.report.ReportMapType
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.icij.extract.extractor.ExtractionStatus
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.icij.extract.extractor.Extractor.EmbedHandling
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.icij.extract.extractor.Extractor.OutputFormat
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.icij.extract.IndexType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.icij.extract.OutputType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.icij.extract.queue.DocumentQueueType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.icij.extract.report.ReportMapType
Returns an array containing the constants of this enum type, in the order they are declared.
values - Variable in class org.icij.task.Option
 
values() - Method in class org.icij.task.Option
 
values(Function<V, R>) - Method in class org.icij.task.Option
 
verify(String, SSLSession) - Method in class org.icij.spewer.http.PinnedHttpClientBuilder.BodgeHostnameVerifier
 
visitFile(Path, BasicFileAttributes) - Method in class org.icij.extract.ScannerVisitor
 
visitFileFailed(Path, IOException) - Method in class org.icij.extract.ScannerVisitor
 

W

withIdentifier(Identifier) - Method in class org.icij.extract.document.DocumentFactory
 
withLatch(SealableLatch) - Method in class org.icij.extract.ScannerVisitor
 
withMonitor(Notifiable) - Method in class org.icij.extract.ScannerVisitor
 
withOptions(Options<String>) - Method in class org.icij.extract.redis.RedissonClientFactory
Create a new connection manager by query the given set of options.
write(TikaDocument, Reader) - Method in class org.icij.spewer.FileSpewer
 
write(TikaDocument, Reader) - Method in class org.icij.spewer.PrintStreamSpewer
 
write(TikaDocument, Reader) - Method in class org.icij.spewer.RESTSpewer
 
write(TikaDocument, Reader) - Method in class org.icij.spewer.Spewer
 
write(Path) - Method in class org.icij.spewer.Spewer
 
writeMetadata(TikaDocument) - Method in class org.icij.spewer.FileSpewer
 
writeMetadata(TikaDocument) - Method in class org.icij.spewer.PrintStreamSpewer
 
writeMetadata(TikaDocument) - Method in class org.icij.spewer.RESTSpewer
 
writeMetadata(TikaDocument) - Method in class org.icij.spewer.Spewer
 
A B C D E F G H I J L M N O P Q R S T U V W 
Skip navigation links

Copyright © 2018. All rights reserved.