| Class | Description |
|---|---|
| DocumentConsumer |
Base consumer for documents.
|
| EmbedBlocker |
A custom extractor that prevents Tika from parsing any embedded documents.
|
| EmbedLinker |
A custom extractor that saves all embeds to temporary files and records the new paths.
|
| EmbedParser |
A custom extractor that is an almost exact copy of Tika's default extractor for embedded documents.
|
| EmbedSpawner | |
| Extractor |
A reusable class that sets up Tika parsers based on runtime options.
|
| Enum | Description |
|---|---|
| ExtractionStatus |
Status for the extraction result of a file.
|
| Extractor.EmbedHandling | |
| Extractor.OutputFormat |
Copyright © 2018 The International Consortium of Investigative Journalists. All rights reserved.