| Package | Description |
|---|---|
| org.icij.extract.document | |
| org.icij.extract.extractor | |
| org.icij.extract.redis | |

| Modifier and Type | Method and Description |
|---|---|
| `DocumentFactory` | `DocumentFactory.configure(Options<String> options)` |
| `DocumentFactory` | `DocumentFactory.withIdentifier(Identifier identifier)` |

| Constructor and Description |
|---|
| `Extractor(DocumentFactory factory)` Create a new extractor which, by default, will OCR images if Tesseract is available locally, extract inline images from PDF files and OCR them, and use PDFBox's non-sequential PDF parser. |

| Constructor and Description |
|---|
| `RedisDocumentQueue(DocumentFactory factory, Options<String> options)` Create a Redis-backed queue using the provided configuration. |
| `RedisDocumentQueue(DocumentFactory factory, String queueName, String redisAddress)` Create a Redis-backed queue with a pre-defined document factory. |

Copyright © 2019 The International Consortium of Investigative Journalists. All rights reserved.