| Package | Description |
|---|---|
| org.icij.extract.document | |
| org.icij.extract.extractor | |
| org.icij.extract.parser | |
| org.icij.spewer | |

| Modifier and Type | Class and Description |
|---|---|
| class | EmbeddedTikaDocument |

| Modifier and Type | Method and Description |
|---|---|
| TikaDocument | DocumentFactory.create(Path path) |
| TikaDocument | DocumentFactory.create(Path path, BasicFileAttributes attributes) |
| TikaDocument | DocumentFactory.create(Path path, long size) |
| TikaDocument | DocumentFactory.create(Path path, org.apache.tika.metadata.Metadata metadata) |
| TikaDocument | DocumentFactory.create(String path) |
| TikaDocument | DocumentFactory.create(String id, Path path) |
| TikaDocument | DocumentFactory.create(String id, Path path, long size) |
| TikaDocument | DocumentFactory.create(String id, Path path, org.apache.tika.metadata.Metadata metadata) |
| TikaDocument | DocumentFactory.create(String id, String path) |
| TikaDocument | DocumentFactory.create(URL url) |

| Modifier and Type | Method and Description |
|---|---|
| String | Identifier.generate(TikaDocument tikaDocument): Generate an identifier for a root tikaDocument. |
| String | PathDigestIdentifier.generate(TikaDocument document) |
| String | DigestIdentifier.generate(TikaDocument tikaDocument) |
| String | PathIdentifier.generate(TikaDocument tikaDocument) |
| String | Identifier.hash(TikaDocument tikaDocument): Generate or retrieve (from metadata) a hash digest of the tikaDocument's underlying file data. |
| String | AbstractIdentifier.hash(TikaDocument tikaDocument) |
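
The "generate or retrieve (from metadata)" behaviour described for `Identifier.hash` can be illustrated with a minimal, self-contained sketch. It does not use the library's classes: `SketchDocument`, `SketchDigestIdentifier`, and the `sketch:sha256` metadata key are hypothetical stand-ins, and SHA-256 is an assumption, since the listing does not say which algorithm or metadata key the library uses.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a document: raw bytes plus a metadata map. Not the library's TikaDocument. */
class SketchDocument {
    final byte[] data;
    final Map<String, String> metadata = new HashMap<>();

    SketchDocument(byte[] data) {
        this.data = data;
    }
}

/** Hypothetical identifier mirroring the generate/hash split in the table above. */
class SketchDigestIdentifier {
    /** Assumed metadata key; the real library's key is not shown in this listing. */
    static final String HASH_KEY = "sketch:sha256";

    /** Returns the digest cached in metadata if present; otherwise computes and caches it. */
    String hash(SketchDocument doc) {
        String cached = doc.metadata.get(HASH_KEY);
        if (cached != null) {
            return cached;
        }
        String digest = hex(sha256(doc.data));
        doc.metadata.put(HASH_KEY, digest);
        return digest;
    }

    /** In this sketch the identifier of a root document is simply its content digest. */
    String generate(SketchDocument doc) {
        return hash(doc);
    }

    private static byte[] sha256(byte[] input) {
        try {
            return MessageDigest.getInstance("SHA-256").digest(input);
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 must be supported by every Java platform implementation.
            throw new IllegalStateException(e);
        }
    }

    private static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder(bytes.length * 2);
        for (byte b : bytes) {
            sb.append(String.format("%02x", b & 0xff));
        }
        return sb.toString();
    }
}
```

Caching the digest in metadata means the file data only has to be read once, however many times `hash` is called for the same document. A path-based identifier would follow the same shape but derive the string from the document's path instead of its bytes.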

| Modifier and Type | Method and Description |
|---|---|
| TikaDocument | Extractor.extract(Path path): Create a pull-parser from the given TikaInputStream. |

| Modifier and Type | Method and Description |
|---|---|
| TikaDocumentSource | EmbeddedDocumentMemoryExtractor.extract(TikaDocument rootDocument, String embeddedDocumentDigest) |

| Constructor and Description |
|---|
| EmbeddingHTMLParsingReader(TikaDocument parent, String open, String close, org.apache.tika.parser.Parser parser, org.apache.tika.io.TikaInputStream input, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context) |

| Modifier and Type | Method and Description |
|---|---|
| TikaDocument[] | Spewer.write(Path path) |

| Modifier and Type | Method and Description |
|---|---|
| protected Map<String,Object> | Spewer.getMetadata(TikaDocument document) |
| void | RESTSpewer.write(TikaDocument tikaDocument) |
| void | FileSpewer.write(TikaDocument tikaDocument) |
| void | PrintStreamSpewer.write(TikaDocument tikaDocument) |
| void | Spewer.write(TikaDocument document) |
| protected void | RESTSpewer.writeDocument(TikaDocument doc, TikaDocument parent, TikaDocument root, int level) |
| protected void | FileSpewer.writeDocument(TikaDocument doc, TikaDocument parent, TikaDocument root, int level) |
| protected void | PrintStreamSpewer.writeDocument(TikaDocument doc, TikaDocument parent, TikaDocument root, int level) |
| protected abstract void | Spewer.writeDocument(TikaDocument doc, TikaDocument parent, TikaDocument root, int level) |
| void | RESTSpewer.writeMetadata(TikaDocument tikaDocument) |

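
The pairing of a public `write` with a protected abstract `writeDocument(doc, parent, root, level)` suggests a template-method design, in which the base class walks the document tree and each subclass decides how a single document is emitted. The sketch below illustrates that pattern only. `SketchNode`, `SketchSpewer`, and `SketchListSpewer` are hypothetical stand-ins, and the traversal order is an assumption rather than the library's documented behaviour.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a document that may contain embedded documents. */
class SketchNode {
    final String name;
    final List<SketchNode> embeds = new ArrayList<>();

    SketchNode(String name) {
        this.name = name;
    }

    /** Adds an embedded child and returns this node so calls can be chained. */
    SketchNode embed(SketchNode child) {
        embeds.add(child);
        return this;
    }
}

/** Base class: owns the depth-first traversal and delegates per-document output. */
abstract class SketchSpewer {
    public void write(SketchNode root) {
        walk(root, null, root, 0);
    }

    private void walk(SketchNode doc, SketchNode parent, SketchNode root, int level) {
        writeDocument(doc, parent, root, level);
        for (SketchNode child : doc.embeds) {
            walk(child, doc, root, level + 1);
        }
    }

    /** Subclasses emit one document; level is its depth below the root (root is 0). */
    protected abstract void writeDocument(SketchNode doc, SketchNode parent, SketchNode root, int level);
}

/** Example subclass: collects one indented line per document instead of printing. */
class SketchListSpewer extends SketchSpewer {
    final List<String> lines = new ArrayList<>();

    @Override
    protected void writeDocument(SketchNode doc, SketchNode parent, SketchNode root, int level) {
        StringBuilder line = new StringBuilder();
        for (int i = 0; i < level; i++) {
            line.append("  ");
        }
        line.append(doc.name);
        if (parent != null) {
            line.append(" (in ").append(parent.name).append(")");
        }
        lines.add(line.toString());
    }
}
```

Passing `parent`, `root`, and `level` to every call lets an implementation record the document hierarchy without having to track traversal state itself.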
Copyright © 2019 The International Consortium of Investigative Journalists. All rights reserved.