| Class | Description |
|---|---|
| CachingTesseractOCRParser | |
| EmbeddingHTMLParsingReader |
Example:
final String uuid = UUID.randomUUID().toString();
final String open = uuid + "/";
final String close = "/" + uuid;
context.set(Parser.class, EmptyParser.INSTANCE);
context.set(EmbeddedDocumentExtractor.class, new EmbedLinker(document, tmp, open, close));
reader = new EmbeddingHTMLParsingReader(document, open, close, parser, input, metadata, context);
|
| FallbackParser | |
| HTML5Serializer |
Serializes a sequence of SAX events representing an XHTML 1.0 Strict document
to a
Writer as a UTF-8-encoded HTML 5 document. |
| ParsingReader |
Reader for the content from a given binary stream.
|
Copyright © 2018 The International Consortium of Investigative Journalists. All rights reserved.