org.apache.tika.parser.pdf
Class CustomPDFParser
java.lang.Object
org.apache.tika.parser.pdf.CustomPDFParser
- All Implemented Interfaces:
- java.io.Serializable, org.apache.tika.parser.Parser
public class CustomPDFParser
- extends java.lang.Object
- implements org.apache.tika.parser.Parser
PDF parser.
This parser can also process encrypted PDF documents if the required
password is supplied in the input metadata associated with the
document (see the PASSWORD key). If no password is given, the parser tries
to decrypt the document with the empty password, which is often used with PDFs.
- See Also:
- Serialized Form
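The password lookup described above can be sketched with plain Java collections. This is a minimal illustration under stated assumptions, not the parser's actual code: the `PasswordLookupSketch` class, the `resolvePassword` helper, and the key string `"example.pdf.password"` are hypothetical (the real value of `PASSWORD` is listed under Constant Field Values), and a `Map` stands in for `org.apache.tika.metadata.Metadata`.

```java
import java.util.HashMap;
import java.util.Map;

final class PasswordLookupSketch {
    // Hypothetical key; the real value of CustomPDFParser.PASSWORD is not shown on this page.
    static final String PASSWORD = "example.pdf.password";

    // Returns the password supplied in the metadata, or the empty string if none was given.
    static String resolvePassword(Map<String, String> metadata) {
        String password = metadata.get(PASSWORD);
        return password != null ? password : "";
    }

    public static void main(String[] args) {
        Map<String, String> metadata = new HashMap<>();
        // No password set: the lookup falls back to the empty password.
        System.out.println("[" + resolvePassword(metadata) + "]");
        // Password set by the caller: the lookup returns it unchanged.
        metadata.put(PASSWORD, "secret");
        System.out.println("[" + resolvePassword(metadata) + "]");
    }
}
```

With the real class, the caller would presumably set the password on the `Metadata` object under `CustomPDFParser.PASSWORD` before calling `parse`.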
Field Summary
static java.lang.String PASSWORD
- Metadata key for giving the document password to the parser.
Method Summary
java.util.Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
void parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, org.apache.tika.metadata.Metadata metadata)
- Deprecated. This method will be removed in Apache Tika 1.0.
void parse(java.io.InputStream stream, org.xml.sax.ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context)
Methods inherited from class java.lang.Object
- clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
PASSWORD
public static final java.lang.String PASSWORD
- Metadata key for giving the document password to the parser.
- Since:
- Apache Tika 0.5
- See Also:
- Constant Field Values
CustomPDFParser
public CustomPDFParser()
getSupportedTypes
public java.util.Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
- Specified by:
getSupportedTypes in interface org.apache.tika.parser.Parser
parse
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
org.apache.tika.metadata.Metadata metadata,
org.apache.tika.parser.ParseContext context)
throws java.io.IOException,
org.xml.sax.SAXException,
org.apache.tika.exception.TikaException
- Specified by:
parse in interface org.apache.tika.parser.Parser
- Throws:
java.io.IOException
org.xml.sax.SAXException
org.apache.tika.exception.TikaException
parse
public void parse(java.io.InputStream stream,
org.xml.sax.ContentHandler handler,
org.apache.tika.metadata.Metadata metadata)
throws java.io.IOException,
org.xml.sax.SAXException,
org.apache.tika.exception.TikaException
- Deprecated. This method will be removed in Apache Tika 1.0. Use the four-argument parse method, which also takes a ParseContext, instead.
- Specified by:
parse in interface org.apache.tika.parser.Parser
- Throws:
java.io.IOException
org.xml.sax.SAXException
org.apache.tika.exception.TikaException
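Callers of the deprecated three-argument overload can usually switch to the four-argument one. The sketch below shows a common delegation pattern for such overload pairs; it is an assumption about how the two methods relate, not code taken from this class. `DeprecatedOverloadSketch`, its nested `Context`, and the `String`/`StringBuilder` parameters are hypothetical stand-ins for the parser, `ParseContext`, the input stream, and the content handler.

```java
import java.util.ArrayList;
import java.util.List;

final class DeprecatedOverloadSketch {
    // Hypothetical stand-in for org.apache.tika.parser.ParseContext.
    static final class Context { }

    // Records which overload did the work, so the delegation is observable.
    final List<String> calls = new ArrayList<>();

    // Preferred four-argument form: the caller supplies the context.
    void parse(String stream, StringBuilder handler, Object metadata, Context context) {
        calls.add("context-aware");
        handler.append(stream);
    }

    // Deprecated three-argument form: assumed here to forward with a fresh, empty context.
    @Deprecated
    void parse(String stream, StringBuilder handler, Object metadata) {
        parse(stream, handler, metadata, new Context());
    }
}
```

If the real overloads are related this way, passing a new, empty `ParseContext` to the four-argument method would behave like the deprecated call.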
Copyright © 2004-2011. All Rights Reserved.