JavaScript is disabled on your browser.
Skip navigation links
Overview
Package
Class
Use
Tree
Deprecated
Index
Help
Prev
Next
Frames
No Frames
All Classes
A
B
C
D
E
F
G
H
I
L
N
O
P
R
S
T
U
W
_
A
addChpNodes(List, byte[], int, int)
- Method in class org.textmining.extraction.word.model.
ComplexNodeHelper
addChpNodes(List, byte[], int, int)
- Method in class org.textmining.extraction.word.model.
NodeHelper
addProperty(GenericPropertyNode)
- Method in class org.textmining.extraction.word.model.
PlexOfCps
adjustForDelete(int, int)
- Method in class org.textmining.extraction.word.model.
PropertyNode
Adjust for a deletion that can span multiple PropertyNodes.
append(Writer, String)
- Method in class org.textmining.extraction.word.
WordTextScrubber
ASCII_ENC
- Static variable in class org.textmining.extraction.word.model.
TextPiece
B
BOF_BIFF2
- Static variable in class org.textmining.extraction.excel.
Record
BOF_BIFF3
- Static variable in class org.textmining.extraction.excel.
Record
BOF_BIFF4
- Static variable in class org.textmining.extraction.excel.
Record
BOF_BIFF5678
- Static variable in class org.textmining.extraction.excel.
Record
Boundsheet
- Class in
org.textmining.extraction.excel
Boundsheet(String, int)
- Constructor for class org.textmining.extraction.excel.
Boundsheet
BOUNDSHEET
- Static variable in class org.textmining.extraction.excel.
Record
C
CHPBinTable
- Class in
org.textmining.extraction.word.model
This class holds all of the character formatting properties.
CHPBinTable()
- Constructor for class org.textmining.extraction.word.model.
CHPBinTable
CHPBinTable(byte[], byte[], int, int, int, NodeHelper)
- Constructor for class org.textmining.extraction.word.model.
CHPBinTable
Constructor used to read a binTable in from a Word document.
CHPFormattedDiskPage
- Class in
org.textmining.extraction.word.model
Represents a CHP fkp.
CHPFormattedDiskPage()
- Constructor for class org.textmining.extraction.word.model.
CHPFormattedDiskPage
CHPFormattedDiskPage(byte[], int, int, NodeHelper)
- Constructor for class org.textmining.extraction.word.model.
CHPFormattedDiskPage
This constructs a CHPFormattedDiskPage from a raw fkp (512 byte array read from a Word file).
CHPX
- Class in
org.textmining.extraction.word.model
Comment me
CHPX(int, int, byte[])
- Constructor for class org.textmining.extraction.word.model.
CHPX
clone()
- Method in class org.textmining.extraction.word.model.
PropertyNode
compareTo(Object)
- Method in class org.textmining.extraction.word.model.
PropertyNode
Used for sorting in collections.
ComplexFileTable
- Class in
org.textmining.extraction.word
ComplexFileTable(byte[], byte[], int, int)
- Constructor for class org.textmining.extraction.word.
ComplexFileTable
ComplexNodeHelper
- Class in
org.textmining.extraction.word.model
ComplexNodeHelper(TextPieceTable)
- Constructor for class org.textmining.extraction.word.model.
ComplexNodeHelper
D
DBCELL
- Static variable in class org.textmining.extraction.excel.
Record
DEFAULTCOLWIDTH
- Static variable in class org.textmining.extraction.excel.
Record
DEFAULTROWHEIGHT
- Static variable in class org.textmining.extraction.excel.
Record
DIMENSIONS
- Static variable in class org.textmining.extraction.excel.
Record
doFastSaveExtraction(Writer, int, List, List, WordTextScrubber)
- Method in class org.textmining.extraction.word.
WordTextExtractor
E
EOF
- Static variable in class org.textmining.extraction.excel.
Record
equals(Object)
- Method in class org.textmining.extraction.word.model.
PieceDescriptor
equals(Object)
- Method in class org.textmining.extraction.word.model.
PropertyNode
ExcelTextExtractor
- Class in
org.textmining.extraction.excel
ExcelTextExtractor(InputStream)
- Constructor for class org.textmining.extraction.excel.
ExcelTextExtractor
F
fill(List)
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
fill(ArrayList, int)
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
FormattedDiskPage
- Class in
org.textmining.extraction.word.model
Represents an FKP data structure.
FormattedDiskPage()
- Constructor for class org.textmining.extraction.word.model.
FormattedDiskPage
FormattedDiskPage(byte[], int, NodeHelper)
- Constructor for class org.textmining.extraction.word.model.
FormattedDiskPage
Uses a 512-byte array to create a FKP
G
GenericPropertyNode
- Class in
org.textmining.extraction.word.model
GenericPropertyNode(int, int, byte[])
- Constructor for class org.textmining.extraction.word.model.
GenericPropertyNode
getBytes()
- Method in class org.textmining.extraction.word.model.
GenericPropertyNode
getChpTableOffset()
- Method in class org.textmining.extraction.word.
Word2TextExtractor
getChpTableOffset()
- Method in class org.textmining.extraction.word.
Word6TextExtractor
getChpTableSize()
- Method in class org.textmining.extraction.word.
Word2TextExtractor
getChpTableSize()
- Method in class org.textmining.extraction.word.
Word6TextExtractor
getCHPX(int)
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
getComplexOffset()
- Method in class org.textmining.extraction.word.
Word2TextExtractor
getComplexOffset()
- Method in class org.textmining.extraction.word.
Word6TextExtractor
getEnd(int)
- Method in class org.textmining.extraction.word.model.
FormattedDiskPage
Used to get the end of the text corresponding to a grpprl in this fkp.
getEnd()
- Method in class org.textmining.extraction.word.model.
PropertyNode
getEndIndex()
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
getFcEnd()
- Method in class org.textmining.extraction.word.model.
TextPiece
getFcStart()
- Method in class org.textmining.extraction.word.model.
TextPiece
getFilePosition()
- Method in class org.textmining.extraction.word.model.
PieceDescriptor
getGrpprl(int)
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
Gets the chpx for the character run at index in this fkp.
getGrpprl()
- Method in class org.textmining.extraction.word.model.
CHPX
getGrpprl(int)
- Method in class org.textmining.extraction.word.model.
FormattedDiskPage
getOffset()
- Method in class org.textmining.extraction.excel.
Record
getOperand()
- Method in class org.textmining.extraction.word.sprm.
SprmOperation
getOperation()
- Method in class org.textmining.extraction.word.sprm.
SprmOperation
getOverflow()
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
getPieceDescriptor()
- Method in class org.textmining.extraction.word.model.
TextPiece
getProperty(int)
- Method in class org.textmining.extraction.word.model.
PlexOfCps
getSize()
- Method in class org.textmining.extraction.excel.
Record
getSizeInBytes()
- Static method in class org.textmining.extraction.word.model.
PieceDescriptor
getStart(int)
- Method in class org.textmining.extraction.word.model.
FormattedDiskPage
Used to get a text offset corresponding to a grpprl in this fkp.
getStart()
- Method in class org.textmining.extraction.word.model.
PropertyNode
getText()
- Method in class org.textmining.extraction.excel.
ExcelTextExtractor
getText(Writer)
- Method in class org.textmining.extraction.excel.
ExcelTextExtractor
getText()
- Method in interface org.textmining.extraction.
TextExtractor
getText(Writer)
- Method in interface org.textmining.extraction.
TextExtractor
getText(byte[])
- Method in class org.textmining.extraction.word.model.
TextPiece
getText()
- Method in class org.textmining.extraction.word.
Word6TextExtractor
getText(Writer)
- Method in class org.textmining.extraction.word.
Word6TextExtractor
getText()
- Method in class org.textmining.extraction.word.
Word97TextExtractor
getText(Writer)
- Method in class org.textmining.extraction.word.
Word97TextExtractor
getTextPieces()
- Method in class org.textmining.extraction.word.model.
TextPieceTable
getTextPieceTable()
- Method in class org.textmining.extraction.word.
ComplexFileTable
getTextRuns()
- Method in class org.textmining.extraction.word.chp.
Word6CHPBinTable
getTextRuns()
- Method in class org.textmining.extraction.word.model.
CHPBinTable
getType()
- Method in class org.textmining.extraction.excel.
Record
getType()
- Method in class org.textmining.extraction.word.sprm.
SprmOperation
getVersion()
- Method in class org.textmining.extraction.word.
WordExtractorFactory
getVersion(int)
- Static method in class org.textmining.extraction.word.
WordVersion
H
hasNext()
- Method in class org.textmining.extraction.word.sprm.
SprmIterator
I
INDEX
- Static variable in class org.textmining.extraction.excel.
Record
initOptions()
- Method in class org.textmining.extraction.word.
WordTextExtractor
initWordHeader(InputStream)
- Method in class org.textmining.extraction.word.
WordExtractorFactory
initWordHeader(InputStream)
- Method in class org.textmining.extraction.word.
WordTextExtractor
isDeleted(byte[])
- Method in class org.textmining.extraction.word.
Word6TextExtractor
Used to determine if a run of text has been deleted.
isDeleted(byte[])
- Method in class org.textmining.extraction.word.
Word97TextExtractor
Used to determine if a run of text has been deleted.
isDeleted(byte[])
- Method in class org.textmining.extraction.word.
WordTextExtractor
isUnicode()
- Method in class org.textmining.extraction.word.model.
PieceDescriptor
L
length()
- Method in class org.textmining.extraction.word.model.
PlexOfCps
returns the number of data structures in this PlexOfCps.
limitsAreEqual(Object)
- Method in class org.textmining.extraction.word.model.
PropertyNode
N
next()
- Method in class org.textmining.extraction.word.sprm.
SprmIterator
NodeHelper
- Class in
org.textmining.extraction.word.model
NodeHelper(TextPieceTable)
- Constructor for class org.textmining.extraction.word.model.
NodeHelper
O
org.textmining.extraction
- package org.textmining.extraction
org.textmining.extraction.excel
- package org.textmining.extraction.excel
org.textmining.extraction.word
- package org.textmining.extraction.word
org.textmining.extraction.word.chp
- package org.textmining.extraction.word.chp
org.textmining.extraction.word.model
- package org.textmining.extraction.word.model
org.textmining.extraction.word.sprm
- package org.textmining.extraction.word.sprm
P
PasswordProtectedException
- Exception in
org.textmining.extraction.word
PasswordProtectedException(String)
- Constructor for exception org.textmining.extraction.word.
PasswordProtectedException
PieceDescriptor
- Class in
org.textmining.extraction.word.model
PieceDescriptor(byte[], int)
- Constructor for class org.textmining.extraction.word.model.
PieceDescriptor
PieceDescriptor()
- Constructor for class org.textmining.extraction.word.model.
PieceDescriptor
PlexOfCps
- Class in
org.textmining.extraction.word.model
common data structure in a Word file.
PlexOfCps(int)
- Constructor for class org.textmining.extraction.word.model.
PlexOfCps
PlexOfCps(byte[], int, int, int)
- Constructor for class org.textmining.extraction.word.model.
PlexOfCps
Constructor
PropertyNode
- Class in
org.textmining.extraction.word.model
Represents a lightweight node in the Trees used to store content properties.
PropertyNode(int, int, Object)
- Constructor for class org.textmining.extraction.word.model.
PropertyNode
R
Record
- Class in
org.textmining.extraction.excel
Record(int, int, int)
- Constructor for class org.textmining.extraction.excel.
Record
ROW
- Static variable in class org.textmining.extraction.excel.
Record
RowBlock
- Class in
org.textmining.extraction.excel
RowBlock(byte[], int)
- Constructor for class org.textmining.extraction.excel.
RowBlock
S
setEnd(int)
- Method in class org.textmining.extraction.word.model.
PropertyNode
setFilePosition(int)
- Method in class org.textmining.extraction.word.model.
PieceDescriptor
setStart(int)
- Method in class org.textmining.extraction.word.model.
PropertyNode
setUnicode(boolean)
- Method in class org.textmining.extraction.word.model.
PieceDescriptor
size()
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
size()
- Method in class org.textmining.extraction.word.model.
FormattedDiskPage
Used to get the total number of grrprl's stored int this FKP
size()
- Method in class org.textmining.extraction.word.sprm.
SprmOperation
sortNodes(List, boolean)
- Method in class org.textmining.extraction.word.model.
ComplexNodeHelper
sortNodes(List, boolean)
- Method in class org.textmining.extraction.word.model.
NodeHelper
SprmIterator
- Class in
org.textmining.extraction.word.sprm
This class is used to iterate through a list of sprms from a Word 97/2000/XP document.
SprmIterator(byte[])
- Constructor for class org.textmining.extraction.word.sprm.
SprmIterator
SprmOperation
- Class in
org.textmining.extraction.word.sprm
SprmOperation(byte[], int)
- Constructor for class org.textmining.extraction.word.sprm.
SprmOperation
SST_RECORD
- Static variable in class org.textmining.extraction.excel.
Record
supportsUnicode()
- Method in class org.textmining.extraction.word.
Word97TextExtractor
supportsUnicode()
- Method in class org.textmining.extraction.word.
WordTextExtractor
T
TextExtractor
- Interface in
org.textmining.extraction
textExtractor(InputStream)
- Method in class org.textmining.extraction.word.
WordTextExtractorFactory
Gets the text from a Word document.
TextPiece
- Class in
org.textmining.extraction.word.model
TextPiece(int, int, PieceDescriptor)
- Constructor for class org.textmining.extraction.word.model.
TextPiece
TextPieceTable
- Class in
org.textmining.extraction.word.model
TextPieceTable()
- Constructor for class org.textmining.extraction.word.model.
TextPieceTable
TextPieceTable(byte[], byte[], int, int, int)
- Constructor for class org.textmining.extraction.word.model.
TextPieceTable
toByteArray(int)
- Method in class org.textmining.extraction.word.model.
CHPFormattedDiskPage
toByteArray()
- Method in class org.textmining.extraction.word.model.
PieceDescriptor
toByteArray()
- Method in class org.textmining.extraction.word.model.
PlexOfCps
U
unicode()
- Method in class org.textmining.extraction.word.model.
TextPiece
UNICODE_ENC
- Static variable in class org.textmining.extraction.word.model.
TextPiece
usesUnicode()
- Method in class org.textmining.extraction.word.model.
TextPiece
W
Word2TextExtractor
- Class in
org.textmining.extraction.word
Word2TextExtractor(InputStream)
- Constructor for class org.textmining.extraction.word.
Word2TextExtractor
Word6
- Static variable in class org.textmining.extraction.word.
WordVersion
Word6CHPBinTable
- Class in
org.textmining.extraction.word.chp
This class holds all of the character formatting properties from a Word 6.0/95 document.
Word6CHPBinTable(byte[], int, int, int, NodeHelper)
- Constructor for class org.textmining.extraction.word.chp.
Word6CHPBinTable
Constructor used to read a binTable in from a Word document.
Word6TextExtractor
- Class in
org.textmining.extraction.word
This class is used to extract text from Word 6 documents only.
Word6TextExtractor()
- Constructor for class org.textmining.extraction.word.
Word6TextExtractor
Word6TextExtractor(InputStream)
- Constructor for class org.textmining.extraction.word.
Word6TextExtractor
Word97
- Static variable in class org.textmining.extraction.word.
WordVersion
Word97TextExtractor
- Class in
org.textmining.extraction.word
Word97TextExtractor(InputStream)
- Constructor for class org.textmining.extraction.word.
Word97TextExtractor
WordExtractorFactory
- Class in
org.textmining.extraction.word
WordExtractorFactory()
- Constructor for class org.textmining.extraction.word.
WordExtractorFactory
WordTextExtractor
- Class in
org.textmining.extraction.word
WordTextExtractor()
- Constructor for class org.textmining.extraction.word.
WordTextExtractor
WordTextExtractorFactory
- Class in
org.textmining.extraction.word
This class extracts the text from a Word 6.0/95/97/2000/XP word doc
WordTextExtractorFactory()
- Constructor for class org.textmining.extraction.word.
WordTextExtractorFactory
Constructor
WordTextScrubber
- Class in
org.textmining.extraction.word
This class acts as a StringBuffer for text from a word document.
WordTextScrubber()
- Constructor for class org.textmining.extraction.word.
WordTextScrubber
WordVersion
- Class in
org.textmining.extraction.word
WordVersion()
- Constructor for class org.textmining.extraction.word.
WordVersion
_
_buf
- Variable in class org.textmining.extraction.word.model.
PropertyNode
_crun
- Variable in class org.textmining.extraction.word.model.
FormattedDiskPage
_currentIndex
- Variable in class org.textmining.extraction.word.model.
FormattedDiskPage
_fastSave
- Variable in class org.textmining.extraction.word.
WordTextExtractor
_fastSaved
- Variable in class org.textmining.extraction.word.
WordExtractorFactory
_fc2Cp
- Variable in class org.textmining.extraction.word.model.
FormattedDiskPage
_fkp
- Variable in class org.textmining.extraction.word.model.
FormattedDiskPage
_fsys
- Variable in class org.textmining.extraction.word.
WordTextExtractor
_header
- Variable in class org.textmining.extraction.word.
WordTextExtractor
_offset
- Variable in class org.textmining.extraction.word.model.
FormattedDiskPage
_textPieces
- Variable in class org.textmining.extraction.word.model.
TextPieceTable
_textRuns
- Variable in class org.textmining.extraction.word.model.
CHPBinTable
List of character properties.
_tpt
- Variable in class org.textmining.extraction.word.
ComplexFileTable
A
B
C
D
E
F
G
H
I
L
N
O
P
R
S
T
U
W
_
Skip navigation links
Overview
Package
Class
Use
Tree
Deprecated
Index
Help
Prev
Next
Frames
No Frames
All Classes
Copyright © 2021. All rights reserved.