Package lux.index.analysis

Class Summary
AttributeTokenStream A TokenStream that extracts words from attributes in a Saxon Document model (XdmNode)
CharSequenceStream  
ContentIterator Iterates over //text(); all descendant text nodes
DefaultAnalyzer  
ElementTokenStream A TokenStream that extracts text from a Saxon Document model (XdmNode) and generates a token for every "word" for every element that contains it.
OffsetCharFilter exposes the offset map so it can be set externally.
PathValueTokenStream  
QNameTokenFilter Expand the input term by adding additional terms at the same position, prefixed by the node names (QNames) found in the QNameAttribute.
QNameValueTokenStream  
TextOffsetTokenStream This TokenStream records the offsets (character positions in the original text) of every token.
ValueTokenStream  
WhitespaceGapAnalyzer An Analyzer that uses WhitespaceTokenizer, and inserts position gaps of 100 between multiple values to inhibit phrase and span matches across values.
WildcardAnalyzer Used by the XML query parser to handle wildcarded terms.
XmlTextTokenStream Extracts tokens from an s9api XML document tree (XdmNode) in order to make them available to Lucene classes that accept TokenStreams, like the indexer and highlighter.
 



Copyright © 2013. All Rights Reserved.