lux.index.analysis
Class AttributeTokenStream

java.lang.Object
  extended by org.apache.lucene.util.AttributeSource
      extended by org.apache.lucene.analysis.TokenStream
          extended by lux.index.analysis.TextOffsetTokenStream
              extended by lux.index.analysis.AttributeTokenStream
All Implemented Interfaces:
Closeable

public final class AttributeTokenStream
extends TextOffsetTokenStream

A TokenStream that extracts words from attributes in a Saxon Document model (XdmNode)


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
 
Field Summary
protected  Reader charStream
           
protected  Iterator<net.sf.saxon.s9api.XdmNode> contentIter
           
protected  net.sf.saxon.s9api.XdmNode curNode
           
protected static net.sf.saxon.s9api.XdmSequenceIterator EMPTY
           
protected  org.apache.lucene.analysis.tokenattributes.CharTermAttribute termAtt
           
 
Constructor Summary
AttributeTokenStream(String fieldName, org.apache.lucene.analysis.Analyzer analyzer, org.apache.lucene.analysis.TokenStream wrapped, net.sf.saxon.s9api.XdmNode doc, Offsets offsets)
           
 
Method Summary
 org.apache.lucene.analysis.TokenStream getWrappedTokenStream()
           
 boolean incrementToken()
           
protected  boolean incrementWrappedTokenStream()
           
 void reset()
           
 void reset(Reader reader)
           
protected  void setWrappedTokenStream(org.apache.lucene.analysis.TokenStream wrapped)
           
protected  void updateNodeAtts()
           
 
Methods inherited from class lux.index.analysis.TextOffsetTokenStream
resetTokenizer
 
Methods inherited from class org.apache.lucene.analysis.TokenStream
close, end
 
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

curNode

protected net.sf.saxon.s9api.XdmNode curNode

contentIter

protected Iterator<net.sf.saxon.s9api.XdmNode> contentIter

termAtt

protected org.apache.lucene.analysis.tokenattributes.CharTermAttribute termAtt

charStream

protected Reader charStream

EMPTY

protected static final net.sf.saxon.s9api.XdmSequenceIterator EMPTY
Constructor Detail

AttributeTokenStream

public AttributeTokenStream(String fieldName,
                            org.apache.lucene.analysis.Analyzer analyzer,
                            org.apache.lucene.analysis.TokenStream wrapped,
                            net.sf.saxon.s9api.XdmNode doc,
                            Offsets offsets)
Method Detail

reset

public void reset(Reader reader)
           throws IOException
Throws:
IOException

updateNodeAtts

protected void updateNodeAtts()

reset

public void reset()
           throws IOException
Overrides:
reset in class org.apache.lucene.analysis.TokenStream
Throws:
IOException

incrementToken

public boolean incrementToken()
                       throws IOException
Specified by:
incrementToken in class org.apache.lucene.analysis.TokenStream
Throws:
IOException

getWrappedTokenStream

public org.apache.lucene.analysis.TokenStream getWrappedTokenStream()
Returns:
the underlying stream of text tokens to which additional xml-related attributes are added by this.

setWrappedTokenStream

protected void setWrappedTokenStream(org.apache.lucene.analysis.TokenStream wrapped)

incrementWrappedTokenStream

protected boolean incrementWrappedTokenStream()
                                       throws IOException
Throws:
IOException


Copyright © 2013. All Rights Reserved.