Class LuceneAnalyzerTokenizerAdapter

java.lang.Object
org.carrot2.language.extras.LuceneAnalyzerTokenizerAdapter
All Implemented Interfaces:
org.carrot2.language.Tokenizer

public class LuceneAnalyzerTokenizerAdapter
extends java.lang.Object
implements org.carrot2.language.Tokenizer
  • Field Summary

    Fields inherited from interface org.carrot2.language.Tokenizer

    TF_COMMON_WORD, TF_QUERY_WORD, TF_SEPARATOR_DOCUMENT, TF_SEPARATOR_FIELD, TF_SEPARATOR_SENTENCE, TF_TERMINATOR, TT_ACRONYM, TT_BARE_URL, TT_EMAIL, TT_EOF, TT_FILE, TT_FULL_URL, TT_HYPHTERM, TT_NUMERIC, TT_PUNCTUATION, TT_TERM, TYPE_MASK
  • Constructor Summary

    Constructors 
    Constructor Description
    LuceneAnalyzerTokenizerAdapter​(org.apache.lucene.analysis.Analyzer analyzer)  
  • Method Summary

    Modifier and Type Method Description
    short nextToken()  
    void reset​(java.io.Reader reader)  
    void setTermBuffer​(org.carrot2.util.MutableCharArray array)  

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

  • Method Details

    • reset

      public void reset​(java.io.Reader reader) throws java.io.IOException
      Specified by:
      reset in interface org.carrot2.language.Tokenizer
      Throws:
      java.io.IOException
    • nextToken

      public short nextToken()
      Specified by:
      nextToken in interface org.carrot2.language.Tokenizer
    • setTermBuffer

      public void setTermBuffer​(org.carrot2.util.MutableCharArray array)
      Specified by:
      setTermBuffer in interface org.carrot2.language.Tokenizer