Class LuceneAnalyzerTokenizerAdapter

  • All Implemented Interfaces:
    org.carrot2.language.Tokenizer

    public class LuceneAnalyzerTokenizerAdapter
    extends java.lang.Object
    implements org.carrot2.language.Tokenizer
    • Field Summary

      • Fields inherited from interface org.carrot2.language.Tokenizer

        TF_COMMON_WORD, TF_QUERY_WORD, TF_SEPARATOR_DOCUMENT, TF_SEPARATOR_FIELD, TF_SEPARATOR_SENTENCE, TF_TERMINATOR, TT_ACRONYM, TT_BARE_URL, TT_EMAIL, TT_EOF, TT_FILE, TT_FULL_URL, TT_HYPHTERM, TT_NUMERIC, TT_PUNCTUATION, TT_TERM, TYPE_MASK
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      short nextToken()  
      void reset​(java.io.Reader reader)  
      void setTermBuffer​(org.carrot2.util.MutableCharArray array)  
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • LuceneAnalyzerTokenizerAdapter

        public LuceneAnalyzerTokenizerAdapter​(org.apache.lucene.analysis.Analyzer analyzer)
    • Method Detail

      • reset

        public void reset​(java.io.Reader reader)
                   throws java.io.IOException
        Specified by:
        reset in interface org.carrot2.language.Tokenizer
        Throws:
        java.io.IOException
      • nextToken

        public short nextToken()
        Specified by:
        nextToken in interface org.carrot2.language.Tokenizer
      • setTermBuffer

        public void setTermBuffer​(org.carrot2.util.MutableCharArray array)
        Specified by:
        setTermBuffer in interface org.carrot2.language.Tokenizer