Package org.fryske_akademy.exist.lucene
Class NoPunctuationTokenizer
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.Tokenizer
org.apache.lucene.analysis.util.CharTokenizer
org.fryske_akademy.exist.lucene.NoPunctuationTokenizer
- All Implemented Interfaces:
Closeable,AutoCloseable
public class NoPunctuationTokenizer
extends org.apache.lucene.analysis.util.CharTokenizer
For this tokenizer every character is a tokenChar except whitespace and . , ; ? ! : [ ] ( ) { } " '
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
org.apache.lucene.util.AttributeSource.State -
Field Summary
FieldsFields inherited from class org.apache.lucene.analysis.Tokenizer
inputFields inherited from class org.apache.lucene.analysis.TokenStream
DEFAULT_TOKEN_ATTRIBUTE_FACTORYFields inherited from class org.apache.lucene.util.AttributeSource
DEFAULT_ATTRIBUTE_FACTORY -
Constructor Summary
ConstructorsConstructorDescriptionNoPunctuationTokenizer(Reader input) NoPunctuationTokenizer(org.apache.lucene.util.AttributeFactory factory, Reader input) -
Method Summary
Methods inherited from class org.apache.lucene.analysis.util.CharTokenizer
end, incrementToken, normalize, resetMethods inherited from class org.apache.lucene.analysis.Tokenizer
close, correctOffset, setReaderMethods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
-
Field Details
-
PUNCTS
public static final char[] PUNCTS
-
-
Constructor Details
-
NoPunctuationTokenizer
-
NoPunctuationTokenizer
-
-
Method Details
-
isTokenChar
protected boolean isTokenChar(int c) - Specified by:
isTokenCharin classorg.apache.lucene.analysis.util.CharTokenizer
-