org.languagetool.dev.index
Class AnyCharTokenizer
java.lang.Object
  org.apache.lucene.util.AttributeSource
    org.apache.lucene.analysis.TokenStream
      org.apache.lucene.analysis.Tokenizer
        org.apache.lucene.analysis.util.CharTokenizer
          org.languagetool.dev.index.AnyCharTokenizer
- All Implemented Interfaces:
- Closeable
public final class AnyCharTokenizer
- extends org.apache.lucene.analysis.util.CharTokenizer
A tokenizer that emits the whole input as a single token.
- Author:
- Tao Lin
| Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource |
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State |
| Fields inherited from class org.apache.lucene.analysis.Tokenizer |
input |
Constructor Summary |
AnyCharTokenizer(org.apache.lucene.util.Version matchVersion,
org.apache.lucene.util.AttributeSource.AttributeFactory factory,
Reader in)
Construct a new AnyCharTokenizer using a given AttributeSource.AttributeFactory. |
AnyCharTokenizer(org.apache.lucene.util.Version matchVersion,
org.apache.lucene.util.AttributeSource source,
Reader in)
Construct a new AnyCharTokenizer using a given AttributeSource. |
AnyCharTokenizer(org.apache.lucene.util.Version matchVersion,
Reader in)
Construct a new AnyCharTokenizer. |
Method Summary |
protected boolean |
isTokenChar(int c)
Accepts every character as a token character; no character acts as a delimiter. |
| Methods inherited from class org.apache.lucene.analysis.util.CharTokenizer |
end, incrementToken, normalize, reset |
| Methods inherited from class org.apache.lucene.analysis.Tokenizer |
close, correctOffset, setReader |
| Methods inherited from class org.apache.lucene.util.AttributeSource |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState |
AnyCharTokenizer
public AnyCharTokenizer(org.apache.lucene.util.Version matchVersion,
Reader in)
- Construct a new AnyCharTokenizer.
- Parameters:
matchVersion - Lucene version to match
in - the input to split up into tokens
AnyCharTokenizer
public AnyCharTokenizer(org.apache.lucene.util.Version matchVersion,
org.apache.lucene.util.AttributeSource source,
Reader in)
- Construct a new AnyCharTokenizer using a given AttributeSource.
- Parameters:
matchVersion - Lucene version to match
source - the attribute source to use for this Tokenizer
in - the input to split up into tokens
AnyCharTokenizer
public AnyCharTokenizer(org.apache.lucene.util.Version matchVersion,
org.apache.lucene.util.AttributeSource.AttributeFactory factory,
Reader in)
- Construct a new AnyCharTokenizer using a given AttributeSource.AttributeFactory.
- Parameters:
matchVersion - Lucene version to match
factory - the attribute factory to use for this Tokenizer
in - the input to split up into tokens
isTokenChar
protected boolean isTokenChar(int c)
- Accepts every character as a token character; no character acts as a delimiter.
- Specified by:
isTokenChar in class org.apache.lucene.analysis.util.CharTokenizer
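- Example:
The effect of an isTokenChar that accepts every character can be illustrated without Lucene. The sketch below is not the Lucene implementation: CharTokenizerSketch and its tokenize method are hypothetical names introduced here, and the buffering, offset tracking, and normalization that the real CharTokenizer performs are left out. It only shows that a predicate returning true for every code point never ends a token, whereas a whitespace-based predicate splits the same input.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

// Hypothetical stand-in for a CharTokenizer-style loop; not part of Lucene or LanguageTool.
class CharTokenizerSketch {

    /** Splits text into maximal runs of code points accepted by isTokenChar. */
    static List<String> tokenize(String text, IntPredicate isTokenChar) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int c = text.codePointAt(i);
            i += Character.charCount(c);
            if (isTokenChar.test(c)) {
                // Accepted code points extend the current token.
                current.appendCodePoint(c);
            } else if (current.length() > 0) {
                // A rejected code point ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "Hello, world!";
        // Whitespace acts as a delimiter: two tokens.
        System.out.println(tokenize(text, c -> !Character.isWhitespace(c)).size());
        // Every code point is accepted, as in AnyCharTokenizer: one token.
        System.out.println(tokenize(text, c -> true).size());
    }
}
```

With the always-true predicate, the only token produced for a non-empty input is the input itself, which matches the class description above.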
Copyright © 2013. All Rights Reserved.