edu.washington.cs.knowitall.sequence
Class LayeredPatternTokenizer
java.lang.Object
edu.washington.cs.knowitall.sequence.LayeredPatternTokenizer
public class LayeredPatternTokenizer
- extends Object
Tokenizes the String representation of patterns for the
LayeredTokenPattern class. A pattern consists of two types of tokens:
layer/symbol pairs like The_pos, and meta-characters defined by
java.util.regex.Pattern. This class takes a string made up of
these and breaks it into tokens.
- Author:
- afader
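The tokenization described above can be illustrated with a minimal, self-contained sketch. It is not the LayeredPatternTokenizer source: the class name PatternTokenizerSketch, the token grammar (a pair is alphanumeric symbol, underscore, alphanumeric layer name; a meta-character is one of ( ) [ ] | ? * + ^), and the use of IllegalArgumentException in place of SequenceException are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative sketch only; the real tokenizer's grammar may differ.
class PatternTokenizerSketch {

    // Assumed grammar: optional whitespace, then either a symbol_layer pair
    // (group 1, e.g. "DT_pos") or a single regex meta-character (group 2).
    private static final Pattern TOKEN = Pattern.compile(
        "\\s*(?:([A-Za-z0-9$]+_[A-Za-z0-9]+)|([()\\[\\]|?*+^]))");

    static String[] tokenize(String text) {
        String input = text.trim();
        List<String> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(input);
        int pos = 0;
        while (pos < input.length()) {
            // Try to match exactly one token starting at the current offset.
            m.region(pos, input.length());
            if (!m.lookingAt()) {
                // Stand-in for SequenceException in this sketch.
                throw new IllegalArgumentException(
                    "Cannot tokenize at offset " + pos + ": " + input);
            }
            tokens.add(m.group(1) != null ? m.group(1) : m.group(2));
            pos = m.end();
        }
        return tokens.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Prints each token on its own line.
        for (String token : tokenize("(DT_pos? JJ_pos)* NN_pos+")) {
            System.out.println(token);
        }
    }
}
```

Matching one token at a time with lookingAt means any character that fits neither token type is reported at its offset rather than silently skipped.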
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
LayeredPatternTokenizer
public LayeredPatternTokenizer()
- Constructs a new tokenizer.
tokenize
public String[] tokenize(String text)
throws SequenceException
- Tokenizes the given pattern text into layer/symbol pairs and meta-characters.
- Parameters:
text - the pattern String to tokenize
- Returns:
- an array of Strings, one for each token in text
- Throws:
SequenceException - if unable to tokenize the text
isSymbolLayerName
public boolean isSymbolLayerName(String text)
- Parameters:
text - the String to check
- Returns:
- true if the given String is a symbol/layer name pair
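A check of this kind could be sketched as below. This is only an illustration under an assumed rule (alphanumeric symbol, a single underscore, alphanumeric layer name); the class SymbolLayerCheckSketch and its isPair method are hypothetical and do not reproduce the library's actual test.

```java
import java.util.regex.Pattern;

// Illustrative sketch only; the library's actual rule may differ.
class SymbolLayerCheckSketch {

    // Assumption: symbol, then '_', then layer name, each non-empty.
    private static final Pattern PAIR =
        Pattern.compile("[A-Za-z0-9$]+_[A-Za-z0-9]+");

    static boolean isPair(String text) {
        // matches() requires the whole string to fit the pattern.
        return PAIR.matcher(text).matches();
    }

    public static void main(String[] args) {
        System.out.println(isPair("The_pos"));
        System.out.println(isPair("+"));
    }
}
```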
getSymbolLayerName
public String[] getSymbolLayerName(String text)
- Parameters:
text -
- Returns:
Copyright © 2010-2012 University of Washington CSE. All Rights Reserved.