edu.washington.cs.knowitall.sequence
Class LayeredPatternTokenizer

java.lang.Object
  extended by edu.washington.cs.knowitall.sequence.LayeredPatternTokenizer

public class LayeredPatternTokenizer
extends Object

A class that tokenizes the String representation of patterns for the LayeredTokenPattern class. A pattern consists of two types of tokens: symbol/layer pairs such as The_pos, and meta-characters defined by java.util.regex.Pattern. This class takes a string made up of these and breaks it into tokens.
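The tokenization described above can be illustrated with a minimal, self-contained sketch. It is not the library's implementation: the class name PatternTokenSketch, the token grammar (symbols made of letters, digits and $, layer names made of letters and digits, and the meta-character set ()[]*+?|^), and the use of IllegalArgumentException in place of SequenceException are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-in for illustration; NOT LayeredPatternTokenizer itself.
class PatternTokenSketch {

    // Assumed token grammar: a symbol_layer pair (group 1)
    // or a single regex meta-character (group 2).
    private static final Pattern TOKEN = Pattern.compile(
        "([A-Za-z0-9$]+_[A-Za-z0-9]+)|([()\\[\\]*+?|^])");

    static String[] tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(text);
        int pos = 0;
        while (pos < text.length()) {
            // Whitespace only separates tokens; it is never a token itself.
            if (Character.isWhitespace(text.charAt(pos))) {
                pos++;
                continue;
            }
            // Try to match exactly one token starting at the current position.
            m.region(pos, text.length());
            if (!m.lookingAt()) {
                throw new IllegalArgumentException(
                    "Cannot tokenize at index " + pos + ": " + text);
            }
            tokens.add(m.group());
            pos = m.end();
        }
        return tokens.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Prints: [The_pos, JJ_pos, *, NN_pos, +]
        System.out.println(Arrays.toString(tokenize("The_pos JJ_pos* NN_pos+")));
    }
}
```

Under these assumptions, the pattern string "The_pos JJ_pos* NN_pos+" yields five tokens: three symbol/layer pairs and two meta-characters. Input that fits neither token type, such as a bare word with no layer suffix, is rejected.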

Author:
afader

Constructor Summary
LayeredPatternTokenizer()
          Constructs a new tokenizer.
 
Method Summary
 String[] getSymbolLayerName(String text)
           
 boolean isSymbolLayerName(String text)
          Returns true if the given String is a symbol/layer name pair.
 String[] tokenize(String text)
          Tokenizes the given text.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

LayeredPatternTokenizer

public LayeredPatternTokenizer()
Constructs a new tokenizer.

Method Detail

tokenize

public String[] tokenize(String text)
                  throws SequenceException
Tokenizes the given text.

Parameters:
text - the String representation of a pattern to tokenize
Returns:
an array of Strings, one for each token in text
Throws:
SequenceException - if unable to tokenize the text

isSymbolLayerName

public boolean isSymbolLayerName(String text)
Parameters:
text - the String to test
Returns:
true if the given String is a symbol/layer name pair
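As a rough illustration of what a symbol/layer name check could look like, the sketch below tests a token against an assumed pair format and splits it at the underscore. The class SymbolLayerSketch, its method names, the allowed characters, and the {symbol, layer} ordering of the returned array are all hypothetical; this page does not document the actual rules or the return format of getSymbolLayerName.

```java
import java.util.regex.Pattern;

// Hypothetical helper for illustration; NOT part of the documented API.
class SymbolLayerSketch {

    // Assumed pair format: symbol, one underscore, alphanumeric layer name.
    private static final Pattern PAIR =
        Pattern.compile("[A-Za-z0-9$]+_[A-Za-z0-9]+");

    // True only if the whole token has the assumed symbol_layer shape.
    static boolean isPair(String text) {
        return PAIR.matcher(text).matches();
    }

    // Splits a pair at its underscore; the {symbol, layer} order is an assumption.
    static String[] split(String text) {
        if (!isPair(text)) {
            throw new IllegalArgumentException("Not a symbol/layer pair: " + text);
        }
        int i = text.lastIndexOf('_');
        return new String[] { text.substring(0, i), text.substring(i + 1) };
    }
}
```

With this sketch, isPair("The_pos") is true and split("The_pos") gives the symbol "The" and the layer "pos", while a meta-character such as "*" is not a pair.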

getSymbolLayerName

public String[] getSymbolLayerName(String text)
Parameters:
text -
Returns:


Copyright © 2010-2012 University of Washington CSE. All Rights Reserved.