Class Stemmer
- java.lang.Object
-
- edu.cornell.mannlib.vitro.webapp.utils.Stemmer
-
public class Stemmer extends Object
Stemmer, implementing the Porter Stemming Algorithm The Stemmer class transforms a word into its root form. The input word can be provided a character at time (by calling add()), or at once by calling one of the various stem(something) methods.
-
-
Constructor Summary
Constructors Constructor Description Stemmer()
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description voidadd(char ch)Add a character to the word being stemmed.voidadd(char[] w, int wLen)Adds wLen characters to the word being stemmed contained in a portion of a char[] array.char[]getResultBuffer()Returns a reference to a character buffer containing the results of the stemming process.intgetResultLength()Returns the length of the word resulting from the stemming process.static voidmain(String[] args)voidstem()Stem the word placed into the Stemmer buffer through calls to add().static StringStemString(String inputStr, int maxLength)StringtoString()After a word has been stemmed, it can be retrieved by toString(), or a reference to the internal buffer can be retrieved by getResultBuffer and getResultLength (which is generally more efficient.)
-
-
-
Method Detail
-
add
public void add(char ch)
Add a character to the word being stemmed. When you are finished adding characters, you can call stem(void) to stem the word.
-
add
public void add(char[] w, int wLen)Adds wLen characters to the word being stemmed contained in a portion of a char[] array. This is like repeated calls of add(char ch), but faster.
-
toString
public String toString()
After a word has been stemmed, it can be retrieved by toString(), or a reference to the internal buffer can be retrieved by getResultBuffer and getResultLength (which is generally more efficient.)
-
getResultLength
public int getResultLength()
Returns the length of the word resulting from the stemming process.
-
getResultBuffer
public char[] getResultBuffer()
Returns a reference to a character buffer containing the results of the stemming process. You also need to consult getResultLength() to determine the length of the result.
-
stem
public void stem()
Stem the word placed into the Stemmer buffer through calls to add(). Returns true if the stemming process resulted in a word different from the input. You can retrieve the result with getResultLength()/getResultBuffer() or toString().
-
main
public static void main(String[] args)
-
-