public class CatalanWordTokenizer extends WordTokenizer
REMOVED_EMOJI| Constructor and Description |
|---|
CatalanWordTokenizer() |
| Modifier and Type | Method and Description |
|---|---|
List<String> |
tokenize(String text) |
getProtocols, getTokenizingCharacters, isCurrencyExpression, isEMail, isUrl, joinEMails, joinEMailsAndUrls, joinUrls, replaceEmojis, restoreEmojis, splitCurrencyExpressionpublic List<String> tokenize(String text)
tokenize in interface Tokenizertokenize in class WordTokenizertext - Text to tokenize