public class RussianWordTokenizer extends WordTokenizer
REMOVED_EMOJI| Constructor and Description |
|---|
RussianWordTokenizer() |
| Modifier and Type | Method and Description |
|---|---|
String |
getTokenizingCharacters() |
List<String> |
tokenize(String text) |
getProtocols, isCurrencyExpression, isEMail, isUrl, joinEMails, joinEMailsAndUrls, joinUrls, replaceEmojis, restoreEmojis, splitCurrencyExpressionpublic String getTokenizingCharacters()
getTokenizingCharacters in class WordTokenizer