public class Clitics extends Object
proclitics) and enclitics (enclitics) in this version.
Meso- and endoclitics are not yet supported.
The String representation of the respective clitics needs to be a
regular expression, as it will be used to Pattern.compile(String) a
pattern to split the STextualDS's text, i.e., as below.
Pattern.compile("^" XClitic "(.)$")
Two examples for such a regex string are (note the main group!):
"('(s|re|ve|d|m|em|ll)|n't)""([dcjlmnstDCJLNMST]'|[Qq]u'|[Jj]usqu'|[Ll]orsqu')"Tokenizer.| Constructor and Description |
|---|
Clitics(String proclitics,
String enclitics) |
| Modifier and Type | Method and Description |
|---|---|
String |
getEnclitics() |
String |
getProclitics() |
Copyright © 2009–2019 Humboldt-Universität zu Berlin, INRIA. All rights reserved.