chalk.text.tokenize

SimpleEnglishTokenizer

trait SimpleEnglishTokenizer extends Tokenizer

A simple tokenizer for English documents. It splits text into words on whitespace or punctuation, but keeps word-internal punctuation within the word. Whitespace itself is skipped and never returned as a token.

Because this tokenizer may improve over time in non-backwards-compatible ways, its implementations are versioned. By default, SimpleEnglishTokenizer.apply() returns an instance of SimpleEnglishTokenizer.V1. To get an instance of the old version (based on patterns by Steven Bethard), call SimpleEnglishTokenizer.V0().
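
As a minimal usage sketch of the two entry points described above, assuming the chalk library is on the classpath: the object name TokenizeExample and the sample sentence are illustrative only, and the exact tokens produced depend on the tokenizer version, so no output is shown.

```scala
import chalk.text.tokenize.SimpleEnglishTokenizer

// Illustrative wrapper object; the name is not part of chalk.
object TokenizeExample {
  // Default tokenizer: per the note above, apply() returns a V1 instance.
  val tokenizer = SimpleEnglishTokenizer()

  // The older, pattern-based version must be requested explicitly.
  val legacy = SimpleEnglishTokenizer.V0()

  def main(args: Array[String]): Unit = {
    val text = "Dr. Smith's well-known result isn't new."

    // A tokenizer is a (String) => Iterable[String], so it is applied like a function.
    println(tokenizer(text).mkString(" | "))
    println(legacy(text).mkString(" | "))
  }
}
```

Pinning a version explicitly (V0 or V1) rather than relying on apply() is one way to keep tokenization stable if the default changes in a later release.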

Linear Supertypes
Tokenizer, Serializable, Serializable, (String) ⇒ Iterable[String], AnyRef, Any
Known Subclasses

Abstract Value Members

  1. abstract def apply(v1: String): Iterable[String]

    Definition Classes
    Function1

Concrete Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def andThen(g: Transformer): Tokenizer

    Definition Classes
    Tokenizer
  7. def andThen[A](g: (Iterable[String]) ⇒ A): (String) ⇒ A

    Definition Classes
    Function1
    Annotations
    @unspecialized()
  8. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  9. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  10. def compose[A](g: (A) ⇒ String): (A) ⇒ Iterable[String]

    Definition Classes
    Function1
    Annotations
    @unspecialized()
  11. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  12. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  13. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  14. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  15. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  16. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  17. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  18. final def notify(): Unit

    Definition Classes
    AnyRef
  19. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  20. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  21. def toString(): String

    Definition Classes
    Tokenizer → Function1 → AnyRef → Any
  22. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  24. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. def ~>(g: Transformer): Chain

    Definition Classes
    Tokenizer
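
The andThen[A] and compose[A] members listed above are inherited from Function1, so a tokenizer can be chained with ordinary functions. The sketch below illustrates only that composition behavior. To stay runnable without chalk, it uses a hypothetical stand-in, whitespaceTokens, which has the same function type as a tokenizer but does not reproduce SimpleEnglishTokenizer's splitting rules.

```scala
object CompositionSketch {
  // Hypothetical stand-in with the same function type as a Tokenizer.
  // This is NOT chalk's tokenization logic; it only splits on whitespace.
  val whitespaceTokens: String => Iterable[String] =
    s => s.trim.split("\\s+").toSeq.filter(_.nonEmpty)

  // andThen[A]: post-process the tokens; the result is a (String) => A.
  val countTokens: String => Int = whitespaceTokens.andThen(_.size)

  // compose[A]: pre-process the input; the result is an (A) => Iterable[String].
  val tokenizeLines: Seq[String] => Iterable[String] =
    whitespaceTokens.compose((lines: Seq[String]) => lines.mkString(" "))

  def main(args: Array[String]): Unit = {
    println(countTokens("keeps word-internal punctuation"))                 // prints 3
    println(tokenizeLines(Seq("first line", "second line")).mkString(", ")) // prints first, line, second, line
  }
}
```

With a real tokenizer instance the same calls should apply, but because Tokenizer also declares andThen(g: Transformer), the function argument may need an explicit parameter type for overload resolution to pick the Function1 variant.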
