chalk

corpora

package corpora

Visibility
  1. Public
  2. All

Type Members

  1. class Corpora extends AnyRef

    Class to help load corpora.

  2. trait Corpus[+T] extends AnyRef

    Represents a corpus of something.

  3. trait JarCorpus[+T] extends Corpus[T]

    Pretty useful for creating a corpus from a jar file where the resources are packaged in the jar file.

  4. case class MAnnotation(id: String, label: String, ref: String, features: Map[String, String]) extends Product with Serializable

  5. case class MEdge(id: String, from: String, to: String) extends Product with Serializable

  6. case class MNode(id: String, targets: Seq[String]) extends Product with Serializable

  7. case class MRegion(id: String, start: Int, end: Int) extends Ordered[MRegion] with Product with Serializable

  8. class MascFile extends AnyRef

  9. case class MascSentence(orderedTokens: Seq[String], orderedPos: Seq[String], bioLabels: Seq[String], orderedRegions: Seq[MRegion]) extends Product with Serializable

Value Members

  1. object CONLLSequenceReader

    Reads tag sequences in the conll shared task format.

  2. object Corpora extends Corpora

  3. object CorpusUtils

  4. object MascFile

  5. object MascTransform

    Convert native MASC xml into CONLL format for named entity recognition.

  6. object MascUtil

    Simple objects and functions for working with MASC data.

Ungrouped