One band of the Locality Sensitive Hash.
Capability trait for packing and unpacking tuples (or other things) to a List of Any.
A confusion matrix for comparing gold clusters to some predicted clusters.
Simple line function: y = mx+b
A Locality Sensitive Hash that hashes the documents into buckets.
Low priority implicit for packing a single value as itself.
Implicit packers for tuples.
A tokenizer that replaces all non-word characters with whitespace and then returns a StringTokenizer.
A companion object for constructing ConfusionMatrices.
Helper object for hash function functions.
A very simple tokenizer that pulls most puncuation off the characters.
Cleans up a string by ripping out punctuation, turning all digit sequences into a single numeric symbol, and getting rid of tokens that contain mixtures of alphabetic and numeric characters.