org.allenai.scienceparse.pipeline

SimilarityMeasures

Related Doc: package pipeline

object SimilarityMeasures

This contains a bunch of helper functions stolen from the pipeline code. We need it here to anticipate how well the pipeline will work with the output from science-parse.

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. SimilarityMeasures
  2. AnyRef
  3. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. def characterNgramSimilarity(left: String, right: String, ngramLength: Int = 3, s: Double = 1.2, l: Int = 10): Option[Double]

    Smooth interpolation between containment Jaccard and plain Jaccard, based on character n-grams.

    Smooth interpolation between containment Jaccard and plain Jaccard, based on character n-grams. Short strings must match exactly, but longer strings are considered a match if one is a substring of the other.

    The final score is (J + F * JC) / (1 + F) in which J is the plain Jaccard JC is the containment Jaccard F = s ** (m - 1) m is the minimum length of the two strings s, l are parameters

    left

    String to compare

    right

    Other string to compare

    ngramLength

    Longer values will give a larger penalty to single-character typos

    s

    Determines how rapidly F rises with string length

    l

    The string length (in characters) for which which the two Jaccard scores have equal weights

    returns

  6. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. def containmentJaccardSim[T](s1: Set[T], s2: Set[T]): Double

  8. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  9. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  10. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  11. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  12. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  13. def identical(left: String, right: String): Option[Double]

  14. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  15. def jaccardSim[T](s1: Set[T], s2: Set[T]): Double

  16. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Definition Classes
    AnyRef
  18. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  19. def pickFromOptions[T](members: Option[T]*): Option[T]

  20. def prePostfix(left: String, right: String, transform: (Int) ⇒ Double = x => x / (x + 0.5)): Option[Double]

  21. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  22. def titleNgramSimilarity(left: TitleAuthors, right: TitleAuthors, s: Double = 1.2, l: Int = 10): Option[Double]

  23. def toString(): String

    Definition Classes
    AnyRef → Any
  24. def twoWayPrePostfix(left: String, right: String, transform: (Int) ⇒ Double = x => x / (x + 0.5)): Option[Double]

  25. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  27. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from AnyRef

Inherited from Any

Ungrouped