c

net.gonzberg.spark.sorting

SecondarySortGroupAndSortByPairRDDFunctions

final class SecondarySortGroupAndSortByPairRDDFunctions[K, V] extends Serializable

Linear Supertypes
Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. SecondarySortGroupAndSortByPairRDDFunctions
  2. Serializable
  3. AnyRef
  4. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. Protected

Instance Constructors

  1. new SecondarySortGroupAndSortByPairRDDFunctions(rdd: RDD[(K, V)])(implicit arg0: Ordering[K], arg1: ClassTag[K], arg2: ClassTag[V])

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##: Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.CloneNotSupportedException]) @native()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def equals(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef → Any
  8. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.Throwable])
  9. def foldLeftByKeySortedBy[S, A](startValue: A, op: (A, V) => A, sortBy: (V) => S)(implicit arg0: Ordering[S]): RDD[(K, A)]

    Groups by key and applies a binary operation using foldLeft over the values sorted by some implicit ordering

    Groups by key and applies a binary operation using foldLeft over the values sorted by some implicit ordering

    A

    the result type of the folding operation

    startValue

    the start value for the fold

    op

    the binary operation for folding

    sortBy

    how to sort values

    returns

    PairRDD with keys and values, where values are the result of applying foldLeft across the sorted values

  10. def foldLeftByKeySortedBy[S, A](startValue: A, op: (A, V) => A, sortBy: (V) => S, numPartitions: Int)(implicit arg0: Ordering[S]): RDD[(K, A)]

    Groups by key and applies a binary operation using foldLeft over the values sorted by some implicit ordering

    Groups by key and applies a binary operation using foldLeft over the values sorted by some implicit ordering

    A

    the result type of the folding operation

    startValue

    the start value for the fold

    op

    the binary operation for folding

    sortBy

    how to sort values

    numPartitions

    the number of partitions for shuffling

    returns

    PairRDD with keys and values, where values are the result of applying foldLeft across the sorted values

  11. def foldLeftByKeySortedBy[S, A](startValue: A, op: (A, V) => A, sortBy: (V) => S, partitioner: Partitioner)(implicit arg0: Ordering[S]): RDD[(K, A)]

    Groups by key and applies a binary operation using foldLeft over the values sorted by some implicit ordering

    Groups by key and applies a binary operation using foldLeft over the values sorted by some implicit ordering

    A

    the result type of the folding operation

    startValue

    the start value for the fold

    op

    the binary operation for folding

    sortBy

    how to sort values

    partitioner

    the partitioner for shuffling

    returns

    PairRDD with keys and values, where values are the result of applying foldLeft across the sorted values

  12. final def getClass(): Class[_ <: AnyRef]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  13. def groupByKeyAndSortBy[A](sortBy: (V) => A)(implicit arg0: Ordering[A]): RDD[(K, Iterable[V])]

    Groups by key and sorts the values by some implicit ordering

    Groups by key and sorts the values by some implicit ordering

    sortBy

    how to sort values

    returns

    a PairRDD of keys and sorted values

  14. def groupByKeyAndSortBy[A](sortBy: (V) => A, numPartitions: Int)(implicit arg0: Ordering[A]): RDD[(K, Iterable[V])]

    Groups by key and sorts the values by some implicit ordering

    Groups by key and sorts the values by some implicit ordering

    sortBy

    how to sort values

    numPartitions

    the number of partitions for shuffling

    returns

    PairRDD of keys and sorted values

  15. def groupByKeyAndSortBy[A](sortBy: (V) => A, partitioner: Partitioner)(implicit arg0: Ordering[A]): RDD[(K, Iterable[V])]

    Groups by key and sorts the values by some implicit ordering

    Groups by key and sorts the values by some implicit ordering

    sortBy

    how to sort values

    partitioner

    the partitioner for shuffling

    returns

    PairRDD of keys and sorted values

  16. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  17. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  18. def mapValuesWithKeyedPreparedResourceSortedBy[R, R1, S, A](resources: RDD[(K, R)], prepareResource: (R) => R1, op: (R1, V) => A, sortBy: (V) => S)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    prepareResource

    a function to transform the resource into what will be used in op

    op

    the operation to apply to each value. The operation takes a resource and value and returns the transformed value

    sortBy

    how to sort values

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  19. def mapValuesWithKeyedPreparedResourceSortedBy[R, R1, S, A](resources: RDD[(K, R)], prepareResource: (R) => R1, op: (R1, V) => A, sortBy: (V) => S, numPartitions: Int)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    prepareResource

    a function to transform the resource into what will be used in op

    op

    the operation to apply to each value. The operation takes a resource and value and returns the transformed value

    sortBy

    how to sort values

    numPartitions

    the number of partitions for shuffling

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  20. def mapValuesWithKeyedPreparedResourceSortedBy[R, R1, S, A](resources: RDD[(K, R)], prepareResource: (R) => R1, op: (R1, V) => A, sortBy: (V) => S, partitioner: Partitioner)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    prepareResource

    a function to transform the resource into what will be used in op

    op

    the operation to apply to each value. The operation takes a resource and value and returns the transformed value

    sortBy

    how to sort values

    partitioner

    the partitioner for shuffling

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  21. def mapValuesWithKeyedPreparedResourceSortedBy[R, S, A](resources: RDD[(K, R)], op: (R) => (V) => A, sortBy: (V) => S)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    op

    The operation to apply to each value. The operation takes a resource and returns a function, which will then be applied to each value.

    sortBy

    how to sort values

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  22. def mapValuesWithKeyedPreparedResourceSortedBy[R, S, A](resources: RDD[(K, R)], op: (R) => (V) => A, sortBy: (V) => S, numPartitions: Int)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    op

    The operation to apply to each value. The operation takes a resource and returns a function, which will then be applied to each value.

    sortBy

    how to sort values

    numPartitions

    the number of partitions for shuffling

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  23. def mapValuesWithKeyedPreparedResourceSortedBy[R, S, A](resources: RDD[(K, R)], op: (R) => (V) => A, sortBy: (V) => S, partitioner: Partitioner)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    op

    The operation to apply to each value. The operation takes a resource and returns a function, which will then be applied to each value.

    sortBy

    how to sort values

    partitioner

    the partitioner for shuffling

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  24. def mapValuesWithKeyedResourceSortedBy[R, S, A](resources: RDD[(K, R)], op: (R, V) => A, sortBy: (V) => S)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    op

    the operation to apply to each value. Takes a resource and value and returns the transformed value

    sortBy

    how to sort values

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  25. def mapValuesWithKeyedResourceSortedBy[R, S, A](resources: RDD[(K, R)], op: (R, V) => A, sortBy: (V) => S, numPartitions: Int)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    op

    the operation to apply to each value. Takes a resource and value and returns the transformed value

    sortBy

    how to sort values

    numPartitions

    the number of partitions for shuffling

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  26. def mapValuesWithKeyedResourceSortedBy[R, S, A](resources: RDD[(K, R)], op: (R, V) => A, sortBy: (V) => S, partitioner: Partitioner)(implicit arg0: ClassTag[R], arg1: Ordering[S]): RDD[(K, A)]

    Applies op to every value with some resource, where values and resources share the same key.

    Applies op to every value with some resource, where values and resources share the same key. This allows you to send data to executors based on key, so that: (1) the entire set of resources are not held in memory on all executors; and, (2) a specific resource is not duplicated; it is reused for all corresponding data values One example usage might be when conducting a geospatial operation. If the keys indicate a geographic area, and the value contains geospatial resources in that geographic area, one can apply a method using geospatially local resources to all values while reducing data duplication and shuffling.

    R

    the type of resources being used

    A

    the type returned by applying the operation with the resource to each value

    resources

    a PairRDD of keys and resources, where keys are used to determine what data the resource is associated with for the operation. There must be exactly one resource for each key in the RDD this method is applied to

    op

    the operation to apply to each value. Takes a resource and value and returns the transformed value

    sortBy

    how to sort values

    partitioner

    the partitioner for shuffling

    returns

    PairRDD of values transformed by applying the operation with the appropriate resource

  27. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  28. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  29. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  30. final def synchronized[T0](arg0: => T0): T0
    Definition Classes
    AnyRef
  31. def toString(): String
    Definition Classes
    AnyRef → Any
  32. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  33. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  34. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException]) @native()

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped