Packages

implicit final class MapKeyValueWrapper[K, V] extends AnyVal

Maps RDDs without changing the partitioning.

Linear Supertypes
AnyVal, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. MapKeyValueWrapper
  2. AnyVal
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new MapKeyValueWrapper(rdd: RDD[(K, V)])

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    Any
  2. final def ##(): Int
    Definition Classes
    Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def collectWithKey[U](pf: PartialFunction[(K, V), U]): RDD[(K, U)]

    Performs filtering and mapping on values of RDD preserving partitioning.

    Performs filtering and mapping on values of RDD preserving partitioning.

    U

    the type of the new values

    pf

    the mapping partial function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element

  6. def flatMapValuesWithKey[U](f: ((K, V)) ⇒ GenIterable[U]): RDD[(K, U)]

    Similar to flatMap on a pair RDD, but does not allow changing the key and preserves the partitioning.

    Similar to flatMap on a pair RDD, but does not allow changing the key and preserves the partitioning.

    U

    the type of the new values

    f

    the mapping function providing the new value for the element

    returns

    an RDD with mapping function applied to each element

  7. def flatMapValuesWithKeyParallel[U](numThreads: Int)(f: ((K, V)) ⇒ GenIterable[U]): RDD[(K, U)]

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    U

    the ype of the new values

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    f

    the mapping function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element

  8. def flatMapValuesWithKeyParallel[U](f: ((K, V)) ⇒ GenIterable[U], numThreads: Int): RDD[(K, U)]

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    U

    the ype of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

  9. def flatMapValuesWithKeyParallelUnordered[U](numThreads: Int)(f: ((K, V)) ⇒ GenIterable[U]): RDD[(K, U)]

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the ype of the new values

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    f

    the mapping function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element

  10. def flatMapValuesWithKeyParallelUnordered[U](f: ((K, V)) ⇒ GenIterable[U], numThreads: Int): RDD[(K, U)]

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to flatMapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the ype of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

  11. def getClass(): Class[_ <: AnyVal]
    Definition Classes
    AnyVal → Any
  12. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  13. def mapValuesParallel[U](f: (V) ⇒ U, numThreads: Int): RDD[(K, U)]

    Similar to mapValues on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValues on a pair RDD, but uses a thread pool if numThreads is >1.

    U

    the type of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

  14. def mapValuesParallelUnordered[U](f: (V) ⇒ U, numThreads: Int): RDD[(K, U)]

    Similar to mapValues on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValues on a pair RDD, but uses a thread pool if numThreads is >1. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the type of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

  15. def mapValuesSortedWithKey[U, B](f: ((K, V)) ⇒ U, s: ((K, V)) ⇒ B)(implicit ord: Ordering[B]): RDD[(K, U)]

    Similar to mapValues on a pair RDD, but provides the key as additional input to the mapping function and sorts the elements in each Spark partition before applying the function.

    Similar to mapValues on a pair RDD, but provides the key as additional input to the mapping function and sorts the elements in each Spark partition before applying the function.

    U

    the type of the new values

    B

    the type of the ordering for the sort

    f

    the mapping function, providing the new value for the element

    s

    the mapping function, providing the ordering value of type B

    ord

    the ordering assumed on type B

    returns

    an RDD with mapping function applied to each element

  16. def mapValuesSortedWithKeyParallel[U, B](f: ((K, V)) ⇒ U, s: ((K, V)) ⇒ B, numThreads: Int)(implicit ord: Ordering[B]): RDD[(K, U)]

    Similar to mapValuesSortedWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValuesSortedWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    U

    the type of the new values

    B

    the type of the ordering for the sort

    f

    the mapping function, providing the new value for the element

    s

    the mapping function, providing the ordering value of type B

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    ord

    the ordering assumed on type B

    returns

    an RDD with mapping function applied to each element

  17. def mapValuesSortedWithKeyParallelUnordered[U, B](f: ((K, V)) ⇒ U, s: ((K, V)) ⇒ B, numThreads: Int)(implicit ord: Ordering[B]): RDD[(K, U)]

    Similar to mapValuesSortedWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValuesSortedWithKey on a pair RDD, but uses a thread pool if numThreads is >1. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the type of the new values

    B

    the type of the ordering for the sort

    f

    the mapping function, providing the new value for the element

    s

    the mapping function, providing the ordering value of type B

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    ord

    the ordering assumed on type B

    returns

    an RDD with mapping function applied to each element

  18. def mapValuesWithKey[U](f: ((K, V)) ⇒ U): RDD[(K, U)]

    Similar to mapValues on a pair RDD, but provides the key as additional input to the mapping function.

    Similar to mapValues on a pair RDD, but provides the key as additional input to the mapping function.

    U

    the type of the new values

    f

    the mapping function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element.

  19. def mapValuesWithKeyParallel[U](numThreads: Int)(f: ((K, V)) ⇒ U): RDD[(K, U)]

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    U

    the type of the new values

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    f

    the mapping function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element

  20. def mapValuesWithKeyParallel[U](f: ((K, V)) ⇒ U, numThreads: Int): RDD[(K, U)]

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    U

    the type of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

  21. def mapValuesWithKeyParallelUnordered[U](numThreads: Int)(f: ((K, V)) ⇒ U): RDD[(K, U)]

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the type of the new values

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    f

    the mapping function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element

  22. def mapValuesWithKeyParallelUnordered[U](f: ((K, V)) ⇒ U, numThreads: Int): RDD[(K, U)]

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1.

    Similar to mapValuesWithKey on a pair RDD, but uses a thread pool if numThreads is >1. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the type of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

  23. def toString(): String
    Definition Classes
    Any

Inherited from AnyVal

Inherited from Any

Ungrouped