Packages

implicit final class MapPartitionWrapper[V] extends AnyVal

Maps partitions of RDDs in parallel.

Linear Supertypes
AnyVal, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. MapPartitionWrapper
  2. AnyVal
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new MapPartitionWrapper(rdd: RDD[V])

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    Any
  2. final def ##(): Int
    Definition Classes
    Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def getClass(): Class[_ <: AnyVal]
    Definition Classes
    AnyVal → Any
  6. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  7. def mapParallel[U](numThreads: Int)(f: (V) ⇒ U)(implicit arg0: ClassTag[U], vt: ClassTag[V]): RDD[U]

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel.

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel.

    U

    the type of the new values

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    f

    the mapping function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element

    Note

    calling this function loses partitioning for key-value pair RDD

  8. def mapParallel[U](f: (V) ⇒ U, numThreads: Int)(implicit arg0: ClassTag[U], vt: ClassTag[V]): RDD[U]

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel.

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel.

    U

    the type of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

    Note

    calling this function loses partitioning for key-value pair RDD

  9. def mapParallelUnordered[U](numThreads: Int)(f: (V) ⇒ U)(implicit arg0: ClassTag[U], vt: ClassTag[V]): RDD[U]

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel.

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the type of the new values

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    f

    the mapping function, providing the new value for the element

    returns

    an RDD with mapping function applied to each element

    Note

    calling this function loses partitioning for key-value pair RDD

  10. def mapParallelUnordered[U](f: (V) ⇒ U, numThreads: Int)(implicit arg0: ClassTag[U], vt: ClassTag[V]): RDD[U]

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel.

    Applies map to an RDD, but it can use a thread pool if numThreads is >1, in that case partitions are computed in parallel. Elements are emitted as soon as they are ready, therefore it is possible that the transformed elements in each Spark partition are not in the same order as in the upstream RDD.

    U

    the type of the new values

    f

    the mapping function, providing the new value for the element

    numThreads

    the number of threads to use for parallel invocations of f(). Can only be set to a value larger than 1 if the operations include IO operations, or if multiple cores are available per executor

    returns

    an RDD with mapping function applied to each element

    Note

    calling this function loses partitioning for key-value pair RDD

  11. def toString(): String
    Definition Classes
    Any

Inherited from AnyVal

Inherited from Any

Ungrouped