Packages

trait Stateless[K, V] extends AnyRef

Groups transformations for a DeltaSet[K, V] that do not require storing metadata in the output catalog, to enable incremental processing.

K

The type of the DeltaSet's keys.

V

The type of the DeltaSet's values.

Linear Supertypes
AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. Stateless
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Abstract Value Members

  1. abstract val autoIdAssigner: BaseSetIdAssigner

    The ID assigner used to automatically assign DeltaSet IDs if not provided.

    The ID assigner used to automatically assign DeltaSet IDs if not provided.

    Attributes
    protected
  2. abstract def disjointUnion[X >: V, W <: X](other: DeltaSet[K, W], id: Id = autoIdAssigner("disjointUnion"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: ClassTag[X], arg1: ClassTag[W]): DeltaSet[K, X]

    Computes the disjoint union of this DeltaSet with another DeltaSet.

    Computes the disjoint union of this DeltaSet with another DeltaSet. If a key is present in both DeltaSets, the operation will throw an exception.

    This function does not shuffle any data any data.

    Both DeltaSets must be partitioned by the same partitioner. If they are not, you must explicitly repartition one or both DeltaSets.

    W

    The value type of the other DeltaSet.

    other

    The other DeltaSet.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet that represents the result of the union.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  3. abstract def filterByKey(filterFn: (K) ⇒ Boolean, id: Id = autoIdAssigner("filterByKey"), configOverride: Override = DeltaSetConfig.noOverride): DeltaSet[K, V]

    Selects all key-value pairs with keys that satisfy a predicate.

    Selects all key-value pairs with keys that satisfy a predicate.

    This function does not shuffle any data any data.

    filterFn

    The predicate to test the keys.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet containing all key-value pairs with keys that satisfy the predicate.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  4. abstract def flatMapKeys[K2](flatMapFn: OneToMany[K, K2], partitioning: PartitioningStrategy[K2], id: Id = autoIdAssigner("flatMapKeys"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: Ordering[K2], arg1: ClassTag[K2]): DeltaSet[K2, V]

    Applies a 1-to-n function to the keys of all key-value pairs in this DeltaSet and repartitions the result.

    Applies a 1-to-n function to the keys of all key-value pairs in this DeltaSet and repartitions the result.

    If the partitioning is not PreservesPartitioning, this transformation will shuffle data. In particular, all key-value pairs that belong to a different partition after the key transformation may have to be transferred between executors.

    K2

    The type of output keys.

    flatMapFn

    The 1-to-n function used to transform the keys.

    partitioning

    Either a partitioner to use, or PreservesPartitioning to indicate that the mapped keys are always in the same Spark partition as the original keys. This will be checked at run-time.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet representing the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  5. abstract def flatMapKeysGroup[K2](flatMapFn: ManyToMany[K, K2], partitioning: PartitioningStrategy[K2], id: Id = autoIdAssigner("flatMapKeysGroup"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: Ordering[K2], arg1: ClassTag[K2]): DeltaSet[K2, Iterable[V]]

    Applies an m-to-n function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then groups all output key-value pairs by key.

    Applies an m-to-n function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then groups all output key-value pairs by key.

    If the partitioning is not PreservesPartitioning, this transformation will shuffle data. In particular, all key-value pairs that belong to a different partition after the key transformation may have to be transferred between executors.

    K2

    The type of output keys.

    flatMapFn

    The m-to-n function used to transform the keys.

    partitioning

    Either a partitioner to use, or PreservesPartitioning to indicate that the mapped keys are always in the same Spark partition as the original keys. This will be checked at run-time.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet representing the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  6. abstract def flatMapKeysReduce[K2](flatMapFn: ManyToMany[K, K2], reduceFn: (V, V) ⇒ V, partitioning: PartitioningStrategy[K2], id: Id = autoIdAssigner("flatMapKeysReduce"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: Ordering[K2], arg1: ClassTag[K2]): DeltaSet[K2, V]

    Applies an m-to-n function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then reduces all values with the same key to a single value.

    Applies an m-to-n function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then reduces all values with the same key to a single value.

    If the partitioning is not PreservesPartitioning, this transformation will shuffle data. In particular, all key-value pairs that belong to a different partition after the key transformation may have to be transferred between executors.

    K2

    The type of output keys.

    flatMapFn

    The m-to-n function used to transform the keys.

    reduceFn

    The associative and commutative reduce function.

    partitioning

    Either a partitioner to use, or PreservesPartitioning to indicate that the mapped keys are always in the same Spark partition as the original keys. This will be checked at run-time.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet representing the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  7. abstract def leftOuterJoin[W](other: DeltaSet[K, W], id: Id = autoIdAssigner("leftOuterJoin"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: ClassTag[W]): DeltaSet[K, (V, Option[W])]

    Computes the left outer join of this DeltaSet with another DeltaSet.

    Computes the left outer join of this DeltaSet with another DeltaSet. For each key contained in this DeltaSet, the result contains the pair of values associated with the key in each of the DeltaSets. If a key is not associated with a value in the other DeltaSets, the entry in the pair is set to None.

    This function does not shuffle any data any data.

    Both DeltaSets must be partitioned by the same partitioner. If they are not, you must explicitly repartition one or both DeltaSets.

    W

    The value type of the other DeltaSet.

    other

    The other DeltaSet.

    id

    The ID of the DeltaSet, that identifies it in configuration files and logs, and for storing metadata in the output catalog. The ID must be unique among all DeltaSets created through the same DeltaContext. By default, the ID is automatically assigned.

    configOverride

    Overrides the default configuration of the transformation. By default, nothing is overridden. Settings passed in this parameter can, in turn, be overridden by DeltaSet-specific settings in the configuration file.

    returns

    A DeltaSet that represents the result of the left outer join.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  8. abstract def mapKeys[K2](mapFn: OneToOne[K, K2], partitioning: PartitioningStrategy[K2], id: Id = autoIdAssigner("mapKeys"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: Ordering[K2], arg1: ClassTag[K2]): DeltaSet[K2, V]

    Applies a 1-to-1 function to the keys of all key-value pairs in this DeltaSet and repartitions the result.

    Applies a 1-to-1 function to the keys of all key-value pairs in this DeltaSet and repartitions the result.

    If the partitioning is not PreservesPartitioning, this transformation will shuffle data. In particular, all key-value pairs that belong to a different partition after the key transformation may have to be transferred between executors.

    K2

    The type of output keys.

    mapFn

    The 1-to-1 function used to transform the keys.

    partitioning

    Either a partitioner to use, or PreservesPartitioning to indicate that the mapped keys are always in the same Spark partition as the original keys. This will be checked at run-time.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet representing the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  9. abstract def mapKeysGroup[K2](mapFn: ManyToOne[K, K2], partitioning: PartitioningStrategy[K2], id: Id = autoIdAssigner("mapKeysGroup"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: Ordering[K2], arg1: ClassTag[K2]): DeltaSet[K2, Iterable[V]]

    Applies an m-to-1 function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then groups all output key-value pairs by key.

    Applies an m-to-1 function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then groups all output key-value pairs by key.

    If the partitioning is not PreservesPartitioning, this transformation will shuffle data. In particular, all key-value pairs that belong to a different partition after the key transformation may have to be transferred between executors.

    K2

    The type of output keys.

    mapFn

    The m-to-1 function used to transform the keys.

    partitioning

    Either a partitioner to use, or PreservesPartitioning to indicate that the mapped keys are always in the same Spark partition as the original keys. This will be checked at run-time.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet representing the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  10. abstract def mapKeysReduce[K2](mapFn: ManyToOne[K, K2], reduceFn: (V, V) ⇒ V, partitioning: PartitioningStrategy[K2], id: Id = autoIdAssigner("mapKeysReduce"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: Ordering[K2], arg1: ClassTag[K2]): DeltaSet[K2, V]

    Applies an m-to-1 function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then reduces all values with the same key to a single value.

    Applies an m-to-1 function to the keys of all key-value pairs in this DeltaSet, repartitions the result, and then reduces all values with the same key to a single value.

    If the partitioning is not PreservesPartitioning, this transformation will shuffle data. In particular, all key-value pairs that belong to a different partition after the key transformation may have to be transferred between executors.

    K2

    The type of output keys.

    mapFn

    The m-to-1 function used to transform the keys.

    reduceFn

    The associative and commutative reduce function.

    partitioning

    Either a partitioner to use, or PreservesPartitioning to indicate that the mapped keys are always in the same Spark partition as the original keys. This will be checked at run-time.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet representing the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  11. abstract def mapValues[V2](mapFn: (V) ⇒ V2, id: Id = autoIdAssigner("mapValues"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: ClassTag[V2]): DeltaSet[K, V2]

    Applies a function to the values of all key-value pairs in this DeltaSet.

    Applies a function to the values of all key-value pairs in this DeltaSet.

    This function does not shuffle any data any data.

    V2

    The type of output values.

    mapFn

    The function applied to the values of all key-value pairs in this DeltaSet.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet that represents the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  12. abstract def mapValuesWithKey[V2](mapFn: (K, V) ⇒ V2, id: Id = autoIdAssigner("mapValuesWithKey"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: ClassTag[V2]): DeltaSet[K, V2]

    Applies a function to the values of all key-value pairs of this DeltaSet.

    Applies a function to the values of all key-value pairs of this DeltaSet.

    This function does not shuffle any data any data.

    V2

    The type of the output values.

    mapFn

    The function applied to the values of all key-value pairs in this DeltaSet.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet that represents the result of the transformation.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  13. abstract def outerJoin[W](other: DeltaSet[K, W], id: Id = autoIdAssigner("outerJoin"), configOverride: Override = DeltaSetConfig.noOverride)(implicit arg0: ClassTag[W]): DeltaSet[K, (Option[V], Option[W])]

    Computes the outer join of this DeltaSet with another DeltaSet.

    Computes the outer join of this DeltaSet with another DeltaSet. For each key contained in either DeltaSet, the result contains the pair of values associated with the key in each of the DeltaSets. If a key is not associated with values in one of the DeltaSets, the entry in the pair is set to None.

    This function does not shuffle any data any data.

    Both DeltaSets must be partitioned by the same partitioner. If they are not, you must explicitly repartition one or both DeltaSets.

    W

    The value type of the other DeltaSet.

    other

    The other DeltaSet.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet that represents the result of the join.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  14. abstract def persist(storageLevel: StorageLevel, id: Id = autoIdAssigner("persist"), configOverride: Override = DeltaSetConfig.noOverride): DeltaSet[K, V]

    Persists the data in the DeltaSet to memory or disk, depending on the provided org.apache.spark.storage.StorageLevel.

    Persists the data in the DeltaSet to memory or disk, depending on the provided org.apache.spark.storage.StorageLevel.

    This function does not shuffle any data any data.

    storageLevel

    The storage level at which to persist the DeltaSet.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet that represents the result of the join.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

  15. abstract def repartition(partitioner: Partitioner[K], id: Id = autoIdAssigner("repartition"), configOverride: Override = DeltaSetConfig.noOverride): DeltaSet[K, V]

    Repartitions the data in the DeltaSet, using the given partitioner.

    Repartitions the data in the DeltaSet, using the given partitioner.

    This function may shuffle all the data, depending on the partitioner.

    partitioner

    The partitioner to apply.

    id

    Deprecated.

    configOverride

    Deprecated.

    returns

    A DeltaSet that represents the result of the join.

    Note

    All arguments must be serializable, since this transformation is serialized and sent to workers nodes.

Concrete Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  8. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  9. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  10. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  11. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  12. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  13. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  14. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  15. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  16. def toString(): String
    Definition Classes
    AnyRef → Any
  17. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  18. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  19. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()

Inherited from AnyRef

Inherited from Any

Ungrouped