com.here.platform.data.processing.compiler

Direct1ToNCompiler

trait Direct1ToNCompiler[T] extends InputLayers with InputOptPartitioner with CompileInFn[T] with OutputLayers with OutputOptPartitioner with compiler.direct.CompileOutFn[T]

This compiler allows a stateless incremental compilation, in the case the keys of output partitions are a function of the input keys only, so where the input/output mapping does not depend on the input content but only on the input keys.

This compiler is applicable only when each output partition is affected by one single input partition. This is restriction enforced but allows a very efficient implementation of the incremental compilation case. The limitation is on the output side only: one input partition may be mapped to zero, one or more output partitions but each output partition can be mapped to by one input partition only, without overlaps.

In case multiple input partitions may affect the same output partition, the DirectMToNCompiler should be used instead.

T: the type of the values passed between front-end and back-end

Note: the implementation must be scala.Serializable as this is copied to workers and run inside Spark map functions
See also: traits mixed in for more details

Linear Supertypes

direct.CompileOutFn[T], OutputOptPartitioner, OutputLayers, CompileInFn[T], Serializable, InputOptPartitioner, InputLayers, AnyRef, Any

Known Subclasses

ConcreteDirect1ToNCompiler, WrapperDirect1ToNCompiler

Ordering

Alphabetic
By Inheritance

Inherited

Direct1ToNCompiler
CompileOutFn
OutputOptPartitioner
OutputLayers
CompileInFn
Serializable
InputOptPartitioner
InputLayers
AnyRef
Any

Hide All
Show All

Visibility

Public
Protected

Abstract Value Members

abstract def compileInFn(in: (InKey, InMeta)): T
Calculates the intermediate result from a single input partition.
Calculates the intermediate result from a single input partition. The result will be provided together with the input key in the com.here.platform.data.processing.compiler.direct.CompileOutFn.
in
the input partition to process
returns
the value of intermediate data of type T for this partition. This value will be passed in com.here.platform.data.processing.compiler.direct.CompileOutFn to all output keys impacted by the in partition.
Definition Classes
CompileInFn
abstract def compileOutFn(outKey: OutKey, intermediate: T): Option[Payload]
Compiles a single output partition from intermediate data of a single input partition.
Compiles a single output partition from intermediate data of a single input partition. This function will only be called for output keys that were returned in com.here.platform.data.processing.compiler.direct.CompileInFn.
Output keys which are no longer mapped to the mapping function calls, for example due to deleted input partitions in incremental processing, get deleted automatically.
outKey
Key of the output partition to generate
intermediate
Intermediate value calculated by com.here.platform.data.processing.compiler.direct.CompileInFn.compileInFn for the input partition that was mapped to this output key
returns
scala.Options of com.here.platform.data.processing.blobstore.Payload, with either the partition content, or None if there is no data to be published.
Definition Classes
CompileOutFn
abstract def inLayers: Map[Id, Set[Id]]
Represents layers of the input catalogs that you should query and provide to the compiler.
Represents layers of the input catalogs that you should query and provide to the compiler. These layers are grouped by input catalog and identified by catalog ID and layer ID.
Definition Classes
InputLayers
abstract def inPartitioner(parallelism: Int): Option[Partitioner[InKey]]
Specifies the partitioner to use when querying the input catalogs.
Specifies the partitioner to use when querying the input catalogs. If no partitioner is provided, by returning None from this function, then the Executor uses the default partitioner.
parallelism
The number of partitions the partitioner should partition the catalog into, this should match the parallelism of the Spark RDD containing the input partitions.
returns
The optional input partitioner with the parallelism specified.
Definition Classes
InputOptPartitioner
abstract def mappingFn(inKey: InKey): Iterable[OutKey]
Calculates which output partitions, if any, are affected by the given single input partition.
Calculates which output partitions, if any, are affected by the given single input partition. The mapping must be function of only the input com.here.platform.data.processing.compiler.InKey.
The metadata is intentionally not provided, because the result of this call cannot be function of the metadata and therefore the data. This is because the direct.CompileInFn implementation does not perform a stateful dependency-tracking for incremental compilation. If the function would use additional information from the tile content, it would break incremental compilation use case.
inKey
the input partition being mapped
returns
the output partitions that the given input partition maps to
Definition Classes
CompileInFn
abstract def outLayers: Set[Id]
Layers to be produced by the compiler.
Layers to be produced by the compiler.
Definition Classes
OutputLayers
abstract def outPartitioner(parallelism: Int): Option[Partitioner[OutKey]]
Specifies the partitioner to use when querying the output catalog and producing output data.
Specifies the partitioner to use when querying the output catalog and producing output data. If no partitioner is provided, by returning None from this function, then the Executor uses the default partitioner.
parallelism
The number of partitions the partitioner should partition the catalog into, this should match the parallelism of the Spark RDD containing the output partitions.
returns
The optional output partitioner with the parallelism specified.
Definition Classes
OutputOptPartitioner

Concrete Value Members

final def !=(arg0: Any): Boolean
Definition Classes
AnyRef → Any
final def ##: Int
Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean
Definition Classes
AnyRef → Any
final def asInstanceOf[T0]: T0
Definition Classes
Any
def clone(): AnyRef
Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws(classOf[java.lang.CloneNotSupportedException]) @IntrinsicCandidate() @native()
final def eq(arg0: AnyRef): Boolean
Definition Classes
AnyRef
def equals(arg0: AnyRef): Boolean
Definition Classes
AnyRef → Any
final def getClass(): Class[_ <: AnyRef]
Definition Classes
AnyRef → Any
Annotations
@IntrinsicCandidate() @native()
def hashCode(): Int
Definition Classes
AnyRef → Any
Annotations
@IntrinsicCandidate() @native()
final def isInstanceOf[T0]: Boolean
Definition Classes
Any
final def ne(arg0: AnyRef): Boolean
Definition Classes
AnyRef
final def notify(): Unit
Definition Classes
AnyRef
Annotations
@IntrinsicCandidate() @native()
final def notifyAll(): Unit
Definition Classes
AnyRef
Annotations
@IntrinsicCandidate() @native()
final val outCatalogId: Id
Identifier for the output catalog.
Identifier for the output catalog.
Definition Classes
OutputLayers
final def synchronized[T0](arg0: => T0): T0
Definition Classes
AnyRef
def toString(): String
Definition Classes
AnyRef → Any
final def wait(arg0: Long, arg1: Int): Unit
Definition Classes
AnyRef
Annotations
@throws(classOf[java.lang.InterruptedException])
final def wait(arg0: Long): Unit
Definition Classes
AnyRef
Annotations
@throws(classOf[java.lang.InterruptedException]) @native()
final def wait(): Unit
Definition Classes
AnyRef
Annotations
@throws(classOf[java.lang.InterruptedException])

Deprecated Value Members

def finalize(): Unit
Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws(classOf[java.lang.Throwable]) @Deprecated
Deprecated
(Since version 9)

Packages

Direct1ToNCompiler

trait Direct1ToNCompiler[T] extends InputLayers with InputOptPartitioner with CompileInFn[T] with OutputLayers with OutputOptPartitioner with compiler.direct.CompileOutFn[T]

Abstract Value Members

Concrete Value Members

Deprecated Value Members

Inherited from direct.CompileOutFn[T]

Inherited from OutputOptPartitioner

Inherited from OutputLayers

Inherited from CompileInFn[T]

Inherited from Serializable

Inherited from InputOptPartitioner

Inherited from InputLayers

Inherited from AnyRef

Inherited from Any

Ungrouped

Packages

Direct1ToNCompiler

trait Direct1ToNCompiler[T] extends InputLayers with InputOptPartitioner with CompileInFn[T] with OutputLayers with OutputOptPartitioner with compiler.direct.CompileOutFn[T]

Abstract Value Members

Concrete Value Members

Deprecated Value Members

Inherited from direct.CompileOutFn[T]

Inherited from OutputOptPartitioner

Inherited from OutputLayers

Inherited from CompileInFn[T]

Inherited from Serializable

Inherited from InputOptPartitioner

Inherited from InputLayers

Inherited from AnyRef

Inherited from Any

Ungrouped

Direct1ToNCompiler