Packages

trait RefTreeCompiler[T] extends InputLayers with InputOptPartitioner with ResolveInFn with CompileInFn[T] with OutputLayers with OutputOptPartitioner with CompileOutFn[T]

A RefTreeCompiler allows full and incremental compilation with complex reference structures. A condition is that all references of a layer can be calculated purely from the source data of this layer using a resolve function, which just gets the meta data of one partition as an input. The structure of the references has to be predefined in a reftree.RefTree object.

Apart from the reference resolution pre-phase described above, the compilation itself is split in two phases, front-end and backend, as in other compilers, similar to MapGroupCompiler.

In the first phase, compileIn from reftree.CompileInFn is called for every partition, with the full list of meta data for all of its referenced partitions. This method returns one or more values of type T for every impacted output partition.

The compileIn function for the first phase of the compilation is defined in traits that extend reftree.CompileInFn. One of these need to be mixed in, like reftree.CompileInFnWithRefs or reftree.CompileInFnWithRefsReturnsReferences.

In the second phase, a method from one of the CompileOutFn traits is called for every output partition where the first phase provided at least one element of T. Elements coming from various input partitions are grouped together and provided as input of compilation for each output partition.

T

The custom type of the values passed between front-end and back-end

Note

The implementation must be scala.Serializable as this is copied to workers and run inside Spark map functions.

See also

traits mixed in for more details

Linear Supertypes
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. RefTreeCompiler
  2. CompileOutFn
  3. OutputOptPartitioner
  4. OutputLayers
  5. CompileInFn
  6. ResolveInFn
  7. Serializable
  8. Serializable
  9. InputOptPartitioner
  10. InputLayers
  11. AnyRef
  12. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Abstract Value Members

  1. abstract def compileInFnDefined(): Unit

    Must be overridden as final by all subclasses, to block the mixin of different interfaces in the same compiler class.

    Must be overridden as final by all subclasses, to block the mixin of different interfaces in the same compiler class.

    Definition Classes
    CompileInFn
  2. abstract def compileOutFnDefined(): Unit

    Must be overridden as final by all subclasses, to block the mixin of different interfaces in the same compiler class and to assure that at least one child interface is mixed in.

    Must be overridden as final by all subclasses, to block the mixin of different interfaces in the same compiler class and to assure that at least one child interface is mixed in.

    Attributes
    protected
    Definition Classes
    CompileOutFn
  3. abstract def inLayers: Map[Id, Set[Id]]

    Represents layers of the input catalogs that you should query and provide to the compiler.

    Represents layers of the input catalogs that you should query and provide to the compiler. These layers are grouped by input catalog and identified by catalog ID and layer ID.

    Definition Classes
    InputLayers
  4. abstract def inPartitioner(parallelism: Int): Option[Partitioner[InKey]]

    Specifies the partitioner to use when querying the input catalogs.

    Specifies the partitioner to use when querying the input catalogs. If no partitioner is provided, by returning None from this function, then the Executor uses the default partitioner.

    parallelism

    The number of partitions the partitioner should partition the catalog into, this should match the parallelism of the Spark RDD containing the input partitions.

    returns

    The optional input partitioner with the parallelism specified.

    Definition Classes
    InputOptPartitioner
  5. abstract def outLayers: Set[Id]

    Layers to be produced by the compiler.

    Layers to be produced by the compiler.

    Definition Classes
    OutputLayers
  6. abstract def outPartitioner(parallelism: Int): Option[Partitioner[OutKey]]

    Specifies the partitioner to use when querying the output catalog and producing output data.

    Specifies the partitioner to use when querying the output catalog and producing output data. If no partitioner is provided, by returning None from this function, then the Executor uses the default partitioner.

    parallelism

    The number of partitions the partitioner should partition the catalog into, this should match the parallelism of the Spark RDD containing the output partitions.

    returns

    The optional output partitioner with the parallelism specified.

    Definition Classes
    OutputOptPartitioner
  7. abstract def refStructure: RefTree

    The refStructure defines the maximum reference tree of the source layers.

    The refStructure defines the maximum reference tree of the source layers.

    See also

    com.here.platform.data.processing.compiler.reftree.RefTree for details.

  8. abstract def resolveFn(src: (InKey, InMeta)): Map[RefName, Set[InKey]]

    Resolves the list of partitions in all the reference target layers based on the a source partition.

    Resolves the list of partitions in all the reference target layers based on the a source partition. The returned com.here.platform.data.processing.compiler.reftree.RefTree.RefName values for the references must be aligned with the values in the com.here.platform.data.processing.compiler.RefTreeCompiler.refStructure.

    src

    the source partition key and metadata

    Definition Classes
    ResolveInFn

Concrete Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  8. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  9. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  10. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  11. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  12. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  13. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  14. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  15. final val outCatalogId: Id

    Identifier for the output catalog.

    Identifier for the output catalog.

    Definition Classes
    OutputLayers
  16. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  17. def toString(): String
    Definition Classes
    AnyRef → Any
  18. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  19. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  20. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()

Inherited from CompileOutFn[T]

Inherited from OutputOptPartitioner

Inherited from OutputLayers

Inherited from CompileInFn[T]

Inherited from ResolveInFn

Inherited from Serializable

Inherited from Serializable

Inherited from InputOptPartitioner

Inherited from InputLayers

Inherited from AnyRef

Inherited from Any

Ungrouped