Packages

trait MapGroupCompiler[T] extends InputLayers with InputOptPartitioner with CompileInFn[T] with OutputLayers with OutputOptPartitioner with CompileOutFn[T]

Compiler to implement a generic Map-Reduce pattern, where the reduce function is group-by. The front-end compiler each input partition and produces the list of output partition that this input affect, each with a value of custom type. Values are then grouped per output partition and passed to the back-end that produces the output map.

This pattern is a more general version of Direct1ToNCompiler and DirectMToNCompiler where not only a M:N input/output relationship is supported, but this relationship is function of the input payloads, so the input content.

This pattern, however, compiles input partitions standalone, meaning that compiling one input partition sees data and metadata of that partition only. In case it is needed to lookup information from additional input partition in the front-end, please refer to RefTreeCompiler.

T

the custom type of the values passed between front-end and back-end

Note

the implementation must be scala.Serializable as this is copied to workers and run inside Spark map functions

,

This is a Java friendly version of com.here.platform.data.processing.compiler.MapGroupCompiler.

See also

extended interfaces for more details

Linear Supertypes
CompileOutFn[T], OutputOptPartitioner, OutputLayers, CompileInFn[T], Serializable, Serializable, InputOptPartitioner, InputLayers, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. MapGroupCompiler
  2. CompileOutFn
  3. OutputOptPartitioner
  4. OutputLayers
  5. CompileInFn
  6. Serializable
  7. Serializable
  8. InputOptPartitioner
  9. InputLayers
  10. AnyRef
  11. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Abstract Value Members

  1. abstract def compileInFn(in: Java.Pair[InKey, InMeta]): Iterable[Java.Pair[OutKey, T]]

    Calculates the dependent output partitions and intermediate results from a single input partition.

    Calculates the dependent output partitions and intermediate results from a single input partition.

    in

    the input partition to process

    returns

    all the impacted output partitions OutKey and intermediate data of type T for this partition. It may contain more than one element per output key. compileOutFn will only be called for outKeys which have at least one intermediate value from this phase. Other outKeys will be automatically deleted.

    Definition Classes
    CompileInFn
  2. abstract def inLayers: Map[String, Set[String]]

    Layers of the input catalogs that should be queried and provided to the compiler, grouped by input catalog and identified by catalog id and layer ID.

    Layers of the input catalogs that should be queried and provided to the compiler, grouped by input catalog and identified by catalog id and layer ID.

    Definition Classes
    InputLayers
  3. abstract def inPartitioner(parallelism: Int): Option[PartitionerOfKey]

    If the returned com.here.platform.data.processing.java.Java.Option is Empty, the Executor will use a default partitioner, if it is defined then the given partitioner will be applied when querying the input catalogs.

    If the returned com.here.platform.data.processing.java.Java.Option is Empty, the Executor will use a default partitioner, if it is defined then the given partitioner will be applied when querying the input catalogs.

    parallelism

    the parallelism of the partitioner

    returns

    the input optional partitioner with the given parallelism

    Definition Classes
    InputOptPartitioner
  4. abstract def outLayers: Set[String]

    Layers that are expected to be produced by the compiler.

    Layers that are expected to be produced by the compiler.

    Definition Classes
    OutputLayers
  5. abstract def outPartitioner(parallelism: Int): Option[PartitionerOfKey]

    If the returned com.here.platform.data.processing.java.Java.Option is Empty, the Executor will use a default partitioner, if it is defined then the given partitioner will be applied when querying the output catalog and producing data to be published.

    If the returned com.here.platform.data.processing.java.Java.Option is Empty, the Executor will use a default partitioner, if it is defined then the given partitioner will be applied when querying the output catalog and producing data to be published.

    parallelism

    the parallelism of the partitioner

    returns

    the output optional partitioner with the given parallelism

    Definition Classes
    OutputOptPartitioner

Concrete Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  8. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  9. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  10. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  11. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  12. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  13. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  14. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  15. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  16. def toString(): String
    Definition Classes
    AnyRef → Any
  17. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  18. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  19. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()

Inherited from CompileOutFn[T]

Inherited from OutputOptPartitioner

Inherited from OutputLayers

Inherited from CompileInFn[T]

Inherited from Serializable

Inherited from Serializable

Inherited from InputOptPartitioner

Inherited from InputLayers

Inherited from AnyRef

Inherited from Any

Ungrouped