com.here.platform.data.processing.spark.partitioner

LocalityAwarePartitioner

case class LocalityAwarePartitioner(numPartitions: Int, level: Int) extends PartitionNamePartitioner with Product with Serializable

Implements a Partitioner for com.here.platform.data.processing.catalog.Partition.Key that is aware of the geographic location of the keys and can therefore put keys that are close to each other in the same Spark partition. This increases data locality and speeds up processing on the Spark worker nodes.

The partitioner detects which com.here.platform.data.processing.catalog.Partition.Key instances are actually HERE tiles. These keys are grouped at a fixed quadtree level, which is generally higher than the level of the keys (that is, a lower level number). Keys that are at a level even higher than the one specified, and keys that are not HERE tiles, are partitioned by their hash code and spread uniformly across all available partitions. Keys partitioned this way do not benefit from data locality.

numPartitions

The overall number of partitions.

level

The level by which to group HereTiles in the same Spark partition.
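
The grouping rule described above can be illustrated with a small standalone sketch. It is not the library's implementation. It assumes that a HERE tile ID is a quadkey stored as a Long with a leading sentinel bit, that any name parsing as a positive decimal Long is a tile ID, and that a group is mapped to a partition by hashing the ancestor tile ID. The names LocalitySketch, tileLevel, ancestorAt, asTileId and partitionFor are hypothetical.

```scala
// Illustrative sketch only; not the library's implementation.
object LocalitySketch {
  // Assumption: a tile at level n is a quadkey with a leading sentinel bit,
  // so its ID has 2 * n + 1 significant bits.
  def tileLevel(tileId: Long): Int =
    (63 - java.lang.Long.numberOfLeadingZeros(tileId)) / 2

  // Ancestor of `tileId` at the coarser `level`: drop two bits per level.
  def ancestorAt(tileId: Long, level: Int): Long =
    tileId >> (2 * (tileLevel(tileId) - level))

  // Assumption: names that parse as a positive decimal Long are tile IDs.
  def asTileId(name: String): Option[Long] =
    scala.util.Try(name.toLong).toOption.filter(_ > 0)

  // Modulo that never returns a negative partition index.
  private def nonNegativeMod(x: Int, mod: Int): Int = {
    val r = x % mod
    if (r < 0) r + mod else r
  }

  def partitionFor(name: String, numPartitions: Int, level: Int): Int =
    asTileId(name) match {
      // Tiles at or below the grouping level share the partition of their ancestor.
      case Some(id) if tileLevel(id) >= level =>
        nonNegativeMod(ancestorAt(id, level).hashCode, numPartitions)
      // Coarser tiles and non-tile names fall back to plain hash partitioning.
      case _ =>
        nonNegativeMod(name.hashCode, numPartitions)
    }
}
```

Under these assumptions, the level-12 tiles 23618402 and 23618403 share the same level-10 ancestor, so partitionFor returns the same partition for both. A name such as "metadata" is spread by its hash code alone.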

Linear Supertypes
Product, Equals, PartitionNamePartitioner, Partitioner[KeyOrName], org.apache.spark.Partitioner, Serializable, Serializable, AnyRef, Any

Instance Constructors

  1. new LocalityAwarePartitioner(numPartitions: Int, level: Int)

    numPartitions

    The overall number of partitions.

    level

    The level by which to group HereTiles in the same Spark partition.

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  8. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  9. def getPartition(key: Any): Int

    Implements the Spark org.apache.spark.Partitioner interface by forwarding the calls to getPartitionForKey.

    If the object passed is not of type K and cannot be converted to it (for example, java.lang.Integer to Int), an IllegalArgumentException is thrown. This indicates a bug and should not happen, because the processing library uses a Partitioner of type K only for RDDs whose keys are known to be of type K.

    The function does no work of its own beyond forwarding to getPartitionForKey. Its purpose is to provide a type-safe Partitioner in the processing library.

    key

    the key for which the partition must be calculated

    returns

    the partition, identified by a scala.Int, in which the key should be located

    Definition Classes
    Partitioner → Partitioner
    Note

    This is called by Spark and should not be called by developer's code, as it may be unsafe.

  10. final def getPartitionForKey(key: KeyOrName): Int

    Gets the partition for a given key of type K. This is the function that must be implemented by child partitioners.

    key

    the key for which the partition must be calculated

    returns

    the partition, identified by a scala.Int, in which the key should be located

    Definition Classes
    PartitionNamePartitioner → Partitioner
  11. def getPartitionForName(name: Name): Int
  12. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  13. val level: Int
  14. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  15. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  16. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  17. val numPartitions: Int
    Definition Classes
    LocalityAwarePartitioner → Partitioner
  18. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  19. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  20. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  21. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
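
The relationship between getPartition and getPartitionForKey documented above can be sketched without Spark on the classpath. The sketch below is an assumption-laden illustration, not the library's code. UntypedPartitioner stands in for org.apache.spark.Partitioner, and TypedPartitioner and NameHashPartitioner are hypothetical names. It shows an untyped entry point that checks the key type, forwards to a typed method, and throws IllegalArgumentException on a mismatch.

```scala
import scala.reflect.ClassTag

// Stand-in for org.apache.spark.Partitioner so the sketch runs without Spark.
trait UntypedPartitioner {
  def numPartitions: Int
  def getPartition(key: Any): Int
}

// Hypothetical typed layer: subclasses implement only the typed method.
abstract class TypedPartitioner[K](implicit tag: ClassTag[K]) extends UntypedPartitioner {
  def getPartitionForKey(key: K): Int

  // Untyped entry point: verify the key type, then forward.
  final def getPartition(key: Any): Int = tag.unapply(key) match {
    case Some(k) => getPartitionForKey(k)
    case None =>
      val actual = Option(key).map(_.getClass.getName).getOrElse("null")
      throw new IllegalArgumentException(
        s"Key of type $actual is not a ${tag.runtimeClass.getName}")
  }
}

// Hypothetical child partitioner that spreads String names by hash code.
final class NameHashPartitioner(val numPartitions: Int) extends TypedPartitioner[String] {
  def getPartitionForKey(key: String): Int = {
    val r = key.hashCode % numPartitions
    if (r < 0) r + numPartitions else r
  }
}
```

With this layout, code that knows the key type calls getPartitionForKey directly and gets compile-time checking. Only the framework goes through the untyped getPartition, which is why a type mismatch there is treated as a bug rather than an expected condition.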
