case class LocalityAwarePartitioner(numPartitions: Int, level: Int) extends PartitionNamePartitioner with Product with Serializable
Implements a Partitioner for com.here.platform.data.processing.catalog.Partition.Key that is aware of the geographic location of the keys and can therefore put keys that are close to each other in the same Spark partition. This increases the data locality and speeds up the processing of Spark worker nodes.
The partitioner detects which com.here.platform.data.processing.catalog.Partition.Keys are actually HereTiles: in this case keys are grouped at a fixed quadtree level, that is generally higher that the level of the keys (= lower number). Keys that are at a level even higher that the one specified, or keys that are not HERE tiles are partitioned using their hashcode and spread uniformly across all the available partitions. These partitions do not have data locality.
- numPartitions
The overall number of partitions.
- level
The level by which to group HereTiles in the same Spark partition.
- Alphabetic
- By Inheritance
- LocalityAwarePartitioner
- Product
- Equals
- PartitionNamePartitioner
- Partitioner
- Partitioner
- Serializable
- Serializable
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
-
new
LocalityAwarePartitioner(numPartitions: Int, level: Int)
- numPartitions
The overall number of partitions.
- level
The level by which to group HereTiles in the same Spark partition.
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] )
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
getPartition(key: Any): Int
Implements the Spark org.apache.spark.Partitioner interface by forwarding the calls to getPartitionForKey.
Implements the Spark org.apache.spark.Partitioner interface by forwarding the calls to getPartitionForKey.
If the object passed is not of type
K
or can't be converted to it (e.g. java.lang.Integer to Int), a IllegalArgumentException is thrown. This should be considered a bug that should not happen because the processing library uses Partitioner of typeK
only for RDDs for which it is aware and sure to have keys of typeK
.Basically, this function is a no-op call that forwards to getPartitionForKey, but the important point here is to have a type-safe Partitioner in the processing library.
- key
the key for which the partition must be calculated
- returns
the partition, identified by one scala.Int, in which the key should be located
- Definition Classes
- Partitioner → Partitioner
- Note
This is called by Spark and should not be called by developer's code, as it may be unsafe.
-
final
def
getPartitionForKey(key: KeyOrName): Int
Gets the partition for a given key of type
K
.Gets the partition for a given key of type
K
. This is the function that must be implemented by children partitioners.- key
the key for which the partition must be calculated
- returns
the partition, identified by one scala.Int, in which the key should be located
- Definition Classes
- PartitionNamePartitioner → Partitioner
-
def
getPartitionForName(name: Name): Int
- Definition Classes
- LocalityAwarePartitioner → PartitionNamePartitioner
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- val level: Int
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
val
numPartitions: Int
- Definition Classes
- LocalityAwarePartitioner → Partitioner
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()