package spark
Type Members
- case class Cluster[GeolocatedItem](events: Seq[GeolocatedItem])(implicit evidence$1: GeoCoordinateOperations[GeolocatedItem]) extends Product with Serializable
A cluster of elements.
- events
the events belonging to the cluster
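The `evidence$1` parameter in the signature above is the name the Scala compiler generates for a context bound, so the class was most likely declared as `Cluster[GeolocatedItem: GeoCoordinateOperations]`. The sketch below shows the same pattern with stand-in names: `HasPosition`, `Sighting` and `Group` are hypothetical and are not part of this package, and the members of the real `GeoCoordinateOperations` are not shown on this page.

```scala
object ContextBoundSketch {
  // Hypothetical type class standing in for GeoCoordinateOperations.
  trait HasPosition[T] {
    def x(t: T): Double
    def y(t: T): Double
  }

  // Hypothetical event type with an instance in its companion object,
  // where implicit search finds it without an import.
  final case class Sighting(x: Double, y: Double)
  object Sighting {
    implicit val position: HasPosition[Sighting] = new HasPosition[Sighting] {
      def x(s: Sighting): Double = s.x
      def y(s: Sighting): Double = s.y
    }
  }

  // `[T: HasPosition]` desugars to a second parameter list
  // `(implicit evidence$1: HasPosition[T])`, as in the Cluster signature.
  final case class Group[T: HasPosition](events: Seq[T]) {
    private val ops = implicitly[HasPosition[T]]
    def centerX: Double = events.map(e => ops.x(e)).sum / events.size
    def centerY: Double = events.map(e => ops.y(e)).sum / events.size
  }
}
```

Under these assumptions, `Group(Seq(Sighting(1.0, 2.0)))` compiles only because an implicit `HasPosition[Sighting]` is in scope; constructing a `Cluster` presumably requires an implicit `GeoCoordinateOperations` instance for the event type in the same way.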
- class DistributedClustering[GeolocatedItem] extends Serializable
Performs distributed clustering on a given collection of GeolocatedItem events.
The clustering is performed by running a DBSCAN algorithm on each tile, extended by a buffer zone of the specified size. Clusters whose center lies in the buffer zone are rejected, because they are included in the output of a different tile.
Limitations:
- Due to the distribution scheme, this implementation returns incorrect results if there are clusters larger than the buffer zone.
- DBSCAN identifies clusters by density, so the algorithm works best if all clusters have approximately the same spatial density.
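The tile-and-buffer scheme described above can be sketched in plain Scala without Spark. This is a minimal illustration and not the library's implementation: `Pt`, `Tile`, `dbscan`, `centroid` and `clusterTile` are hypothetical names, coordinates are treated as planar, the cluster center is assumed to be the centroid, and the DBSCAN is a naive O(n²) version.

```scala
object TiledDbscanSketch {
  final case class Pt(x: Double, y: Double)

  // Axis-aligned tile covering [minX, maxX) x [minY, maxY).
  final case class Tile(minX: Double, minY: Double, maxX: Double, maxY: Double) {
    def contains(p: Pt): Boolean =
      p.x >= minX && p.x < maxX && p.y >= minY && p.y < maxY
    def expand(buffer: Double): Tile =
      Tile(minX - buffer, minY - buffer, maxX + buffer, maxY + buffer)
  }

  private def dist(a: Pt, b: Pt): Double = math.hypot(a.x - b.x, a.y - b.y)

  // Naive DBSCAN: returns the clusters found, noise points are dropped.
  // A point counts as its own neighbour, so minPts includes the point itself.
  def dbscan(points: Vector[Pt], eps: Double, minPts: Int): Vector[Vector[Pt]] = {
    val n = points.length
    val neighbours = Vector.tabulate(n) { i =>
      (0 until n).filter(j => dist(points(i), points(j)) <= eps).toVector
    }
    val label = Array.fill(n)(-1) // -1 means unassigned (or noise at the end)
    var next = 0
    for (i <- 0 until n if label(i) == -1 && neighbours(i).size >= minPts) {
      label(i) = next
      var frontier = List(i)
      while (frontier.nonEmpty) {
        val cur = frontier.head
        frontier = frontier.tail
        // Only core points spread the cluster label to their neighbours.
        if (neighbours(cur).size >= minPts)
          for (j <- neighbours(cur) if label(j) == -1) {
            label(j) = next
            frontier = j :: frontier
          }
      }
      next += 1
    }
    (0 until next).toVector.map { c =>
      (0 until n).filter(i => label(i) == c).map(i => points(i)).toVector
    }
  }

  def centroid(c: Vector[Pt]): Pt =
    Pt(c.map(_.x).sum / c.size, c.map(_.y).sum / c.size)

  // Cluster the tile plus its buffer, then keep only the clusters whose
  // center falls inside the tile proper; the rest belong to other tiles.
  def clusterTile(all: Vector[Pt], tile: Tile, buffer: Double,
                  eps: Double, minPts: Int): Vector[Vector[Pt]] = {
    val buffered = tile.expand(buffer)
    dbscan(all.filter(p => buffered.contains(p)), eps, minPts)
      .filter(c => tile.contains(centroid(c)))
  }
}
```

In this sketch, a cluster that straddles a tile boundary is seen in full by both neighbouring tiles as long as the buffer is wider than the cluster, and only the tile that contains its center reports it. When the buffer is narrower than the cluster, each tile sees only a fragment and may report it as a separate cluster, which corresponds to the first limitation listed above.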
Value Members
- object DistributedClustering extends Serializable