class Publisher extends AnyRef
Publishes payloads, in the form of an RDD of com.here.platform.data.processing.catalog.Partition.Key and com.here.platform.data.processing.blobstore.Payload, to the Blob API and returns RDD of com.here.platform.data.processing.catalog.Partition.Commit to be committed to the Metadata API.
- Note
both the full snapshot and the incremental one are implemented here the difference is mainly in the filtering strategy adopted
,metadata related to the published payloads need to be committed in a subsequent operation
- Alphabetic
- By Inheritance
- Publisher
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
-
new
Publisher(uploader: Uploader, numThreads: Int, uniquePartitionLimitInBytes: Int, statistics: Option[Map[Id, PublisherStats]] = None)
Creates a new publisher.
Creates a new publisher.
- uploader
The com.here.platform.data.processing.blobstore.Uploader object.
- numThreads
The number of uploads to perform in parallel.
- statistics
The optional statistics counters.
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
- val dataPublisher: DataPublisher
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
equals(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] )
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
def
publishFullSnapshot(current: RDD[(Key, Meta)], candidates: RDD[(Key, Option[Payload])]): RDD[(Key, Commit)]
Publishes a full snapshot of data.
Publishes a full snapshot of data.
- current
The RDD containing the catalog's current status.
- candidates
the RDD containing the candidate operations to upload.
- returns
the metadata RDD that were already published but not yet committed.
- Note
Candidate operations are potential operations that can be done, depending on the status of the catalog. These operations depend on whether there is duplicated content or not, as duplicated content do not have to be published. This function deletes content in the current RDD that is not present in the candidates' RDD.
-
def
publishIncrementalSnapshot(current: RDD[(Key, Meta)], candidates: RDD[(Key, Option[Payload])]): RDD[(Key, Commit)]
Publishes an increment snapshot of data.
Publishes an increment snapshot of data.
- current
The RDD containing the catalog's current status.
- candidates
the RDD containing the candidate operations to upload.
- returns
the metadata RDD that were already published but not yet committed.
- Note
Candidate operations are potential operations that can be done, depending on the status of the catalog. These operations depend on whether there is duplicated content or not, as duplicated content do not have to be published.
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
toString(): String
- Definition Classes
- AnyRef → Any
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()