package zetasketch
- Source
- package.scala
Type Members
- sealed trait HllPlus[T] extends Serializable
- final class ZetaSketchHll[T] extends Serializable
A wrapper for the internal HyperLogLogPlusPlus implementation.
- case class ZetaSketchHllPlusPlus[T](p: Int = HllCount.DEFAULT_PRECISION)(implicit zs: ZetaSketchable[T]) extends ApproxDistinctCounter[T] with Product with Serializable
com.spotify.scio.estimators.ApproxDistinctCounter implementation for org.apache.beam.sdk.extensions.zetasketch.HllCount. HllCount estimates the distinct count using HyperLogLogPlusPlus (HLL++) sketches on data streams, based on the ZetaSketch implementation.
The HyperLogLog++ (HLL++) algorithm estimates the number of distinct values in a data stream. HLL++ is based on HyperLogLog and estimates the number of distinct values more accurately for both very large and small data streams.
- p
Precision; controls the accuracy of the estimate. The precision value affects the number of buckets used to store information about the distinct elements. It should be in the range [10, 24]; the default precision value is 15.
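To make the effect of the precision parameter concrete, the following is a minimal sketch of how p relates to the bucket count and to the expected error. It assumes the standard HyperLogLog relationship of 2^p buckets and the classic 1.04 / sqrt(m) standard-error approximation; the exact error characteristics of ZetaSketch may differ. The object `PrecisionSketch` and its methods are hypothetical helpers for illustration only and are not part of Scio, Beam, or ZetaSketch.

```scala
// Illustrative only: these helpers are NOT part of Scio, Beam, or ZetaSketch.
object PrecisionSketch {
  // Valid precision range as stated in the documentation above.
  val MinPrecision = 10
  val MaxPrecision = 24

  // Number of buckets (registers) for precision p, assuming m = 2^p.
  def buckets(p: Int): Int = {
    require(
      p >= MinPrecision && p <= MaxPrecision,
      s"precision must be in [$MinPrecision, $MaxPrecision], got $p"
    )
    1 << p
  }

  // Classic HyperLogLog standard-error approximation, 1.04 / sqrt(m).
  // This is an assumption about the general algorithm, not a ZetaSketch guarantee.
  def approxStdError(p: Int): Double =
    1.04 / math.sqrt(buckets(p).toDouble)
}
```

Under these assumptions, the default precision of 15 corresponds to 32768 buckets and a relative standard error of roughly 0.6%. Each increment of p doubles the bucket count, which reduces the approximate error by a factor of about 1.41.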
- sealed trait ZetaSketchable[T] extends AnyRef
- implicit final class PairedZetaSCollection[K, V] extends AnyRef
- Definition Classes
- SCollectionSyntax
- implicit final class ZetaSCollection[T] extends AnyRef
- Definition Classes
- SCollectionSyntax
- implicit final class ZetaSketchHllSCollection[T] extends AnyRef
- Definition Classes
- SCollectionSyntax
- implicit final class ZetaSketchHllSCollectionKV[K, V] extends AnyRef
- Definition Classes
- SCollectionSyntax
Value Members
- object HllPlus extends Serializable
- object ZetaSketchHll extends Serializable
- object ZetaSketchable