object BloomFilter extends ApproxFilterCompanion with Serializable
Type Members
- type Filter[T] = BloomFilter[T]
Type of the ApproxFilter implementation.
- Definition Classes
- BloomFilter → ApproxFilterCompanion
- type Hash[T] = Funnel[T]
Type of the hashing function for ApproxFilter elements, e.g. Guava Funnel or Algebird Hash128.
- Definition Classes
- BloomFilter → ApproxFilterCompanion
Value Members
- final def !=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def ##: Int
- Definition Classes
- AnyRef → Any
- final def ==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def asInstanceOf[T0]: T0
- Definition Classes
- Any
- def clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.CloneNotSupportedException]) @native()
- final def create[T](elems: SCollection[T], expectedInsertions: Long, fpp: Double)(implicit arg0: Hash[T]): SCollection[Filter[T]]
Creates an ApproxFilter from an SCollection with the expected number of insertions and expected false positive probability.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
- Definition Classes
- ApproxFilterCompanion
- final def create[T](elems: SCollection[T], expectedInsertions: Long)(implicit arg0: Hash[T]): SCollection[Filter[T]]
Creates an ApproxFilter from an SCollection with the expected number of insertions and the default fpp of 0.03.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
- Definition Classes
- ApproxFilterCompanion
- final def create[T](elems: SCollection[T])(implicit arg0: Hash[T]): SCollection[Filter[T]]
Creates an ApproxFilter from an SCollection with the collection size as expectedInsertions and the default fpp of 0.03.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
- Definition Classes
- ApproxFilterCompanion
- final def create[T](elems: Iterable[T], expectedInsertions: Long, fpp: Double)(implicit arg0: Hash[T]): Filter[T]
Creates an ApproxFilter from an Iterable with the expected number of insertions and expected false positive probability.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
- Definition Classes
- ApproxFilterCompanion
- final def create[T](elems: Iterable[T], expectedInsertions: Long)(implicit arg0: Hash[T]): Filter[T]
Creates an ApproxFilter from an Iterable with the expected number of insertions and the default fpp of 0.03.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
- Definition Classes
- ApproxFilterCompanion
- final def create[T](elems: Iterable[T])(implicit arg0: Hash[T]): Filter[T]
Creates an ApproxFilter from an Iterable with the collection size as expectedInsertions and the default fpp of 0.03.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
- Definition Classes
- ApproxFilterCompanion
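The saturation warning repeated in the create overloads can be made concrete with the textbook Bloom filter formulas. The sketch below is not Scio or Guava code: it assumes the standard sizing of m = -n ln p / (ln 2)^2 bits and k = round((m / n) ln 2) hash functions, and the names BloomSaturation, optimalBits, optimalHashes, and estimatedFpp are hypothetical.

```scala
// Hypothetical helper, not part of Scio: textbook Bloom filter estimates.
object BloomSaturation {
  private val Ln2 = math.log(2.0)

  /** Bits needed for n expected insertions at target false positive probability p. */
  def optimalBits(n: Long, p: Double): Long =
    math.ceil(-n * math.log(p) / (Ln2 * Ln2)).toLong

  /** Number of hash functions that minimizes the fpp for m bits and n insertions. */
  def optimalHashes(n: Long, m: Long): Int =
    math.max(1, math.round(m.toDouble / n * Ln2).toInt)

  /** Approximate fpp after `inserted` distinct elements: (1 - e^(-k * inserted / m))^k. */
  def estimatedFpp(inserted: Long, m: Long, k: Int): Double =
    math.pow(1.0 - math.exp(-k.toDouble * inserted / m), k)

  def main(args: Array[String]): Unit = {
    val expected = 1000000L
    val m = optimalBits(expected, 0.03)
    val k = optimalHashes(expected, m)
    // Size the filter for `expected` elements, then overfill it by a growing factor.
    for (factor <- Seq(1, 2, 5, 10)) {
      val fpp = estimatedFpp(expected * factor, m, k)
      println(f"$factor%2dx expected insertions -> estimated fpp $fpp%.4f")
    }
  }
}
```

Under these assumptions, inserting twice the expected number of elements already pushes the estimated false positive rate far above the 0.03 target, which is why expectedInsertions should be close to the real number of unique elements.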
- def createImpl[T](elems: Iterable[T], expectedInsertions: Long, fpp: Double)(implicit arg0: Hash[T]): Filter[T]
- Attributes
- protected
- Definition Classes
- BloomFilter → ApproxFilterCompanion
- final def createSideInput[T](elems: SCollection[T], expectedInsertions: Long, fpp: Double)(implicit arg0: Hash[T]): SideInput[Filter[T]]
Creates a SideInput[ApproxFilter] from an SCollection with the expected number of insertions and expected false positive probability.
The expectedInsertions should be approximately the number of unique elements in the SCollection.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
Since this results in a single filter as a SideInput, take care that the size of the filter does not exceed the runner's recommended maximum size for side inputs (100 MB for Dataflow). This implies that expectedInsertions should not exceed 112 million with an fpp of 0.03 on Dataflow.
- Definition Classes
- ApproxFilterCompanion
- final def createSideInput[T](elems: SCollection[T], expectedInsertions: Long)(implicit arg0: Hash[T]): SideInput[Filter[T]]
Creates a SideInput[ApproxFilter] from an SCollection with the expected number of insertions and the default false positive probability of 0.03.
The expectedInsertions should be approximately the number of unique elements in the SCollection.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
Since this results in a single filter as a SideInput, take care that the size of the filter does not exceed the runner's recommended maximum size for side inputs (100 MB for Dataflow). This implies that expectedInsertions should not exceed 112 million with an fpp of 0.03 on Dataflow.
- Definition Classes
- ApproxFilterCompanion
- final def createSideInput[T](elems: SCollection[T])(implicit arg0: Hash[T]): SideInput[Filter[T]]
Creates a SideInput[ApproxFilter] from an SCollection with the collection size as expectedInsertions and a false positive probability of 0.03.
Note that overflowing an ApproxFilter with significantly more elements than specified will saturate it and sharply degrade its false positive probability.
Since this results in a single filter as a SideInput, take care that the size of the filter does not exceed the runner's recommended maximum size for side inputs (100 MB for Dataflow). This implies that expectedInsertions should not exceed 112 million with an fpp of 0.03 on Dataflow.
- Definition Classes
- ApproxFilterCompanion
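The side input size limit quoted for createSideInput can be sanity-checked with the textbook Bloom filter sizing formula. This is a rough sketch, not Scio's sizing code: it assumes the filter occupies about -n ln p / (ln 2)^2 bits and ignores any serialization overhead, and SideInputBudget and maxInsertions are hypothetical names.

```scala
// Hypothetical helper, not part of Scio: estimates how many insertions fit in a byte budget.
object SideInputBudget {
  /** Largest n whose ideal filter fits in maxBytes at false positive probability p. */
  def maxInsertions(maxBytes: Long, p: Double): Long = {
    val ln2 = math.log(2.0)
    val bitsPerElement = -math.log(p) / (ln2 * ln2)
    math.floor(maxBytes * 8.0 / bitsPerElement).toLong
  }

  def main(args: Array[String]): Unit = {
    // "100 MB" read two ways: decimal megabytes and binary mebibytes.
    val decimalMb = maxInsertions(100L * 1000 * 1000, 0.03)
    val binaryMb = maxInsertions(100L * 1024 * 1024, 0.03)
    println(s"100 MB (10^6 bytes each): about $decimalMb insertions")
    println(s"100 MiB (2^20 bytes each): about $binaryMb insertions")
  }
}
```

Under these assumptions both estimates fall between 109 and 115 million insertions, bracketing the documented 112 million. A stricter fpp needs more bits per element, so the insertion budget shrinks accordingly.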
- final def eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- def equals(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef → Any
- implicit def filterCoder[T](implicit arg0: Hash[T]): Coder[Filter[T]]
Coder for the ApproxFilter implementation.
Note that Hash should be supplied at compile time and not serialized since it might not have deterministic serialization.
- Definition Classes
- BloomFilter → ApproxFilterCompanion
- def finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.Throwable])
- final def getClass(): Class[_ <: AnyRef]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- def hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- final def isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- final def ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- final def notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def synchronized[T0](arg0: => T0): T0
- Definition Classes
- AnyRef
- def toString(): String
- Definition Classes
- AnyRef → Any
- final def wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException]) @native()