2008-01-28

fanf: (Default)

While writing up my notes on Bloom filters, I wanted a better idea of the relationship between the false positive rate and the size multiplier.

sm= size / pop
pz= exp(−nhpop / size)
= exp(−nh / sm)
fpr= (1 − pz) nh

Given a fixed number of hash functions, what size multiplier do we need to make enough space in the Bloom filter to produce the desired false positive rate? How does the size multiplier change as nh and fpr change? I wrote a program to solve this numerically, and the output is below. The entries for higher fprs show quite nicely that there's an optimal nh for a given fpr.

big table )

February 2026

S M T W T F S
1234567
891011121314
1516 1718192021
2223 2425262728

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated 2026-03-07 19:31
Powered by Dreamwidth Studios