fanf | 2008-01-28

While writing up my notes on Bloom filters, I wanted a better idea of the relationship between the false positive rate and the size multiplier.

sm  = size / pop

pz  = exp(−nh ∗ pop / size)
    = exp(−nh / sm)

fpr = (1 − pz) ^ nh
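As a quick sanity check, the formulas can be evaluated directly. This is only a sketch. It assumes that size is the number of bits in the filter, pop is the number of items stored, nh is the number of hash functions, and pz is the (approximate) probability that a given bit is still zero. The function name is made up for illustration.

```python
import math


def false_positive_rate(sm: float, nh: int) -> float:
    """Evaluate fpr = (1 - pz) ^ nh with pz = exp(-nh / sm).

    sm is the size multiplier (size / pop) and nh the number of hash
    functions; both names follow the formulas in the post.
    """
    if sm <= 0 or nh <= 0:
        raise ValueError("sm and nh must be positive")
    # -expm1(-x) is 1 - exp(-x), computed without losing precision
    # when x = nh / sm is very small (i.e. a very roomy filter).
    one_minus_pz = -math.expm1(-nh / sm)
    return one_minus_pz ** nh


if __name__ == "__main__":
    # Example: 10 bits per item with 7 hash functions.
    print(false_positive_rate(10.0, 7))
```

Making the filter bigger (larger sm) at a fixed nh always lowers the result, which is what lets the size multiplier be found by a simple search.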

Given a fixed number of hash functions, what size multiplier do we need to make enough space in the Bloom filter to produce the desired false positive rate? How does the size multiplier change as nh and fpr change? I wrote a program to solve this numerically, and the output is below. The entries for higher fprs show quite nicely that there's an optimal nh for a given fpr.
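The original program is not shown, so the following is a guess at the general approach rather than a reconstruction of it: a minimal bisection sketch that finds the sm giving a target fpr for a fixed nh. All function names are hypothetical. Since the fpr formula can also be inverted by hand, to sm = −nh / ln(1 − fpr ^ (1 / nh)), the closed form is included as a cross-check.

```python
import math


def fpr(sm: float, nh: int) -> float:
    # fpr = (1 - pz) ^ nh, pz = exp(-nh / sm), as in the formulas above.
    return (-math.expm1(-nh / sm)) ** nh


def size_multiplier(nh: int, target_fpr: float, tol: float = 1e-12) -> float:
    """Numerically find the sm at which fpr(sm, nh) equals target_fpr."""
    if nh <= 0 or not 0.0 < target_fpr < 1.0:
        raise ValueError("need nh > 0 and 0 < target_fpr < 1")
    # For fixed nh, fpr falls as sm grows, so double hi until the
    # target is bracketed, then bisect.
    lo, hi = 0.0, 1.0
    while fpr(hi, nh) > target_fpr:
        lo, hi = hi, hi * 2.0
    while hi - lo > tol * hi:
        mid = (lo + hi) / 2.0
        if fpr(mid, nh) > target_fpr:
            lo = mid  # filter still too small
        else:
            hi = mid  # big enough; try smaller
    return hi


def size_multiplier_closed_form(nh: int, target_fpr: float) -> float:
    # Algebraic inverse of the fpr formula, used here only as a check.
    return -nh / math.log1p(-(target_fpr ** (1.0 / nh)))


if __name__ == "__main__":
    # One row of a table like the one in the post: fixed fpr, varying nh.
    target = 0.01
    for nh in range(1, 13):
        print(nh, round(size_multiplier(nh, target), 3))
```

Printing a row like this for a fixed fpr shows the size multiplier first falling and then rising again as nh increases, which is the optimum mentioned above.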

( big table )

Tony Finch's blog

More about Bloom filters
