A New Analysis of the False Positive Rate of a Bloom Filter

Christensen, K.; Roginsky, Allen; Jimeno, M.

doi:https://doi.org/10.1016/j.ipl.2010.07.024

Journal Article

A New Analysis of the False Positive Rate of a Bloom Filter

Published: October 15, 2010
Citation: Information Processing Letters vol. 110, no. 21, (October 15, 2010) pp. 944-949

Author(s)

K. Christensen, Allen Roginsky, M. Jimeno

Announcement

Abstract

A Bloom filter is a space-efficient data structure used for probabilistic set membership testing. When testing an object for set membership, a Bloom filter may give a false positive. The analysis of the false positive rate is a key to understanding the Bloom filter and applications that use it. We show experimentally that the classic analysis for false positive rate is wrong. We formally derive a correct formula using a balls-and-bins model and show how to numerically compute the new, correct formula in a stable manner. We also prove that the new formula always results in a predicted greater false positive rate than the classic formula. This correct formula is numerically compared to the classic formula for relative error – for a small Bloom filter the prediction of false positive rate will be in error when the classic formula is used.

Keywords

Bloom filter; data structures; analysis of algorithms

Control Families

None selected

Documentation

Publication:
Journal Article (DOI)

Supplemental Material:
None available

Document History:
10/15/10: Journal Article (Final)

Information Technology Laboratory

Computer Security Resource Center

Computer Security Resource Center

Journal Article

A New Analysis of the False Positive Rate of a Bloom Filter

Author(s)

Announcement

Abstract

Keywords

Control Families

Documentation