Poster: AVSS 2012

Background Subtraction for Real-time Video Analytics Based on
Multi-hypothesis Mixture-of-Gaussians
Mahfuzul Haque and Manzur Murshed
Gippsland School of Information Technology, Monash University, Victoria 3842, Australia
Email: {Mahfuzul.Haque, Manzur.Murshed}@monash.edu
1 Abstract
Robust background subtraction (BS) is essential for high quality foreground detection in most video analytics systems. Recent
BS techniques achieve superior detection quality mostly by exploiting the complementary strengths of multiple background
models. Consequently, these techniques fail to meet the operational requirements of real-time video analytics. The proposed
BS technique, named multi-hypothesis mixture-of-Gaussians (MH-MOG), maintains a single background model based on
perception-aware mixture-of-Gaussians and then, generates multiple detection hypotheses with different processing bases.
Finally, only during the detection stage, the complementary strengths of the hypotheses are exploited to achieve superior
detection quality without significant computational overhead.

2 Dynamic Background Subtraction

3 Multiple Detection Hypotheses for Superior Detection Quality
Detection Decision

Detection Decision
Background
Model

Video Frame

Hypothesis1

Hypothesis2

…

Hypothesisn

Model1

Model2

…

Modeln

Foreground Mask

Hypothesis1

Hypothesis2

Hypothesisn

Model
Proposed

Conventional

Dynamic background subtraction is an essential precursor in
most video analytics systems for moving foreground detection.
The quality of foreground detection directly impacts the
performance of subsequent processing tasks.

…

To achieve superior detection quality conventional approaches use the complementary strengths of multiple
detection hypotheses that are originated from different background models while the proposed technique
uses a single underlying background model to generate complementary detection hypotheses.

4 The Proposed Background Subtraction Technique (MH-MOG)

MH-MOG
Background
Model

Perception Inspired
Detection Hypothesis

Confidence Level for

Detection Algorithm
Probabilistic

Confidence Level for
High Quality
Foreground Mask

Incoming
Video Stream

In the proposed background subtraction technique, a single background model is maintained based on observed video frames. Then based on this background model two
independent detection hypotheses (e.g., perception inspired and probabilistic) are generated. For both hypotheses, associated confidence levels are computed based on
spatial detection results in the corresponding hypothesis space. Finally, all these information is used by the detection algorithm to produce high quality foreground mask by
maximising the complementary strengths of both hypotheses [1].

6 Perception inspired detection hypothesis

5 Background Modelling
Observed intensity value: m
Mean: µ
Standard deviation: σ
Weight: ω

P(x)

Intensity

The background of the operating environment is modelled at
pixel-level by maintaining at most N observed intensity values
(m1, m2, …, mN). For each sample, associated Gaussian
variables (µ, σ, and ω) are maintained to determine the order
of the samples based on observation frequency.

8 Experiments

0

m1

m2
Intensity

m3

255

A confidence interval is determined for each believed-to-bebackground intensity value based on the characteristics of
human visual system in perceiving noticeable intensity
deviation from background (Weber’s Law). Observed
intensity values are classified as background based on their
membership in any background confidence interval [2].

9 Quantitative Evaluation

Quantitative comparison: This figure shows overall (ALL),
dataset-wise (PETS, WF, UCF, IBM, CAV, VSSN), and sequenceclass-wise (SR, MM, LC) performance comparisons.

Unlike perception inspired hypothesis, no subset
of samples is chosen as background for intensity
comparison. Rather a probabilistic formulation
involving all Gaussian components is used [3] for
background/foreground
classification.
This
hypothesis shows higher foreground sensitivity
and thus recovers missing foreground regions
due to intensity thresholding by the perception
inspired hypothesis.

10 Visual Comparison
First Frame

More than 50 test sequences
were selected from eight
different datasets including
PETS,
Wallflower,
IBM,
VSSN06, CAVIAR, and UCF
and categorised in following
classes
based
on
the
characteristics of the operating
environments: low contrast
foreground (LC), shadows and
reflections (SR), multi-modal
background
(MM),
indoor
(INDOOR),
and
outdoor
(OUTDOOR).

7 Probabilistic detection hypothesis

Test Frame

Ground Truth MOG (S&G)

MOG (Lee)

ViBe

MH-MOG

MOG (S&G) – TPAMI 2000, MOG (Lee) – TPAMI 2005, ViBe – TIP 2011, and MH-MOG – Proposed.

[1] M. Haque and M. Murshed, Background Subtraction for Real-time Video Analytics Based on Multi-hypothesis Mixture-of-Gaussians, IEEE International
Conference on Advanced Video and Signal Based Surveillance (AVSS), Beijing, China, 2012.
[2] M. Haque and M. Murshed, Robust Background Subtraction Based on Perceptual Mixture-of-Gaussians with Dynamic Adaptation Speed, IEEE International
Workshop on Advances in Automated Multimedia Surveillance for Public Safety, Melbourne, Australia, 2012.
[3] D.-S. Lee. Effective Gaussian mixture learning for video background subtraction, IEEE TPAMI, 27(5):827– 832, 2005.
The images shown in the header has been taken from http://www.informationliberation.com

Poster: AVSS 2012

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie Poster: AVSS 2012

Ähnlich wie Poster: AVSS 2012 (20)

Mehr von Mahfuzul Haque

Mehr von Mahfuzul Haque (16)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Poster: AVSS 2012