2024: Domino Containers - The Next Step. News from the Domino Container commu...
Poster: AVSS 2012
1. Background Subtraction for Real-time Video Analytics Based on
Multi-hypothesis Mixture-of-Gaussians
Mahfuzul Haque and Manzur Murshed
Gippsland School of Information Technology, Monash University, Victoria 3842, Australia
Email: {Mahfuzul.Haque, Manzur.Murshed}@monash.edu
1 Abstract
Robust background subtraction (BS) is essential for high quality foreground detection in most video analytics systems. Recent
BS techniques achieve superior detection quality mostly by exploiting the complementary strengths of multiple background
models. Consequently, these techniques fail to meet the operational requirements of real-time video analytics. The proposed
BS technique, named multi-hypothesis mixture-of-Gaussians (MH-MOG), maintains a single background model based on
perception-aware mixture-of-Gaussians and then, generates multiple detection hypotheses with different processing bases.
Finally, only during the detection stage, the complementary strengths of the hypotheses are exploited to achieve superior
detection quality without significant computational overhead.
2 Dynamic Background Subtraction
3 Multiple Detection Hypotheses for Superior Detection Quality
Detection Decision
Detection Decision
Background
Model
Video Frame
Hypothesis1
Hypothesis2
…
Hypothesisn
Model1
Model2
…
Modeln
Foreground Mask
Hypothesis1
Hypothesis2
Hypothesisn
Model
Proposed
Conventional
Dynamic background subtraction is an essential precursor in
most video analytics systems for moving foreground detection.
The quality of foreground detection directly impacts the
performance of subsequent processing tasks.
…
To achieve superior detection quality conventional approaches use the complementary strengths of multiple
detection hypotheses that are originated from different background models while the proposed technique
uses a single underlying background model to generate complementary detection hypotheses.
4 The Proposed Background Subtraction Technique (MH-MOG)
MH-MOG
Background
Model
Perception Inspired
Detection Hypothesis
Confidence Level for
Detection Hypothesis
Detection Algorithm
Probabilistic
Detection Hypothesis
Confidence Level for
Detection Hypothesis
High Quality
Foreground Mask
Incoming
Video Stream
In the proposed background subtraction technique, a single background model is maintained based on observed video frames. Then based on this background model two
independent detection hypotheses (e.g., perception inspired and probabilistic) are generated. For both hypotheses, associated confidence levels are computed based on
spatial detection results in the corresponding hypothesis space. Finally, all these information is used by the detection algorithm to produce high quality foreground mask by
maximising the complementary strengths of both hypotheses [1].
6 Perception inspired detection hypothesis
5 Background Modelling
Observed intensity value: m
Mean: µ
Standard deviation: σ
Weight: ω
P(x)
Intensity
The background of the operating environment is modelled at
pixel-level by maintaining at most N observed intensity values
(m1, m2, …, mN). For each sample, associated Gaussian
variables (µ, σ, and ω) are maintained to determine the order
of the samples based on observation frequency.
8 Experiments
0
m1
m2
Intensity
m3
255
A confidence interval is determined for each believed-to-bebackground intensity value based on the characteristics of
human visual system in perceiving noticeable intensity
deviation from background (Weber’s Law). Observed
intensity values are classified as background based on their
membership in any background confidence interval [2].
9 Quantitative Evaluation
Quantitative comparison: This figure shows overall (ALL),
dataset-wise (PETS, WF, UCF, IBM, CAV, VSSN), and sequenceclass-wise (SR, MM, LC) performance comparisons.
Unlike perception inspired hypothesis, no subset
of samples is chosen as background for intensity
comparison. Rather a probabilistic formulation
involving all Gaussian components is used [3] for
background/foreground
classification.
This
hypothesis shows higher foreground sensitivity
and thus recovers missing foreground regions
due to intensity thresholding by the perception
inspired hypothesis.
10 Visual Comparison
First Frame
More than 50 test sequences
were selected from eight
different datasets including
PETS,
Wallflower,
IBM,
VSSN06, CAVIAR, and UCF
and categorised in following
classes
based
on
the
characteristics of the operating
environments: low contrast
foreground (LC), shadows and
reflections (SR), multi-modal
background
(MM),
indoor
(INDOOR),
and
outdoor
(OUTDOOR).
7 Probabilistic detection hypothesis
Test Frame
Ground Truth MOG (S&G)
MOG (Lee)
ViBe
MH-MOG
MOG (S&G) – TPAMI 2000, MOG (Lee) – TPAMI 2005, ViBe – TIP 2011, and MH-MOG – Proposed.
[1] M. Haque and M. Murshed, Background Subtraction for Real-time Video Analytics Based on Multi-hypothesis Mixture-of-Gaussians, IEEE International
Conference on Advanced Video and Signal Based Surveillance (AVSS), Beijing, China, 2012.
[2] M. Haque and M. Murshed, Robust Background Subtraction Based on Perceptual Mixture-of-Gaussians with Dynamic Adaptation Speed, IEEE International
Workshop on Advances in Automated Multimedia Surveillance for Public Safety, Melbourne, Australia, 2012.
[3] D.-S. Lee. Effective Gaussian mixture learning for video background subtraction, IEEE TPAMI, 27(5):827– 832, 2005.
The images shown in the header has been taken from http://www.informationliberation.com