M.Sc. Thesis - Automatic People Counting in Crowded Scenes

Automatic People Counting
in Crowded Scenes
Menoufia University
Faculty of Computers and Information
Information Technology Department
By
Ahmed F. Gad
ahmed.fawzy@ci.menofia.edu.eg
Supervised By
Prof. Khalid M. Amin
Dr. Ahmed M. Hamad
15 August 2018

Index
2
• Introduction
• Problem Definition
• Related Work
• Proposed Method & Experimental Results
• Decrease Prediction Error
• Testing using Overcrowded Scene
• Feature Reduction
• Decrease Computation Complexity
• Conclusion
• Publications
• References

Motivation
3
Difficult to analyze manually
Krausz, Barbara, and Christian Bauckhage. "Loveparade 2010: Automatic video analysis of a crowd
disaster." Computer Vision and Image Understanding 116.3 (2012): 307-319.

Crowd Counting Approaches
Detection-Based Crowd Counting
4

5
Test
Classifier

6
Holistic Partial
Test
Classifier

Limitations
7

Limitations
8
Occlusion
Overcrowded
Scenes

Loy, Chen Change, et al. "Crowd counting and profiling: Methodology and evaluation." Modeling, Simulation and
Visual Analysis of Crowds. Springer, New York, NY, 2013. 347-382.
Crowd Density Estimation
9
▰ Solves the requirements to detect and track objects.
▰ Counting based on groups not individuals.
Scene Features
Count
X
Y

Applications
10
https://www.eco-compteur.com/en/solutions/pedestrian-monitoring
http://crowdsize.com/
https://play.google.com/store/apps/details?id=com.efendioglu.counter&hl=en
People
Counter

Index
11
• Introduction
• Related Work
• Conclusion
• Publications
• References

Problem Definition
▰ Predicting the people count in a scene is not straight forward.
Count

Problem Definition
13
Count
Feature Mining
Texture Edge

Problem Definition
14
Count
Feature Mining
Texture Edge
Crowd
Levels
Non-Linearity

Problem Definition
15
Count
Feature Mining
Texture Edge
Crowd
Levels
Same
Size
Non-Linearity

Problem Definition
16
Count
Feature Mining
Texture Edge
Crowd
Levels
Same
Size
Scale
Non-Linearity

Problem Definition
17
Count
Feature Mining
Texture Edge
Crowd
Levels
Same
Size
Scale
Non-Linearity
SR Properties
Variations
Capacity

Problem Definition
Perspective Distortion
▰ Identical objects seems
different at different
distances from the camera
due to perspective
distortion
18

Index
19
• Introduction
• Related Work
• Conclusion
• Publications
• References

Related Work
Pixel Count
20Ma, Ruihua, et al. "On pixel count based crowd density estimation for visual surveillance.“ IEEE Conference
on Cybernetics and Intelligent Systems. Vol. 1. 2004.
Region Pixel Count
Pixel Count is not a good
feature to be used in
complex environments

Related Work
Texture & Edge Features
21
Segmented
Region
Texture
Edge
GLCM
HOG
Pixel Count
Chan, Antoni B., Zhang-Sheng John Liang, and Nuno Vasconcelos. "Privacy preserving crowd monitoring: Counting people
without people models or tracking.". IEEE Conference on Computer Vision and Pattern Recognition (CCPR). 2008.

Related Work
22
P, X
P

Related Work
23
P, X
P

Related Work
24
P, X
P
Error
12.997%

Related Work
25
Pixel Count
Texture - GLCM
Edge - HOG
Chen, Ke, et al. "Feature mining for localised crowd counting." BMVC. Vol. 1. No. 2. 2012.

Related Work
26
Pixel Count
Texture - GLCM
Edge - HOG
Error
17.96%
Chen, Ke, et al. "Feature mining for localised crowd counting." BMVC. Vol. 1. No. 2. 2012.

Related Work
KeyPoints
27Al-Zaydi, Zeyad QH, et al. "A robust multimedia surveillance system for people counting." Multimedia Tools and
Applications 76.22 (2017): 23777-23804.
KeyPoint
SIFT
FAST

Related Work
KeyPoints
28Al-Zaydi, Zeyad QH, et al. "A robust multimedia surveillance system for people counting." Multimedia Tools and
Applications 76.22 (2017): 23777-23804.
Error
14.11%
KeyPoint
SIFT
FAST

Index
29
• Introduction
• Related Work
• Conclusion
• Publications
• References

Proposed Method
Decrease Prediction Error
30
Training Testing

Feature Extraction
31
Segmented
Region
Texture Edge KeyPoint
GLCM LBP
SIFTHOG
Edge
Strength
Area Extent
Circularity
Scale Orientation
GLGCM

Feature Extraction
32
Segmented
Region
Texture Edge KeyPoint
GLCM LBP
SIFTHOG
Edge
Strength
Area Extent
Circularity
Scale Orientation
Feature Vector of 164 Elements
GLGCM

Regression Modelling
▰ Regression model maps independent variable (feature) to
some independent variables (people count)
33
Features Count
Regression Model
Independent Dependent
GPR
RF
RPF
LASSO
KNN
Ryan, David, et al. "An evaluation of crowd counting methods, features and regression models." Computer Vision
and Image Understanding 130 (2015): 1-17.
Loy, Chen Change, et al. "Crowd counting and profiling: Methodology and evaluation." Modeling, Simulation and
Visual Analysis of Crowds. Springer, New York, NY, (2013). 347-382.

UCSD Crowd Counting Dataset
34

35
Strong GT

36
Strong GT
8,000 Training 12,067 Testing
Core i7 – 16 GB RAM –
scikit-learn

1st Experiment Results
Discover the Best Regression Models
37

1st Experiment Results
Previous Works Comparison
38

2nd Experiment Results
Covering All Variations using Cross Validation
39
Problem
Random Sample
Selection only
Covered 33 Levels
Ground Truth
31
Before CV
22.49

40
Cross
Validation
Select Samples
from All Levels
Problem
Random Sample
Selection only
Covered 33 Levels
Ground Truth
31
Before CV
22.49
After CV
30.99
Solution

41
Cross
Validation
Select Samples
from All Levels
Problem
Random Sample
Selection only
Covered 33 Levels
Ground Truth
31
Before CV
22.49
After CV
30.99
Solution

Index
42
• Introduction
• Related Work
• Conclusion
• Publications
• References

Applying Proposed Method with Overcrowded Dataset
UCF Crowd Dataset – VERY CHALLENGING
43
Idrees, Haroon, et al. "Multi-source multi-scale counting in extremely dense crowd images." IEEE Conference on
Computer Vision and Pattern Recognition (CVPR). 2013.

UCF Crowd Dataset – VERY CHALLENGING
44
50 Images
Idrees, Haroon, et al. "Multi-source multi-scale counting in extremely dense crowd images." IEEE Conference on
Computer Vision and Pattern Recognition (CVPR). 2013.
40 Training
10 Testing
MAE : 338.41
Error Percent : 26.45%

UCF Crowd Dataset
45
Regression

UCF Crowd Dataset
46
Regression
2015:2016
Deep CNN

UCF Crowd Dataset
47
2017
Deep CNN
Regression
2015:2016
Deep CNN

Marathon Crowd Dataset
48
Ali, Saad, and Mubarak Shah. "Floor fields for tracking in high density crowd scenes." European conference on
computer vision. Springer, Berlin, Heidelberg, 2008.

Marathon Crowd Dataset
49
492
Images
Ali, Saad, and Mubarak Shah. "Floor fields for tracking in high density crowd scenes." European conference on
computer vision. Springer, Berlin, Heidelberg, 2008.
350 Training
142 Testing
MAE : 13.88
Error Percent : 3.79%

Index
50
• Introduction
• Related Work
• Conclusion
• Publications
• References

Proposed Method + Feature Reduction
5151
Training Testing

Proposed Method + Feature Reduction
5252
Training Testing

Feature Reduction
53
Reduction Techniques
Filter Wrapper Embedded
Keep Good Features &
Remove Bad Ones
(Irrelevant & Correlated)
Chandrashekar, Girish, and Ferat Sahin. "A survey on feature selection methods." Computers & Electrical
Engineering 40.1 (2014): 16-28.

Feature Reduction
54
Reduction Techniques
Filter Wrapper Embedded
Keep Good Features &
Remove Bad Ones
(Irrelevant & Correlated)
Chandrashekar, Girish, and Ferat Sahin. "A survey on feature selection methods." Computers & Electrical
Engineering 40.1 (2014): 16-28.
𝑪𝒐𝒔𝒕 𝑾 =
𝒊=𝟏
𝑵
(𝒚𝒊 −
𝒋=𝟎
𝑴
𝒘𝒊 𝒙𝒊𝒋) 𝟐 + 𝝀
𝒋=𝟎
𝑴
|𝒘𝒋|
LASSO

Feature Reduction
Increase Model Capacity
55

Feature Reduction
Increase Model Capacity
56
All Features &
Less Samples
Less Features &
More Samples

Index
57
• Introduction
• Related Work
• Conclusion
• Publications
• References

Proposed Method + Feature Tracking
58
Frame i-1Frame i

Proposed Method + Feature Tracking
59
Frame i-1Frame i
Matching Metrics
Spatial
Top Left
Corner (X, Y)
Width
Height
Texture
LBP

Feature Tracking
Computational Time
▰ 85.12% of the time consumed to extract features is saved (i.e.
we have not to call the FE for 85.12% of the total regions).
63

Feature Tracking
FPS
64
𝑺𝑹/𝑭𝒓𝒂𝒎𝒆 =
𝑻𝒐𝒕𝒂𝒍 # 𝒐𝒇 𝑺𝑹𝒔
𝑻𝒐𝒕𝒂𝒍 # 𝒐𝒇 𝑭𝒓𝒂𝒎𝒆𝒔

Feature Tracking
FPS
65
𝑻𝒊𝒎𝒆/𝑭𝒓𝒂𝒎𝒆 = 𝑺𝑹/𝑭𝒓𝒂𝒎𝒆 ∗ 𝑺𝑹 𝑻𝒊𝒎𝒆

Feature Tracking
FPS
66
𝑻𝒊𝒎𝒆/𝑭𝒓𝒂𝒎𝒆 = 𝑺𝑹/𝑭𝒓𝒂𝒎𝒆 ∗ 𝑺𝑹 𝑻𝒊𝒎𝒆 𝑭𝑷𝑺 =
𝟏
𝑻𝒊𝒎𝒆/𝑭𝒓𝒂𝒎𝒆

Feature Tracking
Prediction Error Percent with Tracking
67

Feature Tracking
Prediction Error Percent with Tracking
68
No Track
Track

Index
69
• Introduction
• Related Work
• Conclusion
• Publications
• References

Conclusion
▰ This work proposed a technique for crowd density estimation based multiple
features.
▰ Less Prediction Error Compared to Previous Works using All Features.
▰ Enhanced Results using Cross Validation.
▰ Accuracy Proved by using Different Datasets.
▰ Increasing Model Capacity after Feature Reduction.
▰ Reduced Computational Time using Feature Tracking.
70

Index
71
• Introduction
• Related Work
• Conclusion
• Publications
• References

Publications
▰ A. Gad, A. Hamad, K. Amin. "Crowd Density Estimation Using
Multiple Features Categories and Multiple Regression
Models." 12th IEEE International Conference on Computer
Engineering & Systems (ICCES), pp. 430-435, Dec. 2017.
▰ Estimating People Count in Crowded Scenes Using Multiple
Features Categories and Multiple Regression Models. Pattern
Analysis and Applications Journal, Springer, Under Review.
▰ Time-Efficient Crowd Density Estimation using Feature Tracking.
Prepared for submission.
72

Index
73
• Introduction
• Related Work
• Conclusion
• Publications
• References

References
▰ C. C. Loy, K. Chen, S. Gong, and T. Xiang, "Crowd counting and profiling: Methodology and evaluation," Modeling, Simulation and Visual Analysis of
Crowds,Springer, pp. 347-382, 2013.
▰ W. Zhen, L. Mao, and Z. Yuan, "Analysis of trample disaster and a case study–Mihong bridge fatality in China in 2004," Safety Science, vol. 46, pp.
1255-1270, 2008.
▰ D. Helbing, A. Johansson, and H. Z. Al-Abideen, "Dynamics of crowd disasters: An empirical study," Physical review E, vol. 75, p. 046109, 2007.
▰ B. Krausz and C. Bauckhage, "Loveparade 2010: Automatic video analysis of a crowd disaster," Computer Vision and Image Understanding, vol. 116,
pp. 307-319, 2012.
▰ B. Wu and R. Nevatia, "Detection and tracking of multiple, partially occluded humans by bayesian combination of edgelet based part detectors,"
International Journal of Computer Vision, vol. 75, pp. 247-266, 2007.
▰ D. Ryan, S. Denman, S. Sridharan, and C. Fookes, "An evaluation of crowd counting methods, features and regression models," Computer Vision and
Image Understanding, vol. 130, pp. 1-17, 2015.
▰ A. B. Chan, Z.-S. J. Liang, and N. Vasconcelos, "Privacy preserving crowd monitoring: Counting people without people models or tracking,". IEEE
Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1-7, 2008.
▰ A. B. Chan and N. Vasconcelos, "Counting people with low-level features and Bayesian regression," IEEE Transactions on Image Processing, vol. 21,
pp. 2160-2177, 2012.
▰ L. Dong, V. Parameswaran, V. Ramesh, and I. Zoghlami, "Fast crowd segmentation using shape indexing,". IEEE 11th International Conference on
Computer Vision (ICCV), pp. 1-8, 2007.
▰ Z. Q. Al-Zaydi, D. L. Ndzi, M. L. Kamarudin, A. Zakaria, and A. Y. Shakaff, "A robust multimedia surveillance system for people counting," Multimedia
Tools and Applications, pp. 1-28, 2016.
74

References
75
▰ R. Liang, Y. Zhu, and H. Wang, "Counting crowd flow based on feature points," Neurocomputing, vol. 133, pp. 377-384, 2014.
▰ D. G. Lowe, "Distinctive image features from scale-invariant keypoints," International journal of computer vision, vol. 60, pp. 91-110, 2004.
▰ K. Chen, C. C. Loy, S. Gong, and T. Xiang, "Feature Mining for Localised Crowd Counting," BMVC, p. 3, 2012.
▰ B. Xu and G. Qiu, "Crowd density estimation based on rich features and random projection forest,"IEEE Winter Conference on Applications of
Computer Vision (WACV), pp. 1-8, 2016.
▰ D. Kong, D. Gray, and H. Tao, "A viewpoint invariant approach for crowd counting," 18th International Conference on in Pattern Recognition (ICPR).
pp. 1187-1190, 2006.
▰ Zeng, Xinchuan, and Tony R. Martinez. "Distributed-balanced stratified cross-validation for accuracy estimation." Journal of Experimental &
Theoretical Artificial Intelligence vol. 12, pp. 1-12, 2000.
▰ Ojala, Timo, Matti Pietikainen, and Topi Maenpaa. "Multiresolution gray-scale and rotation invariant texture classification with local binary patterns."
IEEE Transactions on pattern analysis and machine intelligence, vol. 24, pp. 971-987, 2002.
▰ S. L. Kukreja, J. Löfberg, and M. J. Brenner, "A least absolute shrinkage and selection operator (LASSO) for nonlinear system identification," IFAC
Proceedings Volumes, vol. 39, pp. 814-819, 2006.
▰ D. Kang, D. Dhar, and A. B. Chan, "Crowd Counting by Adapting Convolutional Neural Networks with Side Information," arXiv preprint
arXiv:1611.06748, 2016.
▰ C. Zhang, H. Li, X. Wang, and X. Yang, "Cross-scene crowd counting via deep convolutional neural networks," IEEE Conference on Computer Vision
and Pattern Recognition, pp. 833-841, 2015.

M.Sc. Thesis - Automatic People Counting in Crowded Scenes

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie M.Sc. Thesis - Automatic People Counting in Crowded Scenes

Ähnlich wie M.Sc. Thesis - Automatic People Counting in Crowded Scenes (20)

Mehr von Ahmed Gad

Mehr von Ahmed Gad (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

M.Sc. Thesis - Automatic People Counting in Crowded Scenes