A METRIC FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT FOR HD TV DELIVERY BASED ON SALIENCY MAPS
H. BOUJUT*, J. BENOIS-PINEAU*, T. AHMED*, O. HADAR** & P. BONNET***
*LaBRI UMR CNRS 5800, University of Bordeaux, France
**Communication Systems Engineering Dept., Ben Gurion University of the Negev, Israel
***AudematWorldCast Systems Group, France
ICME 2011 – Workshop on Hot Topics in Multimedia Delivery (HotMD’11), 2011-07-11
Overview
• Introduction
• Focus Of Attention and Saliency Maps
• Our approach: Weighted Macro Block Error Rate (WMBER) based on saliency maps, a no-reference video quality metric
• Prediction of subjective quality metrics from objective quality metrics
• Evaluation and results
• Conclusion and future work
Introduction
• Motivation
  • VQA for HD broadcast applications
  • Measure the influence of transmission loss on perceived quality
• Video quality assessment protocol
  • Full Reference (FR)
    • SSIM (Z. Wang, A. Bovik)
    • A novel perceptual metric for video compression (A. Bhat, I. Richardson), PCS’09
    • Evaluation of temporal variation of video quality in packet loss networks (C. Yim, A. C. Bovik), Image Communication 26 (2011)
  • Reduced Reference (RR)
    • A Convolutional Neural Network Approach for Objective Video Quality Assessment (P. Le Callet, C. Viard-Gaudin, D. Barba), IEEE Transactions on Neural Networks 17
  • No Reference (NR)
    • No-reference image and video quality estimation: Applications and human-motivated design (S. Hemami, A. Reibman), Image Communication 25 (2010)
• In this work: NR VQA with visual saliency in the H.264/AVC framework
• Contributions:
  • Visual saliency map during compression process
  • WMBER NR quality metric
  • Prediction of subjective quality metrics from objective quality metrics
Focus of Attention and Saliency maps
• The focus of attention (FOA) is mostly attracted by salient areas which stand out from the visual scene.
• The FOA shifts sequentially across the salient areas.
• Salient stimuli are mainly due to:
  • High color contrast
  • Motion
  • Edge orientation
[Figure: original frame and its saliency map, Tractor sequence (TUM/VQEG)]
Saliency maps (1/2)
• Several methods for saliency map extraction already exist in the literature.
• All methods work in the same way [O. Brouard, V. Ricordel and D. Barba, 2009], [S. Marat, et al., 2009]:
  • Extraction of the spatial saliency map (static pathway)
  • Extraction of the temporal saliency map (dynamic pathway)
  • Fusion of the spatial and the temporal saliency maps (fusion)
[Figure: temporal saliency map, spatial saliency map, spatio-temporal saliency map]
Saliency maps (2/2)
• In this work we re-used the saliency map extraction method published at IS&T Electronic Imaging 2011:
  • Based on the saliency map model from O. Brouard, V. Ricordel and D. Barba.
  • Uses partial decoding of the H.264 stream to reach real-time performance.
  • A fusion method to combine spatial and temporal saliency maps has been proposed.
• We propose a new fusion method.
Saliency map fusion (1/2)
• To compare with our proposed fusion method, we use the multiplication fusion method and the logarithm fusion method, both weighted with a 5 visual deg. 2D Gaussian, 2DGauss(s).
[Figure: spatio-temporal saliency map]
Saliency map fusion (2/2)
• To produce the spatio-temporal saliency map, we also propose a new fusion method:
  • Similar fusion properties as [method name missing in source]
  • Gives more weight to regions which have both:
    • High spatial saliency
    • High temporal saliency
  • Does not produce a null spatio-temporal saliency when the temporal saliency is very low.
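The fusion formulas themselves are not preserved in this transcript, so the following Python sketch only illustrates the stated properties under assumed forms. The multiplication fusion is taken as a plain per-pixel product, and the proposed fusion (named "square sum" later in the slides) is assumed to be a weighted sum of squared saliencies with hypothetical weights `alpha` and `beta`. Neither form is confirmed by the source, and the Gaussian weighting is omitted.

```python
def fuse_mul(sp, t):
    """Assumed multiplication fusion: product of spatial and temporal saliency."""
    return sp * t


def fuse_square_sum(sp, t, alpha=0.5, beta=0.5):
    """Assumed square sum fusion (hypothetical form and weights).

    Squaring favours pixels where a saliency value is high, and the sum
    stays non-zero when only one of the two saliencies is zero.
    """
    return alpha * sp ** 2 + beta * t ** 2


def fuse_maps(spatial, temporal, fuse):
    """Apply a per-pixel fusion function to two equally sized 2D maps."""
    return [
        [fuse(s, t) for s, t in zip(row_s, row_t)]
        for row_s, row_t in zip(spatial, temporal)
    ]


if __name__ == "__main__":
    spatial = [[0.9, 0.1], [0.9, 0.0]]
    temporal = [[0.9, 0.1], [0.0, 0.0]]
    print(fuse_maps(spatial, temporal, fuse_mul))
    print(fuse_maps(spatial, temporal, fuse_square_sum))
```

With these assumed forms, a pixel with high spatial but zero temporal saliency is nulled by the product, while the square sum keeps a non-zero value. That is the behaviour the slide describes for the proposed method.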
WMBER VQ metric based on saliency maps (1/3)
• Weighted Macro Block Error Rate (WMBER) is a no-reference metric.
• Visual attention is focused on the saliency map.
• Video transmission artifacts may change the saliency map.
• We propose to extract the saliency maps from the already broadcast, degraded video stream.
• WMBER also relies on MB error detection in the bit stream:
  • DC/AC and MV error detection
  • Error propagation according to the H.264 decoding process
• WMBER is based on:
  • MB error detection
  • Weighting by saliency maps
[Figure: original transmission error; propagation of transmission errors]
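WMBER VQ metric based on saliency maps (2/3)
[Block diagram: the decoder outputs an MB error map and the decoded frame; the gradient energy (GME) is computed from the decoded frame; the MB error map, GME, and saliency map are multiplied (X) and summed (Σ), then divided (/) by a second sum (Σ) to give the WMBER.]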
WMBER VQ metric based on saliency maps (3/3)
• When MB errors cover the whole frame and the energy of the gradient is high: WMBER is high (near 1.0).
• When there are no MB errors or the energy of the gradient is low: WMBER is low (near 0.0).
• The WMBER of a video sequence is the average WMBER of the frames.
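The behaviour described above can be illustrated with a small Python sketch. It is not the authors' implementation: the exact normalization is not preserved in this transcript, so the sketch assumes that the per-frame score is the saliency-weighted sum of (MB error flag × gradient energy) divided by the sum of the saliency weights, with gradient energy already normalized to [0, 1]. The function names and the per-macroblock list layout are hypothetical.

```python
def frame_wmber(mb_errors, grad_energy, saliency):
    """Assumed per-frame WMBER.

    mb_errors:   0/1 flag per macroblock (1 = erroneous MB)
    grad_energy: gradient energy per macroblock, assumed normalized to [0, 1]
    saliency:    non-negative saliency weight per macroblock
    """
    total_saliency = sum(saliency)
    if total_saliency == 0:
        return 0.0  # no salient area: nothing to weight (assumption)
    weighted = sum(e * g * s for e, g, s in zip(mb_errors, grad_energy, saliency))
    return weighted / total_saliency


def sequence_wmber(frames):
    """WMBER of a sequence: the average of the per-frame WMBER values."""
    frames = list(frames)
    if not frames:
        return 0.0
    return sum(frame_wmber(*frame) for frame in frames) / len(frames)


if __name__ == "__main__":
    damaged = ([1, 1], [1.0, 1.0], [1.0, 3.0])  # errors everywhere, strong gradient
    clean = ([0, 0], [1.0, 1.0], [1.0, 3.0])    # no MB errors
    print(frame_wmber(*damaged), frame_wmber(*clean))
    print(sequence_wmber([damaged, clean]))
```

Under these assumptions the score reproduces the two limiting cases from the slide: it approaches 1.0 when errors cover the frame and gradient energy is high, and it is 0.0 when there are no MB errors or the gradient energy is zero.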
Subjective Experiment
• Subjective experiment according to:
  • VQEG Report on Validation of the Video Quality Models for High Definition Video Content (June 2010)
  • ITU-R Rec. BT.500-11
• 20 HDTV (1920x1080 pixels) video sources (SRC) from:
  • The Open Video Project: www.open-video.org
  • NTIA/ITS
  • TUM/Taurus Media Technik
  • French HDTV
• Measure the influence of transmission loss on perceived quality
• 2 loss models:
  • IP model (ITU-T Rec. G.1050)
  • RF (Radio Frequency) model
• 8 loss profiles were compared
• 160 Processed Video Streams (PVS)
• 35 participants were gathered
• MOS values were computed for each SRC and PVS.
[Figure: experiment room]
Subjective experiment results
Prediction of subjective quality metrics from objective quality metrics
• We propose to use a supervised learning method to predict MOS values from WMBER or MSE.
• This prediction method is called similarity-weighted average.
• It requires a training data set of n known pairs (xi, yi) to predict y from x.
• Here the (xi, yi) pairs are WMBER or MSE values associated with MOS values.
• y is the predicted MOS for a given WMBER/MSE value x.
• The prediction is performed with what is known as a weighted mean classifier (the formula is not preserved in this transcript).
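A similarity-weighted average generally takes the form y = Σ s(x, xi)·yi / Σ s(x, xi). The sketch below shows that general form in Python. The similarity function used in the slides is not preserved, so a Gaussian kernel with a hypothetical `bandwidth` parameter is assumed here purely for illustration.

```python
import math


def gaussian_similarity(x, xi, bandwidth=0.1):
    """Assumed similarity: Gaussian kernel on the metric distance (hypothetical)."""
    return math.exp(-((x - xi) ** 2) / (2.0 * bandwidth ** 2))


def predict_mos(x, training_pairs, similarity=gaussian_similarity):
    """Predict MOS for metric value x as a similarity-weighted average.

    training_pairs: iterable of (xi, yi), i.e. (WMBER or MSE value, MOS).
    """
    pairs = list(training_pairs)
    if not pairs:
        raise ValueError("training set is empty")
    weights = [similarity(x, xi) for xi, _ in pairs]
    total = sum(weights)
    if total == 0.0:
        # All similarities underflowed: fall back to the plain mean (assumption).
        return sum(yi for _, yi in pairs) / len(pairs)
    return sum(w * yi for w, (_, yi) in zip(weights, pairs)) / total


if __name__ == "__main__":
    training = [(0.0, 5.0), (0.5, 3.0), (1.0, 1.0)]  # illustrative values only
    print(predict_mos(0.25, training))
```

The training values in the usage example are made up for illustration and are not results from the experiment.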
Evaluation and results
• We compare 6 objective video quality metrics:
  • MSE
  • WMBER using the 5 visual deg. 2D Gaussian (WMBER2DGauss)
  • WMBER using the multiplication fusion (WMBERmul)
  • WMBER using the log sum fusion (WMBERlog)
  • WMBER using the square sum fusion (WMBERsquare)
  • WMBER using the spatial saliency map (WMBERsp)
• All metrics are computed for each of the 160 PVS and 20 SRC.
• 6 data sets are built: 180 pairs (objective metric, MOS) each.
• Each data set is split into 2 equal parts: a training set and an evaluation set.
• The Pearson Correlation Coefficient (PCC) is used for the evaluation.
• Cross validation
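As a minimal sketch of this evaluation protocol, the Python code below splits a data set of (objective metric, MOS) pairs into two equal halves and computes the Pearson Correlation Coefficient between two series, for example predicted and subjective MOS on the evaluation half. The slides do not say how the split was drawn, so a seeded random shuffle is assumed; `split_half` and `pearson` are hypothetical helper names.

```python
import math
import random


def pearson(xs, ys):
    """Pearson Correlation Coefficient between two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("PCC is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def split_half(pairs, seed=0):
    """Shuffle (metric, MOS) pairs and split them into training/evaluation halves."""
    shuffled = list(pairs)
    random.Random(seed).shuffle(shuffled)  # assumed: random split, fixed seed
    mid = len(shuffled) // 2
    return shuffled[:mid], shuffled[mid:]


if __name__ == "__main__":
    data = [(i / 179.0, 5.0 - 4.0 * i / 179.0) for i in range(180)]  # synthetic
    training, evaluation = split_half(data)
    print(len(training), len(evaluation))
    print(pearson([x for x, _ in evaluation], [y for _, y in evaluation]))
```

Swapping the roles of the two halves and averaging the resulting PCC values would give a simple two-fold cross validation, which is one possible reading of the "Cross validation" bullet.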
Conclusion and future work
• We were interested in the problem of objective video quality assessment over lossy channels.
• We followed the recent trends in the definition of spatio-temporal saliency maps for FOA.
• New no-reference metric: the WMBER, based on saliency maps.
• We brought a new solution for saliency map fusion: the square sum fusion.
• We proposed a supervised learning method to predict the subjective quality metric MOS from objective quality metrics:
  • Similarity-weighted average.
  • Gives better results than the conventional approach, polynomial fitting.
• We intend to improve the saliency model to better consider:
  • Transmission artifacts
  • The masking effect in the neighborhood of high-saliency areas.
• We plan to evaluate the WMBER on the IRCCyN/IVC Eyetracker SD 2009_12 Database.
Thank you for your attention. Any questions?

A metric for no reference video quality assessment for hd tv delivery based on saliency maps

  • 1. A METRIC FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT FOR HD TV DELIVERY BASED ON SALIENCY MAPS H. BOUJUT*, J. BENOIS-PINEAU*, T. AHMED*, O. HADAR** & P. BONNET*** *LaBRI UMR CNRS 5800, University of Bordeaux, France **Communication Systems Engineering Dept., Ben Gurion University of the Negev, Israel ***AudematWorldCast Systems Group, France ICME 2011 – Workshop on Hot Topics in Multimedia Delivery (HotMD’11) 2011-07-11
  • 2. Overview Introduction Focus Of Attention and Saliency Maps Our approach: Weighted Macro Block Error Rate (WMBER) based on saliency maps a no reference video quality metric Prediction of subjective quality metrics from objective quality metrics Evaluation and results Conclusion and future work
  • 3. Introduction Motivation VQA for HD broadcast applications Measure the influence of transmission loss on perceived quality Video quality assessment protocol Full Reference (FR) SSIM (Z. Wang, A. Bovik) A novel perceptual metric for video compression (A. Bhat, I. Richardson) PCS’09 Evaluation of temporal variation of video quality in packet loss networks (C. Yim, A. C. Bovik, 2011) Image Communication 26 (2011) Reduced Reference (RR) A Convolutional Neural Network Approach for Objective Video Quality Assessment (P. Le Callet, C. Viard-Gaudin, D. Barba) IEEE Transactions on Neural Networks 17. No Reference (NR) No-reference image and video quality estimation: Applications and human-motivated design (S. Hemami, A. Reibman) Image Communication 25 (2010) In this work: NR VQA with visual saliency in H.264/AVC framework Contributions: Visual saliency map during compression process WMBER NR quality metric Prediction of subjective quality metrics from objective quality metrics
  • 4. Focus of Attention and Saliency maps FOA is mostly attracted by salient areas which stand out from the visual scene. FOA is sequentially grabbed over the salient areas. Salient stimuli are mainly due to: High color Contrast Motion Edge orientation Original Frame Saliency map Tractor sequence (TUM/VQEG)
  • 5. Saliency maps (1/2) Several methods for saliency map extraction already exist in the literature. All methods work in the same way [O. Brouard, V. Ricordel and D. Barba, 2009], [S. Marat, et al., 2009]: Extraction of the spatial saliency map (static pathway) Extraction of the temporal saliency map (dynamic pathway) Fusion of the spatial and the temporal saliency maps (fusion) Temporal saliency map Spatial saliency map Spatio-temporal saliency map
  • 6. Saliency maps (2/2) In this work we re-used the saliency map extraction method published at IS&T Electronic Imaging 2011: Based on the saliency map model from O. Brouard, V. Ricordel and D. Barba. Uses partial decoding of the H.264 stream to reach real-time performance. A fusion method to combine spatial and temporal saliency maps has been proposed. We propose a new fusion method
  • 7. Saliency map fusion (1/2) We use the multiplication fusion method and the logarithm fusion method, both weighted with a 5 visual deg. 2D Gaussian 2DGauss(s), to compare with our proposed fusion method. Figure: spatio-temporal saliency map
  • 8. Saliency map fusion (2/2) To produce the spatio-temporal saliency map, we also propose a new fusion method. Similar fusion properties as [equation missing]. Gives more weight to regions which have both: High spatial saliency, High temporal saliency. Does not provide null spatio-temporal saliency when temporal saliency is very low.
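The contrast drawn on slide 8 can be illustrated with a small sketch. The slide's own fusion formulas did not survive in this transcript, so the two functions below are assumed forms chosen only to match the method names used in the talk: `fuse_multiplication` multiplies the two saliency values, and `fuse_square_sum` squares their sum, which is one possible reading of "square sum". The function names, the [0, 1] value range, and the omission of the 2D Gaussian weighting are all assumptions, not the authors' definitions.

```python
def fuse_multiplication(s_spatial, s_temporal):
    """Multiplicative fusion of two saliency values in [0, 1].

    The result is zero whenever either input is zero.
    """
    return s_spatial * s_temporal


def fuse_square_sum(s_spatial, s_temporal):
    """Hypothetical square-sum fusion: the square of the summed saliencies.

    This is an assumed form, not the formula from the slides. It stays
    non-zero when only one of the two inputs is zero, and grows fastest
    where both inputs are high.
    """
    return (s_spatial + s_temporal) ** 2


if __name__ == "__main__":
    # A static but spatially salient region (no motion).
    print(fuse_multiplication(0.8, 0.0), fuse_square_sum(0.8, 0.0))
    # A region that is salient in both pathways.
    print(fuse_multiplication(0.8, 0.8), fuse_square_sum(0.8, 0.8))
```

Under these assumed forms, a region with no temporal saliency is discarded by the multiplication but kept by the square sum, which is the behaviour the slide asks for.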
  • 9. WMBER Vq metric based on saliency maps (1/3) Weighted Macro Block Error Rate (WMBER) is a No Reference metric. Visual attention is focused on the saliency map. Video transmission artifacts may change the saliency map. We propose to extract the saliency maps from the already broadcast, disturbed video stream. WMBER also relies on MB error detection in the bit stream: DC/AC and MV error detection, and error propagation according to the H.264 decoding process. WMBER is based on MB error detection weighted by saliency maps. Figure labels: Original transmission error, Propagation of transmission errors
  • 10. WMBER Vq metric based on saliency maps (2/3) [Block diagram; labels: MB error map & Decoder, Decoded Frame, Gradient energy, ×, Σ, GME, Saliency Map, /, Σ, WMBER]
  • 11. WMBER Vq metric based on saliency maps (3/3) When MB errors cover the whole frame and the energy of the gradient is high: WMBER is high (near 1.0). When there are no MB errors or the energy of the gradient is low: WMBER is low (near 0.0). The WMBER of a video sequence is the average WMBER of its frames.
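The behaviour described on slides 10 and 11 can be sketched as follows. This is only one reading of the block diagram, since the exact formula is not given in the transcript: the per-macroblock error flag is multiplied by a gradient energy and a saliency weight, the products are summed, and the sum is normalised by the total saliency. The function names, the per-macroblock 2D-list layout, and the assumption that gradient energy and saliency are already scaled to [0, 1] are all hypothetical.

```python
def frame_wmber(error_map, gradient_energy, saliency):
    """Saliency-weighted macroblock error rate for one frame (assumed form).

    All three arguments are equally sized 2D lists with one entry per
    macroblock: error_map holds 0/1 flags, gradient_energy and saliency
    hold values assumed to lie in [0, 1].
    """
    weighted_errors = 0.0
    total_saliency = 0.0
    for err_row, grad_row, sal_row in zip(error_map, gradient_energy, saliency):
        for err, grad, sal in zip(err_row, grad_row, sal_row):
            weighted_errors += err * grad * sal
            total_saliency += sal
    # A frame with no salient macroblock contributes a score of 0.
    return weighted_errors / total_saliency if total_saliency > 0 else 0.0


def sequence_wmber(frame_scores):
    """WMBER of a sequence: the average of the per-frame scores."""
    return sum(frame_scores) / len(frame_scores)


if __name__ == "__main__":
    saliency = [[0.2, 0.8], [0.5, 0.5]]
    high_gradient = [[1.0, 1.0], [1.0, 1.0]]
    all_errors = [[1, 1], [1, 1]]
    no_errors = [[0, 0], [0, 0]]
    scores = [
        frame_wmber(all_errors, high_gradient, saliency),
        frame_wmber(no_errors, high_gradient, saliency),
    ]
    print(scores, sequence_wmber(scores))
```

With these inputs the sketch reproduces the two limit cases from the slide: errors over the whole frame with high gradient energy give a score of 1.0, and an error-free frame gives 0.0.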
  • 12. Subjective Experiment The subjective experiment was conducted according to: VQEG Report on Validation of the Video Quality Models for High Definition Video Content (June 2010); ITU-R Rec. BT.500-11. 20 HDTV (1920x1080 pixels) video sources (SRC) from: The Open Video Project (www.open-video.org), NTIA/ITS, TUM/Taurus Media Technik, French HDTV. Goal: measure the influence of transmission loss on perceived quality. 2 loss models: IP model (ITU-T Rec. G.1050) and RF (Radio Frequency) model. 8 loss profiles were compared. 160 Processed Video Streams (PVS). 35 participants were gathered. MOS values were computed for each SRC and PVS. Figure: experiment room
  • 14. Prediction of subjective quality metrics from objective quality metrics We propose to use a supervised learning method to predict MOS values from WMBER or MSE. This prediction method is called similarity-weighted average. It requires a training data set of n known pairs (xi, yi) to predict y from x. Here the (xi, yi) pairs are WMBER or MSE values associated with MOS values, and y is the predicted MOS for a given WMBER/MSE value x. The prediction is performed using the following estimator, known as a weighted mean classifier: [equation missing]
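A minimal sketch of a similarity-weighted average is given below. The transcript does not preserve the prediction formula or the similarity function, so the Gaussian kernel, the `bandwidth` parameter, and the function name are assumptions made for illustration. The general idea is the one stated on the slide: the predicted y is a mean of the training yi values, each weighted by how similar its xi is to the query x.

```python
import math


def similarity_weighted_average(x, training_pairs, bandwidth=0.1):
    """Predict y for x as a similarity-weighted mean of the training y values.

    training_pairs is a list of (xi, yi) tuples, e.g. (WMBER, MOS).
    The Gaussian similarity and its bandwidth are assumptions; the slides
    do not state which similarity function was used.
    """
    weights = [math.exp(-((x - xi) / bandwidth) ** 2) for xi, _ in training_pairs]
    total = sum(weights)
    if total == 0.0:
        # Every similarity underflowed to zero: fall back to the plain mean.
        return sum(yi for _, yi in training_pairs) / len(training_pairs)
    return sum(w * yi for w, (_, yi) in zip(weights, training_pairs)) / total


if __name__ == "__main__":
    # Hypothetical (WMBER, MOS) training pairs, not data from the study.
    training = [(0.0, 4.8), (0.1, 4.1), (0.3, 3.0), (0.6, 2.0), (0.9, 1.2)]
    print(similarity_weighted_average(0.2, training))
```

Because the prediction is a convex combination of the training yi values, it always stays within the range of MOS values seen in training, which a polynomial fit does not guarantee.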
  • 15. Evaluation and results We compare 6 objective video quality metrics: MSE; WMBER using the 5 visual deg. 2D Gaussian (WMBER2DGauss); WMBER using the multiplication fusion (WMBERmul); WMBER using the log sum fusion (WMBERlog); WMBER using the square sum fusion (WMBERsquare); WMBER using the spatial saliency map (WMBERsp). All metrics are computed for each of the 160 PVS and 20 SRC. 6 data sets are built: 180 pairs Objective Metric/MOS each. Each data set is split into 2 equal parts: Training set and Evaluation set. The Pearson Correlation Coefficient (PCC) is used for the evaluation. Cross validation
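The evaluation step on slide 15 can be sketched with the standard Pearson correlation formula and a simple split into two halves. How the 180 pairs were actually assigned to the training and evaluation sets is not stated in the transcript, so the positional split in `split_in_half` is an assumption, and both function names are hypothetical.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson Correlation Coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / math.sqrt(var_x * var_y)


def split_in_half(pairs):
    """Split a list of (metric, MOS) pairs into two equal parts.

    A plain positional split is assumed here; shuffle the list first
    if the pairs are ordered by source or by loss profile.
    """
    middle = len(pairs) // 2
    return pairs[:middle], pairs[middle:]


if __name__ == "__main__":
    # Hypothetical predicted and subjective MOS values, not study data.
    predicted = [4.5, 3.9, 3.1, 2.2, 1.4]
    subjective = [4.7, 4.0, 2.8, 2.5, 1.1]
    print(pearson_correlation(predicted, subjective))
```

In a setup like the one described, the PCC would be computed on the evaluation half between the MOS predicted from each objective metric and the MOS collected in the subjective experiment.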
  • 16. Conclusion and future work We were interested in the problem of objective video quality assessment over lossy channels. We followed the recent trends in the definition of spatio-temporal saliency maps for FOA. New no reference metric: the WMBER based on saliency maps. We brought a new solution for saliency map fusion: the square sum fusion. We proposed a supervised learning method to predict the subjective quality metric MOS from objective quality metrics: the similarity-weighted average. It gives better results than the conventional approach, polynomial fitting. We intend to improve the saliency model to better consider: transmission artifacts; the masking effect in the neighborhood of high saliency areas. We plan to evaluate the WMBER on the IRCCyN/IVC Eyetracker SD 2009_12 Database.
  • 17. Thank you for your attention. Any questions?