H263.ppt

Image & Video Compression Conferencing & Internet Video Portland State University Sharif University of Technology

Objectives ,[object Object],[object Object],[object Object],[object Object],[object Object]

Outline Section 1: Conferencing Video Section 2: Internet Review Section 3: Internet Video

Section 1: Conferencing Video ,[object Object],[object Object],[object Object],[object Object],[object Object]

Garden Variety Video Coder Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream Video codecs have three main functional blocks Video Compression Review

Symbol Encoding Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream The symbol encoder exploits the statistical properties of its input by using shorter code words for more common symbols. Examples: Huffman & Arithmetic Coding Video Compression Review

Symbol Encoding Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream This block is the basis for most lossless image coders (in conjunction with DPCM, etc.) Video Compression Review

Transform & Quantization Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream A transform (usually DCT) is applied to the input data for better energy compaction which decreases the entropy and improves the performance of the symbol encoder. Video Compression Review

Transform & Quantization Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream The DCT also decomposes the input into its frequency components so that perceptual properties can be exploited. For example, we can throw away high frequency content first. Video Compression Review

Transform & Quantization Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream Quantization lets us reduce the representation size of each symbol, improving compression but at the expense of added errors. It’s the main tuning knob for controlling data rate. Video Compression Review

Transform & Quantization Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream Zig-zag scanning and run-length encoding orders the data into 1-D arrays and replaces long runs of zeros with run-length symbols. Video Compression Review

Still Image Compression Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream These two components form the basis for many still image compression algorithms such as JPEG, PhotoCD, M-JPEG and DV. Video Compression Review

Motion Estimation/Compensation Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream Finally, because video is a sequence of pictures with high temporal correlation, we add motion estimation/compensation to try to predict as much of the current frame as possible from the previous frame. Video Compression Review

Motion Estimation/Compensation Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream Most common method is to predict each block in the current frame by a (possibly translated) block of the previous frame. Video Compression Review

Garden Variety Video Coder Motion Estimation & Compensation Transform, Quantization, Zig- Zag Scan & Run- Length Encoding Symbol Encoder Frames of Digital Video Bit Stream These three components form the basis for most of the standard video compression algorithms: MPEG-1, -2, & -4, H.261, H.263, H.263+. Video Compression Review

Section 1: Conferencing Video ,[object Object],[object Object],[object Object],[object Object],Chronology of Video Standards

Chronology of Video Standards 1990 1996 2002 1992 1994 1998 2000 H.263L H.263++ H.263+ H.263 H.261 MPEG 7 MPEG 4 MPEG 2 MPEG 1 ISO ITU-T

Chronology of Video Standards ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Chronology continued ,[object Object],[object Object],[object Object],[object Object]

Chronology continued ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Chronology continued ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Chronology continued ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Section 1: Conferencing Video ,[object Object],[object Object],[object Object],[object Object],The Input Video Format

Video Format for Conferencing ,[object Object],[object Object],[object Object],Input Format Cb Cr Y =

YCbCr (YUV) Color Space 0.299 0.587 0.114 -0.169 -0.331 0.500 0.500 -0.419 -0.081 R G B Y Cb Cr = Y represents the luminance of a pixel. Cr, Cb represents the color difference or chrominance of a pixel. Input Format ,[object Object],[object Object]

Subsampled Chrominance Input Format ,[object Object],[object Object]

Spatial relation between luma and chroma pels for CIF 4:2:0 Different than MPEG-2 4:2:0 Input Format

Common Intermediate Format Input Format ,[object Object],[object Object],[object Object],[object Object]

Picture & Pixel Aspect Ratios Picture 4:3 352 288 Pixel 12:11 Pixels are not square in CIF. Input Format

Picture & Pixel Aspect Ratios Hence on a square pixel display such as a computer screen, the video will look slightly compressed horizontally. The solution is to spatially resample the video frames to be 384 x 288 or 352 x 264 This corresponds to a 4:3 aspect ratio for the picture area on a square pixel display. Input Format

Blocks and Macroblocks The luma and chroma planes are divided into 8x8 pixel blocks. Every four luma blocks are associated with a corresponding Cb and Cr block to create a macroblock . 8x8 pixel blocks macroblock Y Cb Cr Input Format

Section 1: Conferencing Video ,[object Object],[object Object],[object Object],[object Object],H.263 Overview

ITU-T Recommendation H.263 ,[object Object],[object Object],[object Object]

ITU-T Recommendation H.263 Composed of a baseline plus four negotiable options Baseline Codec Unrestricted/Extended Motion Vector Mode Advanced Prediction Mode PB Frames Mode Syntax-based Arithmetic Coding Mode

Frame Formats Always 12:11 pixel aspect ratio. H.263 Baseline

Picture & Macroblock Types ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],H.263 Baseline

Motion Vectors H.263 Baseline ,[object Object],[object Object],[object Object],X C B A

Motion Vector Delta (MVD) Symbol Lengths H.263 Baseline

Transform Coefficient Coding H.263 Baseline ,[object Object],[object Object],[object Object],[object Object],[object Object]

Quantization H.263 Baseline ,[object Object],[object Object],[object Object],Q -Q 2Q -2Q in out

Bit Stream Syntax Hierarchy of three layers. Picture Layer GOB* Layer MB Layer *A GOB is usually a row of macroblocks, except for frame sizes greater than CIF. Picture Hdr GOB Hdr MB MB ... GOB Hdr ... H.263 Baseline

Picture Layer Concepts Picture Start Code Temporal Reference Picture Type Picture Quant H.263 Baseline ,[object Object],[object Object],[object Object],[object Object]

GOB Layer Concepts GOB Headers are Optional GOB Start Code GOB Number GOB Quant H.263 Baseline ,[object Object],[object Object],[object Object],GOB can be decoded independently from the rest of the frame.

Macroblock Layer Concepts Coded Flag MB Type Code Block Pattern MV Deltas Transform Coefficients DQuant H.263 Baseline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Unrestricted/Extended Motion Vector Mode ,[object Object],[object Object],[object Object],[object Object],H.263 Options

Motion Vectors Over Picture Boundaries Target Frame N Reference Frame N-1 Edge pixels are repeated. H.263 Options

Extended MV Range Base motion vector range. Extended motion vector range, [-16,15.5] around MV predictor. H.263 Options 15.5 15.5 -16 -16 -16 -16 15.5 15.5 (31.5,31.5)

Advanced Prediction Mode H.263 Options ,[object Object],[object Object],[object Object]

Overlapped Motion Compensation ,[object Object],[object Object],[object Object],[object Object],H.263 Options

Overlapped Motion Comp. ,[object Object],[object Object],[object Object],H.263 Options MV 0 MV 1 MV 1 MV 2 MV 2

Overlapped Motion Comp. ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],H.263 Options

Overlapped Motion Comp. H 0 (i, j) = H.263 Options

Overlapped Motion Comp. H 1 (i, j) = H.263 Options H 2 (i, j) = ( H 1 (i, j) ) T

PB Frames Mode H.263 Options ,[object Object],[object Object],[object Object],[object Object]

PB Frames Picture 1 P or I Frame Picture 2 B Frame Picture 3 P or I Frame V 1/2 -V 1/2 2X frame rate for only 30% more bits. H.263 Options PB

Syntax based Arithmetic Coding Mode H.263 Options ,[object Object],[object Object],[object Object]

H.263 Improvements over H.261 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Demo: QCIF, 8 fps @ 28 Kb/s H.261 H.263

Video Conferencing Demonstration

Section 1: Conferencing Video ,[object Object],[object Object],[object Object],[object Object],H.263 Options H.263+ Overview

ITU-T Recommendation H.263 Version 2 (H.263+)

H.263 Ver. 2 (H.263+) ,[object Object],[object Object],[object Object],H.263+

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],H.263: Overview H.263 “plus” more negotiable options H.263+

H.263: Overview H.263 “plus” more negotiable options ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Arbitrary Frame Size, Pixel Aspect Ratio, Clock Frequency H.263+ ,[object Object],[object Object],[object Object]

Advanced INTRA Coding Mode H.263+ ,[object Object],[object Object],[object Object],[object Object]

Advanced INTRA Mode DCT Blocks Row Prediction Column Prediction H.263+

Deblocking Filter Mode H.263+ ,[object Object],[object Object],[object Object]

Deblocking Filter Mode Block Boundary Block Boundary H.263+

Deblocking Filter Mode ,[object Object],[object Object],H.263+

Deblocking Filter Mode ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],H.263+

Post-Filter ,[object Object],[object Object],[object Object],H.263+

Deblocking Filter Demo H.263+ No Filter Deblocking Loop Filter

Deblocking Filter Demo H.263+ No Filter Loop & Post Filter

Filter Demo Videos No Filter Loop Filter Loop & Post Filter

Slice Structured Mode H.263+ ,[object Object],[object Object],[object Object],[object Object]

Slice Structured Mode Slice Boundaries No INTRA or MV Prediction across slice boundaries. H.263+ Slices start and end on macroblock boundaries.

Slice Structured Mode Independent Segments Slice Boundaries No INTRA or MV Prediction across slice boundaries. H.263+ Slice sizes remain fixed between INTRA frames.

Supplemental Enhancement Information H.263+ ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Reference Picture Resampling H.263+ ,[object Object],[object Object],[object Object]

Reference Picture Resampling with Warping Specify arbitrary warping parameters via displacement vectors from corners. H.263+

Reference Picture Resampling Factor of 4 Size Change P P P P P No INTRA Frame Required when changing video frame sizes H.263+

Scalability Mode H.263+ ,[object Object],[object Object],[object Object],[object Object],Base Layer Enhancement Layer 1 Enhancement Layer 2

Layered Video Bitstreams Enh. Layer 1 Enhancement Layer 3 Enhancement Layer 4 Base Layer Enhancement Layer 2 H.263+ Encoder 40 kb/s 20 kb/s 90 kb/s 200 kb/s 320 kb/s H.263+

Scalability Mode H.263+ ,[object Object],[object Object]

Layered Video Bit Streams in multipoint conferencing 384 kb/s 384 kb/s 128 kb/s 28.8 kb/s H.263+

Temporal Enhancement Higher Frame Rate! Base Layer + B Frames H.263+

Temporal Scalability Temporal scalability means that two or more frame rates can be supported by the same bit stream. In other words, frames can be discarded (to lower the frame rate) and the bit stream remains usable. H.263+ I or P B B P ... ...

Temporal Scalability H.263+ ,[object Object],[object Object],[object Object]

B Frames Picture 1 P or I Frame Picture 2 B Frame Picture 3 P or I Frame V 1/2 -V 1/2 2X frame rate for only 30% more bits H.263+

Temporal Scalability Demonstration ,[object Object],[object Object],H.263+

SNR Enhancement Better Spatial Quality! Base Layer + SNR Layer H.263+

SNR Scalability H.263+ ,[object Object],[object Object],[object Object],[object Object]

SNR Scalability Base Layer (15 kbit/s) Enhancement Layer (40 kbit/s) H.263+ EI EP EP P P I Legend: I - Intracoded or Key Frame P - Predicted Frame EI - Enhancement layer key frame EP - Enhancement layer predicted frame

SNR Scalability Demonstration ,[object Object],[object Object],H.263+

Spatial Enhancement More Spatial Resolution!! Base Layer + Spatial Layer H.263+

Spatial Scalability H.263+ ,[object Object],[object Object],[object Object]

Spatial Scalability H.263+ EP EP EI I P P Enhancement Layer Base Layer

Spatial Scalability Demonstration ,[object Object],[object Object],H.263+

Hybrid Scalability It is possible to combine temporal, SNR and spatial scalability into a flexible layered framework with many levels of quality. H.263+

Hybrid Scalability H.263+ EI EP P B EP P Base Layer Enhancement Layer 1 Enhancement Layer 2 EP EP P EI EI I

Scalability Demonstration ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],H.263+

Other Miscellaneous Features ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],H.263+

Other Miscellaneous Features ,[object Object],[object Object],[object Object],[object Object],H.263+

Outline Section 1: Conferencing Video Section 2: Internet Review Section 3: Internet Video 

Internet Basics Phone lines are “circuit-switched”. A (virtual) circuit is established at call initiation and remains for the duration of the call. Source Dest. switch switch switch Internet Review

Internet Basics Computer networks are “packet-switched”. Data is fragmented into packets, and each packet finds its way to the destination using different routes. Lots of implications... Source Dest. switch switch switch X Internet Review

The Internet is heterogeneous [V. Cerf] Corporate LAN INTERNET (Global Public) HyperStream FR, SMDS, ATM TYMNET Host Dial-up IP “ SLIP”, “PPP” IP IP IP “ SMTP” E-mail FR FR FR “ SLIP” “ PPP” X.25 “ SMTP” IP Dial-up E-mail R R R AOL LAN LAN MCI Mail LAN Mail GW

Layers in the Internet Protocol Architecture Network Access Layer consists of routines for accessing physical networks 1 Internet Layer defines the datagram and handles the routing of data. 2 Host-to-Host Transport Layer provides end-to-end data delivery services. 3 Application Layer consists of applications and processes that use the network. 4 Internet Review

Data Encapsulation Header Header Data Encapsulation Header Data Application Layer Transport Layer Internet Layer Network Access Layer Data Header Data Header Header Data Internet Review

Internet Protocol Architecture FDDI Ethernet Token Ring HDLC SMDS X.25 ATM FR TCP UDP SNMP DNS TELNET FTP SMTP MIME . . . . . . Network Access Layer Internet Host-Host Transport Utility/ Application RTP Internet Review MBone VIC/VAT I P

Specific Protocols for Multimedia IP UDP RTP Specific Protocols for Multimedia IP TCP UDP RTP Physical Network payload RTP payload UDP RTP payload Data Internet Review Payload header

[object Object],[object Object],[object Object],[object Object],[object Object],The Internet Protocol (IP) Internet Review

The Internet Protocol (IP) ,[object Object],[object Object],[object Object],[object Object],Internet Review

Transmission Control Protocol (TCP) ,[object Object],[object Object],[object Object],[object Object],[object Object],Transmission Control Protocol (TCP) Internet Review

Universal Datagram Protocol (UDP) ,[object Object],[object Object],[object Object],[object Object],Internet Review

Real time Transport Protocol (RTP) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Internet Review

Real-time Transport Protocol (RTP) ,[object Object],[object Object],Internet Review

Real-time Transport Control Protocol (RTCP) ,[object Object],[object Object],Internet Review

Real-time Transport Control Protocol (RTCP) ,[object Object],[object Object],[object Object],[object Object],Internet Review

Multicast Backbone (Mbone) ,[object Object],[object Object],Internet Review

Unicast Example Streaming media to multi-participants S1 D1 S2 D1 D2 S1 sends duplicate packets because there’s two participants: D1, D2.. D2 sees excess traffic on this subnet. Internet Review 1 1 1 1 2 2 2 1 1 1 1

Multicast Example Streaming media to multi-participants S1 D1 S2 D1 D2 S1 sends single set of packets to a multicast group. D2 doesn’t see any excess traffic on this subnet. Both D1 receivers subscribe to the same multicast group. Internet Review 1 1 1 2 2 2 1 1

Multicast Backbone (MBone) ,[object Object],[object Object],[object Object],[object Object],Internet Review

ReSerVation Protocol (RSVP) Internet Draft ,[object Object],[object Object],[object Object],[object Object],[object Object],Internet Review

Real-time Streaming Protocol (RTSP) Internet Draft ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Internet Review

Hyper-Text Transport Protocol (HTTP) ,[object Object],[object Object],[object Object],[object Object],Internet Review

Outline Section 1: Conferencing Video Section 2: Internet Review Section 3: Internet Video  

How do we stream video over the Internet? ,[object Object],[object Object],We’ll look at some solutions... Internet Video

HTTP Streaming ,[object Object],[object Object],[object Object],Internet Video

RTP Streaming ,[object Object],[object Object],[object Object],[object Object],Internet Video

H.263 Payload for RTP ,[object Object],[object Object],[object Object],[object Object],Internet Video RTP Header H.263 Payload Header H.263 Payload (bit stream)

H.263 Payload for RTP ,[object Object],[object Object],[object Object],Internet Video

Error Resiliency: Redundancy & Concealment Techniques Internet Video

Internet Packet Loss ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Internet Video

Packet Loss Burst Lengths Internet Video

First Order Loss Model 2-Stage Gilbert Model No Loss Loss 1 - p 1 - q p q Internet Video p = 0.083 q = 0.823

[object Object],[object Object],[object Object],Error Resiliency Internet Video + - REDUNDANCY compression resiliency

Error Resiliency Errors tend to propagate in video compression because of its predictive nature. I or P frame P frame One block is lost. Error propagates to two blocks in the next frame. Internet Video

[object Object],[object Object],[object Object],[object Object],Error Resiliency Internet Video

[object Object],[object Object],[object Object],Simple INTRA Coding & Skipped Blocks Internet Video

Intra Coding Resiliency Internet Video

Reference Picture Selection Mode of H.263+ I or P frame P frame P frame Last acknowledged error-free frame. In RPS Mode, a frame is not used for prediction in the encoder until it’s been acknowledged to be error free. No acknowledgment received yet - not used for prediction. Internet Video

[object Object],[object Object],Reference Picture Selection Internet Video

[object Object],[object Object],[object Object],[object Object],Multi-threaded Video 1 3 2 5 7 9 4 6 8 10 I P P P P P P P P I Internet Video

[object Object],[object Object],Conditional Replenishment ME/MC DCT, etc. decoder decoder Encoder Internet Video

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Conditional Replenishment Internet Video

[object Object],[object Object],[object Object],[object Object],[object Object],Error Tracking Appendix II, H.263 Internet Video

[object Object],[object Object],Prioritized Encoding AC Coefficients DC Coefficients MB Information Motion Vectors Picture Header Increasing Error Protection Internet Video

Prioritized Encoding Demo Internet Video Unprotected Encoding Prioritized Encoding (23% Overhead) Videos used with permission of ICSI, UC Berkeley

Error Concealment by Interpolation Lost block Take the weighted average of 4 neighboring pixels. Internet Video d 1 d 2

[object Object],[object Object],[object Object],[object Object],[object Object],Other Error Concealment Techniques Internet Video See references for more information...

Example: MRNF Filtering Internet Video

[object Object],[object Object],[object Object],[object Object],Network Congestion Internet Video

[object Object],[object Object],[object Object],[object Object],Receiver-driven Layered Multicast Internet Video

Receiver-driven Layered Multicast S D3 D2 D1 R R Internet Video Multicast groups are not transmitted on networks that have no subscribers. 1 2 3 1 2 3 1 2 1 2 1

H263.ppt

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Andere mochten auch

Andere mochten auch (6)

Ähnlich wie H263.ppt

Ähnlich wie H263.ppt (20)

Mehr von Videoguy

Mehr von Videoguy (20)

H263.ppt