study Video stylization for digital ambient displays of home

STUDY OF Video Stylization for Digital Ambient Displays of Home MoviesNPAR 2010 TinghuaiWang and John Collomosse University of Surrey, UK David Slatter, Phil Cheatle and Darryl Greig Hewlett-Packard Labs, Bristol UK. Video clips are stylized into cartoons or paintings, and sequenced according to semantic and visual similarity

Abstract “Digital Ambient Display” Video  cartoon of home movie Video segmentation based on multi-label graph cut Video  temporal coherent region maps (tracking regions) enhance cartoon painting System Algorithm

Outline Introduction System Overview View Stylization Multi-label Graph cut, region propagation, refining region label, smoothing and filtering, stroke placement and shading Video Sequencing Stochastic composition, rendering transitions Results and Discussion Conclusion Video  temporal coherence region Video  Home movie

Digital Ambient Display A genre of content consumption experience which we call ambient experience Displaying still images in an ambient way Digital picture frame Displaying video content inan ambient way ?  Digital Ambient Displays(DAD)

Digital Ambient Displays (DADs) Video  “mid-level” scene abstract using Color region segmentation Video  temporal coherence region regionpropagation,multi-labelmraphiccut Video  Home Movie Video selection,compositionandtransition

Related work Stochastic selection of video clips Stochastic transitions between video frames [Schodl et al. 00] Single video and based on visual similarity Composition of photos for abstract [Collomosse and Hall 03] Video artwork [Slatter et al. 10; Bizzocchi 08] Little work of the use of artistic video stylizationin ambient displays

Related work Image segmentation Mid-level models of scene structure [Wang et al . 04; Collomosse 04] to render in artistic styles Mean-shift based stylization [Wang et al . 04]  small and short-live segments Spatio-temporal volumes from video [Collomosse et al 05] of 3Ddimension (x,y,t) Abstract video using a bilateral filer [Winnemoller et al. 06] ,[object Object],[object Object]

Video Segmentation Labeling regions and tracking regions in temporal 4 1 1 1 4 5 5 4 2 2 2 3 3 3 t

Video Stylization Multi-label Graph cut, region propagation, refining region label, smoothing and filtering, stroke placement and shading Video  temporal coherence region

Video Segmentation A novel coherent video segmentation Multi-label graph cut on successive video frames … color distribution built Gaussian Mixture Model (GMM) of each region past frames fn-3 fn-2 previous frame fn-1 Multi-label graph cut propagated by motion Current frame fn label

Video Segmentation Assign region labels existing in frame It-1 to each pixel p in frame It(p) Find the best mappingl : P L where L = { l(1), …, l(p), … l(|P|) } , P is an 8-connected lattice of pixels To minimize the global energy function to encourage Spatial homogeneity of contrast within each frame Temporal consistency of color distribution between frames labeling

Minimize global energy E U : temporal consistency of color distribution between frames V : spatial homogeneity of contrast within each frame where 1) L is label set of the previous frame 2) P is connected pixels in belong to labels 3) Θ is the colorhistory model

Minimize energy of V V : spatial homogeneity of contrast within each frame 2 1 ? 3 Punish pair points (8-connected neighbor) where they have different label but have high color homogeneity !

Minimize energy of U U : temporal consistency of color distribution between frames Color histogram at pixel p– label/color at each frame color distributions of different label assignment pixel p the color distribution at pixel pwith label L1 the color distribution at pixel pwith label L4 255 255 color color

Minimize energy of U U : temporal consistency of color distribution between frames color distributions of different pixel pixel n the color distribution at pixel n with label l(pn) the color distribution at pixel m with label l(pm) pixel m color color

Minimize energy of U U : temporal consistency of color distribution between frames N:Normal distribution (μ, Σ) Σ N:Mixture of Gaussians (GMM) Θ:parameters of all GMMs, Θ = {ωik, μik, Σi,k; i = 1, …, L; k = 1, …, Ki}

Minimize energy of U U : temporal consistency of color distribution between frames Motion propagation O : the prior labeling of pixels

Multi-label Graph Cut Minimize E is a NP-hard problem Multi-label graph cut α-expansion iterationfor each label until E can not decrease [Boykov and Kolmogorov 2004].

Graph cut ,[object Object],Maximum flow ≡ minimum cut Hong Chen, “Introduction to Min-Cut/Max-Flow Algorithms”

Multiple-Label Graphic Cut Hong Chen, “Introduction to Min-Cut/Max-Flow Algorithms”

Multi-label graph cut on Binary Label [PAMI04] Boykov and Kolmogorov, “An Experimental Comparison of Min-Cur/Max-Flow Algorithms for Energy Minimization in Vision”

Multi-label graph cut on Binary Label Mini-Cut problem Max-Flow problem of each pixel

Multi-label graph cut on Binary Label Mini-Cut problem on boundary Max-Flow problem

Region propagation Estimate the motion of It-1 using RANSAC search based on SIFT features [Lowe 04] rigid motion + deformation I’t-1 Propagation labeling per pixel from I’t-1 It Incorrect motion estimation ? Use thinned skeleton to mitigate imprecise motion estimation

Region/Skeleton ≡ regions skeleton pruning skeleton robust region region propagation with motion error motion estimation ?

Skeleton to robust motion estimation use only the skeletons whose distance to the boundary exceeds apre-set confidence

Region propagation It-1 Labeled It-1 skeleton I’t-1 It region label warped according to per-pixel motion estimation replace regions with skeleton to robust motion estimation

GMM Build a GMM color model for each region li Sampling historical colors of labelled pixels over recent frames How to sample historical colors? contribution weight More recent color contributes more importance

Refining region labels How about new objects appear in It?

D Refining region labels Keeptwo color models for each label l in frame It (1) Historical color model (2) an update color model If |Mh – Mu| > threshold, new objects are deemed present

Build color model for the new objects Extract the dominant colors Mean-shift to cluster the spatial-color modes (XY+RGB) CMM and skeleton on the new region Re-applying Graphic cut optimization locally within the region Using new labels

Smoothing and filtering Spatio-temporal smoothing Gaussian filter of 3x3x3 (x-y-t) Filtering Remove false segmentation and short-lived object smoothing filtering

D Filtering - remove short-lived object D l : duration of label l K disconnected objects (e.g. c1, c2, c3…, cK) with the same label dl,k : duration of kth object with label l τr : threshold. in this paper, 6 frames If the duration of any of these disconnected video object within this time window is shorter than threshold, this video object is removed

Stroke Placement and Shading β-spline stroke Face detection Painterly Rendering Painterly Rendering with Curved Brush Strokes of Multiple Sizes [Hertzmann 1998] Interpolate an orientation field from the shape of the region in this paper

Orientation field ,[object Object],[object Object]

Video Sequencing Stochastic composition, rendering transitions Video  Home movie

Stochastic composition Video sequencing depends on ds (Va, Vb) : semantic distance (tag) between tags of video A and B dv (i, j) : the similarity of videos V1 V2 V3 V4 Vi

Rendering Transitions Smoothing video transition similar similar Region Morphing similar similar

Rendering Transitions Smoothing video transition 0.5 0.4 0.1 C(:) indicates mean color similarity; A(:) indicates relative area; S(:) indicates shape similarity interms of region compactness

Results 23 videos, manually tagged collection

Comparison - BOY ,[object Object]

‘synergistic’ mean-shift + edge (Comaniciu, 2002)

spatio-temporal method (Paris, 2008).,[object Object]

Result - KITE background detail may (optionally) be abstracted by modifying the initial frame segmentation to merge unwanted detailed regions

Result - DRAMA correct handling of regions that disappear and appear within sequences

study Video stylization for digital ambient displays of home

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Mehr von Chiamin Hsu

Mehr von Chiamin Hsu (6)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

study Video stylization for digital ambient displays of home