SlideShare ist ein Scribd-Unternehmen logo
1 von 93
Downloaden Sie, um offline zu lesen
IEEE ICME-2019
8:30 – 12:00, July. 08
Shanghai, China
Intelligent
Image/Video Editing
Tutorial
STRUCT Group
Jiaying Liu
Wenhan Yang
Text-Centric Image
Style Transfer
Part 2
Outline
Background / 005
Statistic Based Text Stylization / 014
Context-Aware Unsupervised Text Stylization / 053
Text Effects Transfer via Stylization & Destylization / 077
Text-Centric Image Style Transfer
Outline
Background / 005
Statistic Based Text Stylization / 014
Context-Aware Unsupervised Text Stylization / 053
Text Effects Transfer via Stylization & Destylization / 077
Text-Centric Image Style Transfer
STRUCT Group05
Image Style Transfer
 Image stylization
 Enrich content
 Intensify representation
 Convey emotion
 Application
 Social media
 Graphic design
Background
STRUCT Group06
Image Style Transfer
Background
 Taxonomy
Neural Style Transfer1
CNNMRF2, Neural Doodle3
Pix2pix-cGAN, cycleGAN
42017 CVPR Image-to-Image Translation with Conditional Adversarial Networks
32016 Arxiv Semantic style transfer and turning two-bit doodles into fine artworks
12016 CVPR. Image style transfer using convolutional neural networks
22016 CVPR Combining Markov random fields and convolutional neural networks for image synthesis
52017 ICCV Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
Method
Style
Transfer
Global
based
GAN
Local
based
STRUCT Group07
Image Style Transfer
Background
 Taxonomy
Neural Style Transfer1
Main idea:
Match the global feature distributions
between the style image and the target
image
• Gram Matrix (Coherence)
• Mean & Variance
• Whitening & Coloring
Style as global distribution
Method
Style
Transfer
Global
based
GAN
Local
based
STRUCT Group08
Image Style Transfer
Background
 Taxonomy
CNNMRF2, Neural Doodle3
Main idea:
match local patches / pixels
• Patch/pixel in image domain
• Image Analogy (SIGGRAPH01)
• Neural patches
• CNNMRF
• Hybrid: Deep image analogyStyle as local patches
Method
Style
Transfer
Global
based
GAN
Local
based
STRUCT Group09
Image Style Transfer
Background
 Taxonomy
Main idea:
Learn mapping between two domains
Instead of defining what style is,
Learn to classify the style
• Paired
• Pix2pix-cGAN
• Unpaired
• cycleGAN
Pix2pix-cGAN, cycleGAN
Style is learned by data
Method
Style
Transfer
Global
based
GAN
Local
based
STRUCT Group010 Background
Problem Analysis
 Motivation
 Manually design: Require skills & Take too much time
 Automatic text effects transfer
By Algorithms
(seconds)
By Manual work (hours)
Determine color styles, Warp textures to match text shapes
Adjust the transparency for visual plausibleness, etc.
STRUCT Group011
Guidance S Style S’ Target T Result T’
① Learn transformation
② Apply transformation
Image Style Transfer
 Supervised method
 Unsupervised method
Background
STRUCT Group012
Extract features to build mappings
Image Style Transfer
 Supervised method
 Unsupervised method
Style S’ Target T Result T’
Background
Outline
Background / 005
Statistic Based Text Stylization / 014
Context-Aware Unsupervised Text Stylization / 053
Text Effects Transfer via Stylization & Destylization / 077
Text-Centric Image Style Transfer
STRUCT Group014
 Supervised Text Stylization
Awesome Typography: Statistics-Based Text Effects Transfer
Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo, CVPR 2017
Statistics-Based Text Effects Transfer
STRUCT Group015
 Supervised Text Stylization
Awesome Typography: Statistics-Based Text Effects Transfer
Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo, CVPR 2017
Statistics-Based Text Effects Transfer
STRUCT Group016 Statistics-Based Text Effects Transfer
Problem: Text Effects Transfer
 Input: S, S', T
 Output: T '
Source text S Source effects S' Our result T'Target text T
Input Output
STRUCT Group017 Statistics-Based Text Effects Transfer
Problem Analysis
 Motivation
 Manually design: Require skills & Take too much time
 Automatic text effects transfer
By Algorithms
(seconds)
By Manual work (hours)
Determine color styles, Warp textures to match text shapes
Adjust the transparency for visual plausibleness, etc.
STRUCT Group018
Problem Analysis
 Challenges
 Extreme diversity of the text effects and characters
 Complicated composition of style elements
 Texture-by-number methods ✘
 Simpleness of guidance images
 Deep-based methods ✘
Complicated composition Plain guidance images texture-by-number
Image Analogies (2001)
deep-based
Neural Doodles (2016)
Our result
Statistics-Based Text Effects Transfer
STRUCT Group019
Problem Analysis
 Key Observation: Distance-based characteristics
 Textures with similar distances share similar patterns
 Distance: to text skeleton
 Pattern: Color and Scale
Different patch colors represent different distances to the text skeleton (white)
Statistics-Based Text Effects Transfer
STRUCT Group
 Texture Effects Statistics Calculation
 Distance
 Optimal patch scale
 Text Effects Generation
 Multiscale
 Texture distribution
 Naturalness
Relationship
020
Statistics-Based Texture Effects Transfer
Statistics-Based Text Effects Transfer
flame denim fabric lava neon
STRUCT Group
rank(q)
r(q)
021
Text Effects Statistics Estimation
 Distance Estimation
(a) Text image (b) Radius and distance (c) Text skeleton (d) Radius and its rank
(e) Normalized distance
Statistics-Based Text Effects Transfer
 Normalized distance calculation
STRUCT Group022
Text Effects Statistics Estimation
 Optimal Patch Scale Detection
 Fix patch size and resize image to realize multiple scales
 Determine from large scale to small scale
 Determination criterion:
Statistics-Based Text Effects Transfer
Text shape Texture
STRUCT Group023
Text Effects Statistics Estimation
 Relation Between Optimal Patch Scale and Distance
 Histogram
 Joint probability
 Posterior probability
(for l being the appropriate scale to depict the
patches with distances corresponding to bin(q))
Statistics-Based Text Effects Transfer
q q qp ~
STRUCT Group024
Text Effect Transfer
 Objective Function
Eapp Edist
Epsy
Appearance Term
Texture Style Transfer
Low-level
objective evaluation
Distribution Term
Spatial Style Transfer
Mid-level
objective evaluation
Psycho-Visual Term
Naturalness Preservation
subjective evaluation
Statistics-Based Text Effects Transfer
p: pixel in source images
q: pixel in target images
STRUCT Group025
 Appearance Term
Text Effect Transfer
Statistics-Based Text Effects Transfer
𝑃/𝑃′ : patch in 𝑆/S′ centered at p
𝑄/𝑄′: patch in 𝑇/𝑇′centered at q
Text effects term
Ensure texture transfer
Text shape term
Constrain text shape
STRUCT Group026
 Appearance Term
single-scale 5*5 single-scale 15*15 multi-scale 5*5
Weighted by posterior probability
Patches in different regions use different scales
Small patch:
 Lose structures
Large patch:
 Lose textures
Multi-scale:
 Preserve texture structures
 Preserve texture details
Text Effect Transfer
Statistics-Based Text Effects Transfer
STRUCT Group027
 Distribution Term
 Encourage the text effects of T' to share similar distribution of S'
 Used for initialization
no Edist with Edist with Edist + initialization
Text Effect Transfer
Statistics-Based Text Effects Transfer
STRUCT Group028
 Psycho-Visual Term
|Φ(q)|: Number of patches that find Q(q) as their matched patch
 Penalize texture over-repetitiveness
 Encourage new texture creation
Weights for Epsy
Text Effect Transfer
Statistics-Based Text Effects Transfer
STRUCT Group029
Text Effect Transfer
 Transfer for words
Stroke Term Estr
Control the texture similarity of similar strokes
Positive weight:
 stylize in a more consistent way
Negative weight:
 transfer more diverse textures
Positive
weight
Negative
weight
Statistics-Based Text Effects Transfer
STRUCT Group030 Statistics-Based Text Effects Transfer
Comparison with Other Methods
Target Text
STRUCT Group031 Statistics-Based Text Effects Transfer
Comparison with Other Methods
A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin.. “Image analogies”. SIGGRAPH. 2001.
Image Analogy
STRUCT Group032 Statistics-Based Text Effects Transfer
Comparison with Other Methods
O. Frigo, N. Sabater, J. Delon, and P. Hellier. Split and match: example-based adaptive patch sampling for unsupervised style transfer”. CVPR. 2016.
Split and Match
STRUCT Group033 Statistics-Based Text Effects Transfer
Comparison with Other Methods
A. J. Champandard. “Semantic style transfer and turning two-bit doodles into fine artworks”. Arxiv. 2016.
Neural Doodles
STRUCT Group034 Statistics-Based Text Effects Transfer
Comparison with Other Methods
Our method
STRUCT Group035
Comparison with Other Methods
(a) (S,S') (b) T (c) Image Analogies1 (d) Split & Match2 (e) Neural Doodle3 (f ) Baseline (g) Proposed
3A. J. Champandard. “Semantic style transfer and turning two-bit doodles into fine artworks”. Arxiv. 2016.
2O. Frigo, N. Sabater, J. Delon, and P. Hellier. Split and match: example-based adaptive patch sampling for unsupervised style transfer”. CVPR. 2016.
1A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin. “Image analogies”. SIGGRAPH. 2001.
Statistics-Based Text Effects Transfer
STRUCT Group036 Statistics-Based Text Effects Transfer
User Study
Score distribution
 91 participants, give 1-5 scores (5 means best)
 Averaged scores: 3.04, 1.54, 1.68, 3.95, 4.79
STRUCT Group037 Statistics-Based Text Effects Transfer
User Study
 91 participants, give 1-5 scores (5 means best)
 Highest score over all 8 test images
Score of each test image group
STRUCT Group038 Statistics-Based Text Effects Transfer
User Study
 91 participants, give 1-5 scores (5 means best)
 Clear advantages: highly regular texture effects
Score of each test image group
STRUCT Group039 Statistics-Based Text Effects Transfer
User Study
 91 participants, give 1-5 scores (5 means best)
 Less obvious advantages: simple effects, fail to cover all colors
Score of each test image group
STRUCT Group040 Statistics-Based Text Effects Transfer
Diversified Transfer
 Apply different text effects to representative characters
(Chinese, alphabetic, handwriting)
STRUCT Group046 Statistics-Based Text Effects Transfer
Typography Library Generation
STRUCT Group047 Statistics-Based Text Effects Transfer
Typography Library Generation
STRUCT Group048 Statistics-Based Text Effects Transfer
Typography Library Generation
STRUCT Group049 Statistics-Based Text Effects Transfer
Typography Library Generation
STRUCT Group050 Statistics-Based Text Effects Transfer
Stroke-Based Graphics Rendering
 Text  Icons
STRUCT Group051 Statistics-Based Text Effects Transfer
Stroke-Based Graphics Rendering
 Icons  Icons
 Icons  Text
Outline
Background / 005
Statistic Based Text Stylization / 014
Context-Aware Unsupervised Text Stylization / 053
Text Effects Transfer via Stylization & Destylization / 077
Text-Centric Image Style Transfer
STRUCT Group53
 Unsupervised Text Stylization
Context-Aware Unsupervised Text Stylization
Shuai Yang, Jiaying Liu, Wenhan Yang, Zongming Guo, ACM MM 2018 / TIP 2019
Context-Aware Unsupervised Text Stylization
STRUCT Group54 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Unsupervised Text Stylization
 Texture image S' without guidance S
 S' arbitrary texture images
 Broaden application scope
STRUCT Group55 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Context-Aware Layout Design
 Photo + text: poster, magazine cover, billboard
 Optimal text layout selection
 Seamlessly embedding
Application: posterBackground photo (T, S') photo + text
STRUCT Group56 Context-Aware Unsupervised Text Stylization
Context-Aware Unsupervised Text Stylization
 Unsupervised Text Stylization
 Structure transfer
 Texture transfer
 Context-Aware Layout Design
 Position estimation
 Text embedding
narrow the visual discrepancy
STRUCT Group57 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Guidance Map Extraction: Clustering
 Structure Transfer
 Texture Transfer
Guidance
Map
Extraction
𝑇
𝑆′ 𝑆
• Unify modality
• Establish mapping
STRUCT Group58 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Guidance Map Extraction
 Structure Transfer
 Texture Transfer
Output
𝑇
𝑆′ 𝑆
Guidance
Map
Extraction
Texture
Transfer
STRUCT Group59 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Guidance Map Extraction
 Structure Transfer: Minimize Structural Inconsistency
 Texture Transfer
Structure
Transfer Texture
Transfer
Output
෠𝑇𝑇
𝑆′ 𝑆
Guidance
Map
Extraction
STRUCT Group60 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Guidance Map Extraction
 Structure Transfer: Legibility-Preserving Structure Transfer
 Texture Transfer
1A. Rosenberger, D. Cohen-Or, and D. Lischinski. Layered shape synthesis: automatic generation of control maps for non-stationary textures. TOG. 2009.
LSS result
output
bicubic
Alg.
BW
T
STRUCT Group61 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Guidance Map Extraction
 Structure Transfer: Legibility-Preserving Structure Transfer
 Texture Transfer
1A. Rosenberger, D. Cohen-Or, and D. Lischinski. Layered shape synthesis: automatic generation of control maps for non-stationary textures. TOG. 2009.
T
LSS result
Our result
bicubic
Alg.
BW
output
STRUCT Group62 Context-Aware Unsupervised Text Stylization
Unsupervised Text Stylization
 Texture Transfer
 Objective Function
Low-level
Objective
Local texture
Mid-level
Objective
Text boundary
Naturalness
Subjective
Global distribution
1 2 3min ( , ) ( , ) ( , ) ( , )app dist psy sal
q
p
E p q E p q E p q E p q    
Eapp Edist Epsy Esal
Legibility
Subjective
Non-boundary part
Appearance
Term
Distribution
Term
Psycho-Visual
Term
Saliency
Term
STRUCT Group63 Context-Aware Unsupervised Text Stylization
Texture Transfer
外观项 Saliency Term
S’ Saliency map of S’
λ3=0.00 λ3=0.01 λ3=0.05
Internal area: salient textures
External area: indistinctive flat textures
Increase text legibility
STRUCT Group64 Context-Aware Unsupervised Text Stylization
Context-Aware Layout Design
 Position Estimation
 Cost function
Local variance
Seek flat region
Low saliency
Prevent occlusion
Texture consistency
Prevent seams
Uvar Usal Ucoh Uaes
Centrality of text
Prevent corners
ˆ arg min ( ) ( ) ( ) ( )var sal coh aes
R
p R
R U p U p U p U p

   
Variance
Cost
Saliency
Cost
Coherence
Cost
Aesthetics
Cost
STRUCT Group65 Context-Aware Unsupervised Text Stylization
Context-Aware Layout Design
 Position Estimation
 Cost function
Variance Cost
Local variance
Seek flat region
Saliency Cost
Low saliency
Prevent occlusion
Coherence Cost
Texture consistency
Prevent seams
Aesthetics Cost
Centrality of text
Prevent corners
Input T , S′ and I Total placement cost Position estimation result
STRUCT Group66 Context-Aware Unsupervised Text Stylization
Context-Aware Layout Design
 Text Embedding
 Image inpainting: fill Inpainting region with samples from S′
 Structure guidance of T + Contextual boundary constraint
Contextual region Inpainting region
Sampling region S′
STRUCT Group67 Context-Aware Unsupervised Text Stylization
Comparison with Other Methods
2A. J. Champandard. Semantic style transfer and turning two-bit doodles into fine artworks. Arxiv. 2016.
3S. Yang, J. Liu, Z. Lian, and Z. Guo. Awesome typography: statistics-based text effects transfer. CVPR. 2017.
1A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin. Image analogies. SIGGRAPH. 2001.
T and S′ Image Analogy1 Neural Doodle2 Text Effects Transfer3
Image Quilting4 Neural Style5 CNNMRF6 Our method
4A. Efros and W. T. Freeman. Image quilting for texture synthesis and transfer. TOG. 2001.
5L. A. Gatys, A. S. Ecker, and M. Bethge. Image style transfer using convolutional neural networks. CVPR. 2016
6C. Li and M. Wand. Combining Markov random fields and convolutional neural networks for image synthesis. CVPR. 2016
STRUCT Group68 Context-Aware Unsupervised Text Stylization
Comparison with Unsupervised Methods
 Structural Consistency
4A. Efros and W. T. Freeman. Image quilting for texture synthesis and transfer. TOG. 2001.
5L. A. Gatys, A. S. Ecker, and M. Bethge. Image style transfer using convolutional neural networks. CVPR. 2016
Image Quilting4 Neural Style5 Our method
S′ T Structure transfer result
STRUCT Group69 Context-Aware Unsupervised Text Stylization
Comparison with Supervised Methods
 Text Legibility
3S. Yang, J. Liu, Z. Lian, and Z. Guo. Awesome typography: statistics-based text effects transfer. CVPR. 2017.
1A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin. Image analogies. SIGGRAPH. 2001.
Our method
Image Analogy1
Text Effects Transfer3
T and S′
STRUCT Group70 Context-Aware Unsupervised Text Stylization
Different Fonts and Languages
Visual effects for text stylization on different languages.
Visual effects for text stylization on different fonts.
STRUCT Group71
Unsupervised Text Effects Synthesis
 Text effects synthesis results
Context-Aware Unsupervised Text Stylization
STRUCT Group72
Unsupervised Text Effects Synthesis
 Application: texture rendering for symbols
Context-Aware Unsupervised Text Stylization
STRUCT Group73
Graphical Arts: Text + Images
 Synthesize stylish text onto photos
 Follow image completion process
 Background images provide boundary information
 Texture synthesis under boundary constrains
Application:
Poster design
Background image (T, S') Text + Image
Context-Aware Unsupervised Text Stylization
STRUCT Group74
Graphical Arts: Text + Images
 Graphical arts design results (1/2)
 S' is sampled from background images
Context-Aware Unsupervised Text Stylization
STRUCT Group75
Graphical Arts: Text + Images
 Graphical arts design results (2/2)
 S' is not from background images: color transfer is used
Context-Aware Unsupervised Text Stylization
Outline
Background / 005
Statistic Based Text Stylization / 014
Context-Aware Unsupervised Text Stylization / 053
Text Effects Transfer via Stylization & Destylization / 077
Text-Centric Image Style Transfer
STRUCT Group77
 Deep Learning Based Text Stylization
Text Effects Transfer via Stylization and Destylization
Shuai Yang, Jiaying Liu, Wenjin Wang, Zongming Guo, AAAI 2019
Text Effects Transfer via Stylization & Destylization
STRUCT Group78
Problem: Text Stylization and Destylization
 Task1: Stylization for transferring the visual effects from
highly stylized text onto other glyphs
 Task2: Destylization for removing style features from text
Text Effects Transfer via Stylization & Destylization
STRUCT Group79
Problem Analysis
 Methods
 Text effects are highly structured along the glyph
Global-Based Local-Based GAN
Need no dataset
Preserve texture details
Need no dataset
Reconstruct vivid styles
Text effects cannot be simply
characterized as the mean,
variance, covariance and other
global statistics
Fail to characterize global
distribution
Need dataset for training
Deals only two domains or
Hard to extend to new style
pros
cons
e.g.
Text Effects Transfer via Stylization & Destylization
STRUCT Group80
Problem Analysis
• 64 different style
• 775 Chinesecharacters
• 52 English letters
• 10 Arabic numerals
• Featuredisentanglement
• Featurerecombination
• One-shot learning
METHODDATA APPLICATION
• Stylization
• Destylization
• Style creation
flame 字
combine
disentangledisentangle
Text Effects Transfer via Stylization & Destylization
STRUCT Group81
Data Overview
Text Effects Transfer via Stylization & Destylization
 Dataset
 64 different text effects
 775 Chinese characters, 52 English letters, 10 Arabic numerals
 64 * 837 ≈ 54K pairs
 Train : Test = 708 : 129
STRUCT Group
 Learn a two-way mapping between two visual domains X (text)
and Y (text effects)
82
Network Architecture
Text Effects Transfer via Stylization & Destylization
• Encoder:
Text content encoder
Style content encoder
Style encoder
• Decoder:
Content decoder
Style decoder
• Discriminator:
Content discriminator
Style discriminator
STRUCT Group83
Network Architecture
Text Effects Transfer via Stylization & Destylization
 Autoencoder
 Reconstruction loss
 Preserve the core information of
the glyph
STRUCT Group84
Network Architecture
Text Effects Transfer via Stylization & Destylization
 Destylization
 Feature loss
 Approach ground truth content feature
extracted by Ex
 Pixel loss
 Approach ground truth output
 Adversarial loss
 WGAN-GP
 Improve the quality
STRUCT Group85
Network Architecture
Text Effects Transfer via Stylization & Destylization
 Stylization
 Pixel loss
 Approach ground truth output
 Adversarial loss
 WGAN-GP
 Improve the quality
STRUCT Group86
One-shot fine-tuning (supervised)
Text Effects Transfer via Stylization & Destylization
 Extend to new style with only one example pair
 Random crop to generate multiple training pairs
 More precise texture details
(a) User-specified new text effects.
(b) Result on an unseen style
(c) Result after one-shot fine-tuning
STRUCT Group87
One-shot fine-tuning (unsupervised)
Text Effects Transfer via Stylization & Destylization
 Extend to new style with only one style image
 Use destylization network to generate auxiliary raw text
 Style autoencoder reconstruction loss
Input style Input glyph Auxiliary glyph
Destylization network
STRUCT Group88
Comparison
GAN-based Local-based Global-based
Text Effects Transfer via Stylization & Destylization
STRUCT Group89
Comparison
 One-shot supervised style transfer
Fine-tune Train from scratch
 Pretraining on our dataset successfully teaches our network the domain
knowledge of text effects synthesis
Text Effects Transfer via Stylization & Destylization
STRUCT Group90
Comparison
 One-shot unsupervised style transfer
Text Effects Transfer via Stylization & Destylization
STRUCT Group91
Other results
 Style interpolation
Text Effects Transfer via Stylization & Destylization
STRUCT Group92 Conclusion
Conclusion
Supervised
Patch-based Deep-based
Input
Method
Statistics-based Text Effect Transfer
• Distribution-based prior
• Uniform Objective function
Unsupervised Text Stylization
• Structure + texture transfer
• Combine text and images
TET-GAN
• Feature disentanglement
• Data collection
• One-shot supervised style transfer
• One-shot unsupervised style transfer
Unsupervised
style
glyph
style
glyph
STRUCT Group
liujiaying@pku.edu.cn
yangwenhan@pku.edu.cn

Weitere ähnliche Inhalte

Ähnlich wie Intelligent Image Enhancement and Restoration - From Prior Driven Model to Advanced Deep Learning Part 2: text centric image style transfer

Syntactic Mediation in Grid and Web Service Architectures
Syntactic Mediation in Grid and Web Service ArchitecturesSyntactic Mediation in Grid and Web Service Architectures
Syntactic Mediation in Grid and Web Service Architectures
Martin Szomszor
 
V for visualization: VIATRA finally goes graphical thanks to Sirius!
V for visualization: VIATRA finally goes graphical thanks to Sirius!V for visualization: VIATRA finally goes graphical thanks to Sirius!
V for visualization: VIATRA finally goes graphical thanks to Sirius!
Ákos Horváth
 
Automated Syntactic Mediation for Web Service Integration
Automated Syntactic Mediation for Web Service IntegrationAutomated Syntactic Mediation for Web Service Integration
Automated Syntactic Mediation for Web Service Integration
Martin Szomszor
 

Ähnlich wie Intelligent Image Enhancement and Restoration - From Prior Driven Model to Advanced Deep Learning Part 2: text centric image style transfer (20)

IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNINGIMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
 
Zeroshot multimodal named entity disambiguation for noisy social media posts
Zeroshot multimodal named entity disambiguation for noisy social media postsZeroshot multimodal named entity disambiguation for noisy social media posts
Zeroshot multimodal named entity disambiguation for noisy social media posts
 
Syntactic Mediation in Grid and Web Service Architectures
Syntactic Mediation in Grid and Web Service ArchitecturesSyntactic Mediation in Grid and Web Service Architectures
Syntactic Mediation in Grid and Web Service Architectures
 
Validation of pervasive cloud task migration with colored petri net
Validation of pervasive cloud task migration with colored petri netValidation of pervasive cloud task migration with colored petri net
Validation of pervasive cloud task migration with colored petri net
 
V for visualization: VIATRA finally goes graphical thanks to Sirius!
V for visualization: VIATRA finally goes graphical thanks to Sirius!V for visualization: VIATRA finally goes graphical thanks to Sirius!
V for visualization: VIATRA finally goes graphical thanks to Sirius!
 
Lec10 matching
Lec10 matchingLec10 matching
Lec10 matching
 
Automated Syntactic Mediation for Web Service Integration
Automated Syntactic Mediation for Web Service IntegrationAutomated Syntactic Mediation for Web Service Integration
Automated Syntactic Mediation for Web Service Integration
 
Volumetric Medical Images Lossy Compression using Stationary Wavelet Transfor...
Volumetric Medical Images Lossy Compression using Stationary Wavelet Transfor...Volumetric Medical Images Lossy Compression using Stationary Wavelet Transfor...
Volumetric Medical Images Lossy Compression using Stationary Wavelet Transfor...
 
How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...
 
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformText Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
 
IRJET- On-Screen Translator using NLP and Text Detection
IRJET- On-Screen Translator using NLP and Text DetectionIRJET- On-Screen Translator using NLP and Text Detection
IRJET- On-Screen Translator using NLP and Text Detection
 
geolocation twitter network text geotagging
geolocation twitter network text geotagginggeolocation twitter network text geotagging
geolocation twitter network text geotagging
 
Handwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with AudioHandwritten Text Recognition and Translation with Audio
Handwritten Text Recognition and Translation with Audio
 
IRJET- Automatic Text Summarization using Text Rank
IRJET- Automatic Text Summarization using Text RankIRJET- Automatic Text Summarization using Text Rank
IRJET- Automatic Text Summarization using Text Rank
 
evonlp_slides.pdf
evonlp_slides.pdfevonlp_slides.pdf
evonlp_slides.pdf
 
A Visual Exploration of Distance, Documents, and Distributions
A Visual Exploration of Distance, Documents, and DistributionsA Visual Exploration of Distance, Documents, and Distributions
A Visual Exploration of Distance, Documents, and Distributions
 
Words in Space - Rebecca Bilbro
Words in Space - Rebecca BilbroWords in Space - Rebecca Bilbro
Words in Space - Rebecca Bilbro
 
A beginner's guide to Style Transfer and recent trends
A beginner's guide to Style Transfer and recent trendsA beginner's guide to Style Transfer and recent trends
A beginner's guide to Style Transfer and recent trends
 
One dimensional vector based pattern
One dimensional vector based patternOne dimensional vector based pattern
One dimensional vector based pattern
 
Estimating Species Divergence Times in RevBayes – iEvoBio 2014
Estimating Species Divergence Times in RevBayes – iEvoBio 2014Estimating Species Divergence Times in RevBayes – iEvoBio 2014
Estimating Species Divergence Times in RevBayes – iEvoBio 2014
 

Mehr von Wanjin Yu

Mehr von Wanjin Yu (16)

Architecture Design for Deep Neural Networks III
Architecture Design for Deep Neural Networks IIIArchitecture Design for Deep Neural Networks III
Architecture Design for Deep Neural Networks III
 
Intelligent Multimedia Recommendation
Intelligent Multimedia RecommendationIntelligent Multimedia Recommendation
Intelligent Multimedia Recommendation
 
Architecture Design for Deep Neural Networks II
Architecture Design for Deep Neural Networks IIArchitecture Design for Deep Neural Networks II
Architecture Design for Deep Neural Networks II
 
Architecture Design for Deep Neural Networks I
Architecture Design for Deep Neural Networks IArchitecture Design for Deep Neural Networks I
Architecture Design for Deep Neural Networks I
 
Causally regularized machine learning
Causally regularized machine learningCausally regularized machine learning
Causally regularized machine learning
 
Computer vision for transportation
Computer vision for transportationComputer vision for transportation
Computer vision for transportation
 
Object Detection Beyond Mask R-CNN and RetinaNet III
Object Detection Beyond Mask R-CNN and RetinaNet IIIObject Detection Beyond Mask R-CNN and RetinaNet III
Object Detection Beyond Mask R-CNN and RetinaNet III
 
Object Detection Beyond Mask R-CNN and RetinaNet II
Object Detection Beyond Mask R-CNN and RetinaNet IIObject Detection Beyond Mask R-CNN and RetinaNet II
Object Detection Beyond Mask R-CNN and RetinaNet II
 
Object Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet IObject Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet I
 
Visual Search and Question Answering II
Visual Search and Question Answering IIVisual Search and Question Answering II
Visual Search and Question Answering II
 
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
 
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
 
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
 
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
 
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
 
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
 

Kürzlich hochgeladen

AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
ellan12
 
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
Diya Sharma
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
@Chandigarh #call #Girls 9053900678 @Call #Girls in @Punjab 9053900678
 

Kürzlich hochgeladen (20)

AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
 
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
 
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
 
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
 
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
 
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
 
Russian Call Girls Pune (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...
Russian Call Girls Pune  (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...Russian Call Girls Pune  (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...
Russian Call Girls Pune (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...
 
Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
 
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
 
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
 

Intelligent Image Enhancement and Restoration - From Prior Driven Model to Advanced Deep Learning Part 2: text centric image style transfer

  • 1. IEEE ICME-2019 8:30 – 12:00, July. 08 Shanghai, China Intelligent Image/Video Editing Tutorial
  • 2. STRUCT Group Jiaying Liu Wenhan Yang Text-Centric Image Style Transfer Part 2
  • 3. Outline Background / 005 Statistic Based Text Stylization / 014 Context-Aware Unsupervised Text Stylization / 053 Text Effects Transfer via Stylization & Destylization / 077 Text-Centric Image Style Transfer
  • 4. Outline Background / 005 Statistic Based Text Stylization / 014 Context-Aware Unsupervised Text Stylization / 053 Text Effects Transfer via Stylization & Destylization / 077 Text-Centric Image Style Transfer
  • 5. STRUCT Group05 Image Style Transfer  Image stylization  Enrich content  Intensify representation  Convey emotion  Application  Social media  Graphic design Background
  • 6. STRUCT Group06 Image Style Transfer Background  Taxonomy Neural Style Transfer1 CNNMRF2, Neural Doodle3 Pix2pix-cGAN, cycleGAN 42017 CVPR Image-to-Image Translation with Conditional Adversarial Networks 32016 Arxiv Semantic style transfer and turning two-bit doodles into fine artworks 12016 CVPR. Image style transfer using convolutional neural networks 22016 CVPR Combining Markov random fields and convolutional neural networks for image synthesis 52017 ICCV Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks Method Style Transfer Global based GAN Local based
  • 7. STRUCT Group07 Image Style Transfer Background  Taxonomy Neural Style Transfer1 Main idea: Match the global feature distributions between the style image and the target image • Gram Matrix (Coherence) • Mean & Variance • Whitening & Coloring Style as global distribution Method Style Transfer Global based GAN Local based
  • 8. STRUCT Group08 Image Style Transfer Background  Taxonomy CNNMRF2, Neural Doodle3 Main idea: match local patches / pixels • Patch/pixel in image domain • Image Analogy (SIGGRAPH01) • Neural patches • CNNMRF • Hybrid: Deep image analogyStyle as local patches Method Style Transfer Global based GAN Local based
  • 9. STRUCT Group09 Image Style Transfer Background  Taxonomy Main idea: Learn mapping between two domains Instead of defining what style is, Learn to classify the style • Paired • Pix2pix-cGAN • Unpaired • cycleGAN Pix2pix-cGAN, cycleGAN Style is learned by data Method Style Transfer Global based GAN Local based
  • 10. STRUCT Group010 Background Problem Analysis  Motivation  Manually design: Require skills & Take too much time  Automatic text effects transfer By Algorithms (seconds) By Manual work (hours) Determine color styles, Warp textures to match text shapes Adjust the transparency for visual plausibleness, etc.
  • 11. STRUCT Group011 Guidance S Style S’ Target T Result T’ ① Learn transformation ② Apply transformation Image Style Transfer  Supervised method  Unsupervised method Background
  • 12. STRUCT Group012 Extract features to build mappings Image Style Transfer  Supervised method  Unsupervised method Style S’ Target T Result T’ Background
  • 13. Outline Background / 005 Statistic Based Text Stylization / 014 Context-Aware Unsupervised Text Stylization / 053 Text Effects Transfer via Stylization & Destylization / 077 Text-Centric Image Style Transfer
  • 14. STRUCT Group014  Supervised Text Stylization Awesome Typography: Statistics-Based Text Effects Transfer Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo, CVPR 2017 Statistics-Based Text Effects Transfer
  • 15. STRUCT Group015  Supervised Text Stylization Awesome Typography: Statistics-Based Text Effects Transfer Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo, CVPR 2017 Statistics-Based Text Effects Transfer
  • 16. STRUCT Group016 Statistics-Based Text Effects Transfer Problem: Text Effects Transfer  Input: S, S', T  Output: T ' Source text S Source effects S' Our result T'Target text T Input Output
  • 17. STRUCT Group017 Statistics-Based Text Effects Transfer Problem Analysis  Motivation  Manually design: Require skills & Take too much time  Automatic text effects transfer By Algorithms (seconds) By Manual work (hours) Determine color styles, Warp textures to match text shapes Adjust the transparency for visual plausibleness, etc.
  • 18. STRUCT Group018 Problem Analysis  Challenges  Extreme diversity of the text effects and characters  Complicated composition of style elements  Texture-by-number methods ✘  Simpleness of guidance images  Deep-based methods ✘ Complicated composition Plain guidance images texture-by-number Image Analogies (2001) deep-based Neural Doodles (2016) Our result Statistics-Based Text Effects Transfer
  • 19. STRUCT Group019 Problem Analysis  Key Observation: Distance-based characteristics  Textures with similar distances share similar patterns  Distance: to text skeleton  Pattern: Color and Scale Different patch colors represent different distances to the text skeleton (white) Statistics-Based Text Effects Transfer
  • 20. STRUCT Group  Texture Effects Statistics Calculation  Distance  Optimal patch scale  Text Effects Generation  Multiscale  Texture distribution  Naturalness Relationship 020 Statistics-Based Texture Effects Transfer Statistics-Based Text Effects Transfer flame denim fabric lava neon
  • 21. STRUCT Group rank(q) r(q) 021 Text Effects Statistics Estimation  Distance Estimation (a) Text image (b) Radius and distance (c) Text skeleton (d) Radius and its rank (e) Normalized distance Statistics-Based Text Effects Transfer  Normalized distance calculation
  • 22. STRUCT Group022 Text Effects Statistics Estimation  Optimal Patch Scale Detection  Fix patch size and resize image to realize multiple scales  Determine from large scale to small scale  Determination criterion: Statistics-Based Text Effects Transfer Text shape Texture
  • 23. STRUCT Group023 Text Effects Statistics Estimation  Relation Between Optimal Patch Scale and Distance  Histogram  Joint probability  Posterior probability (for l being the appropriate scale to depict the patches with distances corresponding to bin(q)) Statistics-Based Text Effects Transfer q q qp ~
  • 24. STRUCT Group024 Text Effect Transfer  Objective Function Eapp Edist Epsy Appearance Term Texture Style Transfer Low-level objective evaluation Distribution Term Spatial Style Transfer Mid-level objective evaluation Psycho-Visual Term Naturalness Preservation subjective evaluation Statistics-Based Text Effects Transfer p: pixel in source images q: pixel in target images
  • 25. STRUCT Group025  Appearance Term Text Effect Transfer Statistics-Based Text Effects Transfer 𝑃/𝑃′ : patch in 𝑆/S′ centered at p 𝑄/𝑄′: patch in 𝑇/𝑇′centered at q Text effects term Ensure texture transfer Text shape term Constrain text shape
  • 26. STRUCT Group026  Appearance Term single-scale 5*5 single-scale 15*15 multi-scale 5*5 Weighted by posterior probability Patches in different regions use different scales Small patch:  Lose structures Large patch:  Lose textures Multi-scale:  Preserve texture structures  Preserve texture details Text Effect Transfer Statistics-Based Text Effects Transfer
  • 27. STRUCT Group027  Distribution Term  Encourage the text effects of T' to share similar distribution of S'  Used for initialization no Edist with Edist with Edist + initialization Text Effect Transfer Statistics-Based Text Effects Transfer
  • 28. STRUCT Group028  Psycho-Visual Term |Φ(q)|: Number of patches that find Q(q) as their matched patch  Penalize texture over-repetitiveness  Encourage new texture creation Weights for Epsy Text Effect Transfer Statistics-Based Text Effects Transfer
  • 29. STRUCT Group029 Text Effect Transfer  Transfer for words Stroke Term Estr Control the texture similarity of similar strokes Positive weight:  stylize in a more consistent way Negative weight:  transfer more diverse textures Positive weight Negative weight Statistics-Based Text Effects Transfer
  • 30. STRUCT Group030 Statistics-Based Text Effects Transfer Comparison with Other Methods Target Text
  • 31. STRUCT Group031 Statistics-Based Text Effects Transfer Comparison with Other Methods A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin.. “Image analogies”. SIGGRAPH. 2001. Image Analogy
  • 32. STRUCT Group032 Statistics-Based Text Effects Transfer Comparison with Other Methods O. Frigo, N. Sabater, J. Delon, and P. Hellier. Split and match: example-based adaptive patch sampling for unsupervised style transfer”. CVPR. 2016. Split and Match
  • 33. STRUCT Group033 Statistics-Based Text Effects Transfer Comparison with Other Methods A. J. Champandard. “Semantic style transfer and turning two-bit doodles into fine artworks”. Arxiv. 2016. Neural Doodles
  • 34. STRUCT Group034 Statistics-Based Text Effects Transfer Comparison with Other Methods Our method
  • 35. STRUCT Group035 Comparison with Other Methods (a) (S,S') (b) T (c) Image Analogies1 (d) Split & Match2 (e) Neural Doodle3 (f ) Baseline (g) Proposed 3A. J. Champandard. “Semantic style transfer and turning two-bit doodles into fine artworks”. Arxiv. 2016. 2O. Frigo, N. Sabater, J. Delon, and P. Hellier. Split and match: example-based adaptive patch sampling for unsupervised style transfer”. CVPR. 2016. 1A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin. “Image analogies”. SIGGRAPH. 2001. Statistics-Based Text Effects Transfer
  • 36. STRUCT Group036 Statistics-Based Text Effects Transfer User Study Score distribution  91 participants, give 1-5 scores (5 means best)  Averaged scores: 3.04, 1.54, 1.68, 3.95, 4.79
  • 37. STRUCT Group037 Statistics-Based Text Effects Transfer User Study  91 participants, give 1-5 scores (5 means best)  Highest score over all 8 test images Score of each test image group
  • 38. STRUCT Group038 Statistics-Based Text Effects Transfer User Study  91 participants, give 1-5 scores (5 means best)  Clear advantages: highly regular texture effects Score of each test image group
  • 39. STRUCT Group039 Statistics-Based Text Effects Transfer User Study  91 participants, give 1-5 scores (5 means best)  Less obvious advantages: simple effects, fail to cover all colors Score of each test image group
  • 40. STRUCT Group040 Statistics-Based Text Effects Transfer Diversified Transfer  Apply different text effects to representative characters (Chinese, alphabetic, handwriting)
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46. STRUCT Group046 Statistics-Based Text Effects Transfer Typography Library Generation
  • 47. STRUCT Group047 Statistics-Based Text Effects Transfer Typography Library Generation
  • 48. STRUCT Group048 Statistics-Based Text Effects Transfer Typography Library Generation
  • 49. STRUCT Group049 Statistics-Based Text Effects Transfer Typography Library Generation
  • 50. STRUCT Group050 Statistics-Based Text Effects Transfer Stroke-Based Graphics Rendering  Text  Icons
  • 51. STRUCT Group051 Statistics-Based Text Effects Transfer Stroke-Based Graphics Rendering  Icons  Icons  Icons  Text
  • 52. Outline Background / 005 Statistic Based Text Stylization / 014 Context-Aware Unsupervised Text Stylization / 053 Text Effects Transfer via Stylization & Destylization / 077 Text-Centric Image Style Transfer
  • 53. STRUCT Group53  Unsupervised Text Stylization Context-Aware Unsupervised Text Stylization Shuai Yang, Jiaying Liu, Wenhan Yang, Zongming Guo, ACM MM 2018 / TIP 2019 Context-Aware Unsupervised Text Stylization
  • 54. STRUCT Group54 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Unsupervised Text Stylization  Texture image S' without guidance S  S' arbitrary texture images  Broaden application scope
  • 55. STRUCT Group55 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Context-Aware Layout Design  Photo + text: poster, magazine cover, billboard  Optimal text layout selection  Seamlessly embedding Application: posterBackground photo (T, S') photo + text
  • 56. STRUCT Group56 Context-Aware Unsupervised Text Stylization Context-Aware Unsupervised Text Stylization  Unsupervised Text Stylization  Structure transfer  Texture transfer  Context-Aware Layout Design  Position estimation  Text embedding narrow the visual discrepancy
  • 57. STRUCT Group57 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Guidance Map Extraction: Clustering  Structure Transfer  Texture Transfer Guidance Map Extraction 𝑇 𝑆′ 𝑆 • Unify modality • Establish mapping
  • 58. STRUCT Group58 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Guidance Map Extraction  Structure Transfer  Texture Transfer Output 𝑇 𝑆′ 𝑆 Guidance Map Extraction Texture Transfer
  • 59. STRUCT Group59 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Guidance Map Extraction  Structure Transfer: Minimize Structural Inconsistency  Texture Transfer Structure Transfer Texture Transfer Output ෠𝑇𝑇 𝑆′ 𝑆 Guidance Map Extraction
  • 60. STRUCT Group60 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Guidance Map Extraction  Structure Transfer: Legibility-Preserving Structure Transfer  Texture Transfer 1A. Rosenberger, D. Cohen-Or, and D. Lischinski. Layered shape synthesis: automatic generation of control maps for non-stationary textures. TOG. 2009. LSS result output bicubic Alg. BW T
  • 61. STRUCT Group61 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Guidance Map Extraction  Structure Transfer: Legibility-Preserving Structure Transfer  Texture Transfer 1A. Rosenberger, D. Cohen-Or, and D. Lischinski. Layered shape synthesis: automatic generation of control maps for non-stationary textures. TOG. 2009. T LSS result Our result bicubic Alg. BW output
  • 62. STRUCT Group62 Context-Aware Unsupervised Text Stylization Unsupervised Text Stylization  Texture Transfer  Objective Function Low-level Objective Local texture Mid-level Objective Text boundary Naturalness Subjective Global distribution 1 2 3min ( , ) ( , ) ( , ) ( , )app dist psy sal q p E p q E p q E p q E p q     Eapp Edist Epsy Esal Legibility Subjective Non-boundary part Appearance Term Distribution Term Psycho-Visual Term Saliency Term
  • 63. STRUCT Group63 Context-Aware Unsupervised Text Stylization Texture Transfer 外观项 Saliency Term S’ Saliency map of S’ λ3=0.00 λ3=0.01 λ3=0.05 Internal area: salient textures External area: indistinctive flat textures Increase text legibility
  • 64. STRUCT Group64 Context-Aware Unsupervised Text Stylization Context-Aware Layout Design  Position Estimation  Cost function Local variance Seek flat region Low saliency Prevent occlusion Texture consistency Prevent seams Uvar Usal Ucoh Uaes Centrality of text Prevent corners ˆ arg min ( ) ( ) ( ) ( )var sal coh aes R p R R U p U p U p U p      Variance Cost Saliency Cost Coherence Cost Aesthetics Cost
  • 65. STRUCT Group65 Context-Aware Unsupervised Text Stylization Context-Aware Layout Design  Position Estimation  Cost function Variance Cost Local variance Seek flat region Saliency Cost Low saliency Prevent occlusion Coherence Cost Texture consistency Prevent seams Aesthetics Cost Centrality of text Prevent corners Input T , S′ and I Total placement cost Position estimation result
  • 66. STRUCT Group66 Context-Aware Unsupervised Text Stylization Context-Aware Layout Design  Text Embedding  Image inpainting: fill Inpainting region with samples from S′  Structure guidance of T + Contextual boundary constraint Contextual region Inpainting region Sampling region S′
  • 67. STRUCT Group67 Context-Aware Unsupervised Text Stylization Comparison with Other Methods 2A. J. Champandard. Semantic style transfer and turning two-bit doodles into fine artworks. Arxiv. 2016. 3S. Yang, J. Liu, Z. Lian, and Z. Guo. Awesome typography: statistics-based text effects transfer. CVPR. 2017. 1A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin. Image analogies. SIGGRAPH. 2001. T and S′ Image Analogy1 Neural Doodle2 Text Effects Transfer3 Image Quilting4 Neural Style5 CNNMRF6 Our method 4A. Efros and W. T. Freeman. Image quilting for texture synthesis and transfer. TOG. 2001. 5L. A. Gatys, A. S. Ecker, and M. Bethge. Image style transfer using convolutional neural networks. CVPR. 2016 6C. Li and M. Wand. Combining Markov random fields and convolutional neural networks for image synthesis. CVPR. 2016
  • 68. STRUCT Group68 Context-Aware Unsupervised Text Stylization Comparison with Unsupervised Methods  Structural Consistency 4A. Efros and W. T. Freeman. Image quilting for texture synthesis and transfer. TOG. 2001. 5L. A. Gatys, A. S. Ecker, and M. Bethge. Image style transfer using convolutional neural networks. CVPR. 2016 Image Quilting4 Neural Style5 Our method S′ T Structure transfer result
  • 69. STRUCT Group69 Context-Aware Unsupervised Text Stylization Comparison with Supervised Methods  Text Legibility 3S. Yang, J. Liu, Z. Lian, and Z. Guo. Awesome typography: statistics-based text effects transfer. CVPR. 2017. 1A. Hertzmann, C. E. Jacobs, N. Oliver, B. Curless, and D. H. Salesin. Image analogies. SIGGRAPH. 2001. Our method Image Analogy1 Text Effects Transfer3 T and S′
  • 70. STRUCT Group70 Context-Aware Unsupervised Text Stylization Different Fonts and Languages Visual effects for text stylization on different languages. Visual effects for text stylization on different fonts.
  • 71. STRUCT Group71 Unsupervised Text Effects Synthesis  Text effects synthesis results Context-Aware Unsupervised Text Stylization
  • 72. STRUCT Group72 Unsupervised Text Effects Synthesis  Application: texture rendering for symbols Context-Aware Unsupervised Text Stylization
  • 73. STRUCT Group73 Graphical Arts: Text + Images  Synthesize stylish text onto photos  Follow image completion process  Background images provide boundary information  Texture synthesis under boundary constrains Application: Poster design Background image (T, S') Text + Image Context-Aware Unsupervised Text Stylization
  • 74. STRUCT Group74 Graphical Arts: Text + Images  Graphical arts design results (1/2)  S' is sampled from background images Context-Aware Unsupervised Text Stylization
  • 75. STRUCT Group75 Graphical Arts: Text + Images  Graphical arts design results (2/2)  S' is not from background images: color transfer is used Context-Aware Unsupervised Text Stylization
  • 76. Outline Background / 005 Statistic Based Text Stylization / 014 Context-Aware Unsupervised Text Stylization / 053 Text Effects Transfer via Stylization & Destylization / 077 Text-Centric Image Style Transfer
  • 77. STRUCT Group77  Deep Learning Based Text Stylization Text Effects Transfer via Stylization and Destylization Shuai Yang, Jiaying Liu, Wenjin Wang, Zongming Guo, AAAI 2019 Text Effects Transfer via Stylization & Destylization
  • 78. STRUCT Group78 Problem: Text Stylization and Destylization  Task1: Stylization for transferring the visual effects from highly stylized text onto other glyphs  Task2: Destylization for removing style features from text Text Effects Transfer via Stylization & Destylization
  • 79. STRUCT Group79 Problem Analysis  Methods  Text effects are highly structured along the glyph Global-Based Local-Based GAN Need no dataset Preserve texture details Need no dataset Reconstruct vivid styles Text effects cannot be simply characterized as the mean, variance, covariance and other global statistics Fail to characterize global distribution Need dataset for training Deals only two domains or Hard to extend to new style pros cons e.g. Text Effects Transfer via Stylization & Destylization
  • 80. STRUCT Group80 Problem Analysis • 64 different style • 775 Chinesecharacters • 52 English letters • 10 Arabic numerals • Featuredisentanglement • Featurerecombination • One-shot learning METHODDATA APPLICATION • Stylization • Destylization • Style creation flame 字 combine disentangledisentangle Text Effects Transfer via Stylization & Destylization
  • 81. STRUCT Group81 Data Overview Text Effects Transfer via Stylization & Destylization  Dataset  64 different text effects  775 Chinese characters, 52 English letters, 10 Arabic numerals  64 * 837 ≈ 54K pairs  Train : Test = 708 : 129
  • 82. STRUCT Group  Learn a two-way mapping between two visual domains X (text) and Y (text effects) 82 Network Architecture Text Effects Transfer via Stylization & Destylization • Encoder: Text content encoder Style content encoder Style encoder • Decoder: Content decoder Style decoder • Discriminator: Content discriminator Style discriminator
  • 83. STRUCT Group83 Network Architecture Text Effects Transfer via Stylization & Destylization  Autoencoder  Reconstruction loss  Preserve the core information of the glyph
  • 84. STRUCT Group84 Network Architecture Text Effects Transfer via Stylization & Destylization  Destylization  Feature loss  Approach ground truth content feature extracted by Ex  Pixel loss  Approach ground truth output  Adversarial loss  WGAN-GP  Improve the quality
  • 85. STRUCT Group85 Network Architecture Text Effects Transfer via Stylization & Destylization  Stylization  Pixel loss  Approach ground truth output  Adversarial loss  WGAN-GP  Improve the quality
  • 86. STRUCT Group86 One-shot fine-tuning (supervised) Text Effects Transfer via Stylization & Destylization  Extend to new style with only one example pair  Random crop to generate multiple training pairs  More precise texture details (a) User-specified new text effects. (b) Result on an unseen style (c) Result after one-shot fine-tuning
  • 87. STRUCT Group87 One-shot fine-tuning (unsupervised) Text Effects Transfer via Stylization & Destylization  Extend to new style with only one style image  Use destylization network to generate auxiliary raw text  Style autoencoder reconstruction loss Input style Input glyph Auxiliary glyph Destylization network
  • 88. STRUCT Group88 Comparison GAN-based Local-based Global-based Text Effects Transfer via Stylization & Destylization
  • 89. STRUCT Group89 Comparison  One-shot supervised style transfer Fine-tune Train from scratch  Pretraining on our dataset successfully teaches our network the domain knowledge of text effects synthesis Text Effects Transfer via Stylization & Destylization
  • 90. STRUCT Group90 Comparison  One-shot unsupervised style transfer Text Effects Transfer via Stylization & Destylization
  • 91. STRUCT Group91 Other results  Style interpolation Text Effects Transfer via Stylization & Destylization
  • 92. STRUCT Group92 Conclusion Conclusion Supervised Patch-based Deep-based Input Method Statistics-based Text Effect Transfer • Distribution-based prior • Uniform Objective function Unsupervised Text Stylization • Structure + texture transfer • Combine text and images TET-GAN • Feature disentanglement • Data collection • One-shot supervised style transfer • One-shot unsupervised style transfer Unsupervised style glyph style glyph