Slides by Víctor Garcia about the paper:
Reed, Scott, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, and Honglak Lee. "Generative adversarial text to image synthesis." ICML 2016.
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Generative adversarial text to image synthesis
1. Generative Adversarial Text to Image
Synthesis
Scott Reed, Zeynep Akata, Xinchen
Yan, Lajanugen Logeswaran
[GitHub] [Arxiv]
Slides by Víctor Garcia [GDoc]
Computer Vision Reading Group (30/09/2016)
2. Index
● Introduction
● State of the Art
● Method
○ Network Architecture
○ Losses
● Experiments
○ Qualitative Results
○ Sentence interpolation
○ Style Transfer
● Conclusions
4. Index
● Introduction
● State of the Art
● Method
○ Network Architecture
○ Losses
● Experiments
○ Qualitative Results
○ Sentence interpolation
○ Style Transfer
● Conclusions
16. Text Embeddding
In order to represent the text in a vector...
MIN
WHERE
This is the recurrent text encoder
17. Index
● Introduction
● State of the Art
● Method
○ Network Architecture
○ Losses
● Experiments
○ Qualitative Results
○ Sentence interpolation
○ Style Transfer
● Conclusions
19. Losses - CLS
log(D(x,t)) log(1-D(G(z,t)))
True Image
+
True Text
Fake Image
+
True Text
Real Images match
the text content?
20. Losses - CLS
log(D(x,t)) log(1-D(G(z,t))) log(1-D(G(zi,tk)))
True Image
+
True Text
Fake Image
+
True Text
True Image (i)
+
True Text (j)
Unmatched
21. Losses - INT
They train interpolating between different text embedding vector (t1~t2).
So the generator learns to fill GAPS on the data manifold.
22. Index
● Introduction
● State of the Art
● Method
○ Network Architecture
○ Losses
● Experiments
○ Qualitative Results
○ Sentence interpolation
○ Style Transfer
● Conclusions
25. Disentangling style and content
Generator.
z
+
Text
If ‘text’ is describing the content? What is ‘z’ describing?
26. Disentangling style and content
Generator.
z
+
Text
If ‘text’ is describing the content? What is ‘z’ describing?
Style → Pose, Background…, let’s extract ‘z’