Smart Reply: Learning a Model of Conversation from Data: Smart Reply is a text assistance feature that was recently introduced to Inbox by Gmail. Given an incoming email message, the Smartreply system analyzes its contents and suggests complete responses that the recipient can send with just one tap. This talk will cover how we built Smartreply using a combination of deep learning and semantic clustering, as well as what we learned along the way and why we think it shows promise for the future of dialogue models.
7. Why is this task hard?
● extracting meaning from previous message
● generating language
● grammatical transformations between call and response
● matching style/tone
8. Why is this solution interesting?
● Model is learned fully from data
48. Training
● Training data is a corpus of email-reply pairs
● Both encoder and decoder are trained together (end-to-end)
49. Training
● Training data is a corpus of email-reply pairs
● Both encoder and decoder are trained together (end-to-end)
50. Confidential + Proprietary
Key points about model
● Everything is learned from data, even features
● Neural network smooths across language variation
54. Quality
● How do we ensure that the response options are always high quality in content
and language?
○ Avoid incorrect grammar and mechanics, misspellings e.g., your the best
○ Avoid inappropriate, offensive responses. e.g., Leave me alone.
○ Deal with wide variability, informal language. e.g., got it thx
● Restricting model vocabulary is not sufficient!
Solution: Restrict to a fixed set of valid responses, derived automatically from data.
59. Confidential + Proprietary
What the model can't do
● Match every user's tone and style
● Ensure diverse options
● Access and update any kind of state or knowledge base
61. Conclusions
● Sequence-to-sequence produces plausible email replies in many common
scenarios, when trained on an email corpus
● Smart Reply is deployed in Inbox by Gmail and generates more than 10% of
mobile replies
62. Confidential + Proprietary
Conclusions
● A conversation model learned entirely from data is very powerful
● A data-driven approach can be complementary to hand-crafted rules and
scenarios
63. Confidential + Proprietary
Collaborators
- Greg Corrado, Oriol Vinyals (Google Brain)
- Balint Miklos, Tobias Kaufman, Laszlo Lukacs, and Karol Kurach (GMail)
- Sujith Ravi (Google Research)