1. Proprietary and confidential. Do not distribute.
Nervana’s Deep Learning Platform
MAKING MACHINES SMARTER.™
Hanlin Tang, PhD
Algorithms Engineer
2. Facebook DeepMask
Silver et al., 2016
The Atlantic, March 2016
“The error rate has been cut by a factor of
two in all the languages, more than a factor of
two in many cases. That’s mostly due to deep
learning and the way we have optimized it …”
Alex Acero, Siri Senior Director, Apple
Article in Backchannel/WIRED, Aug 2016
Deep Learning
4. • Unprecedented computing power
• 10x speedup over current Maxwell
GPUs (~55 TeraOps)
• 32 GB High-Bandwidth Memory
• Six bi-directional high-bandwidth links
for 3D torus interconnect
• 8 chips in a box, seamlessly scale to
multiple chassis
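To illustrate why six bi-directional links fit a 3D torus: each chip has one neighbor in each direction along three axes, and the edges wrap around. The following is a minimal sketch only. The function name `torus_neighbors` and the 4x4x4 grid shape are assumptions for illustration; the slides do not state how chips are actually arranged.

```python
def torus_neighbors(coord, dims):
    """Return the six neighbors of `coord` in a 3D torus of shape `dims`.

    Hypothetical helper: coordinates and grid shape are illustrative,
    not taken from the hardware described in the slides.
    """
    x, y, z = coord
    nx, ny, nz = dims
    # One link in each direction along each axis; modulo gives the wraparound.
    return [
        ((x + 1) % nx, y, z),
        ((x - 1) % nx, y, z),
        (x, (y + 1) % ny, z),
        (x, (y - 1) % ny, z),
        (x, y, (z + 1) % nz),
        (x, y, (z - 1) % nz),
    ]


if __name__ == "__main__":
    # A corner chip still has six neighbors because the edges wrap.
    for neighbor in torus_neighbors((0, 0, 0), (4, 4, 4)):
        print(neighbor)
```

Because every position has the same six-link pattern, growing the grid along any axis (for example, across chassis) leaves the per-chip wiring unchanged.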
11. “Training neural networks is a dark art.”
Hyperparameters:
• Number and type of units/layers
• Convolution filter size
• Weight initialization
• Optimization method
• Learning rate schedule
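One way to make this list concrete is to treat it as a search space and sample configurations from it. The sketch below assumes random search as the tuning strategy, which the slides do not specify. Every name (`SEARCH_SPACE`, `sample_config`, `step_decay`) and every option value is hypothetical.

```python
import random

# Hypothetical search space: one entry per hyperparameter on the slide.
# The option values are illustrative placeholders, not recommendations.
SEARCH_SPACE = {
    "num_layers": [2, 4, 8],
    "layer_type": ["conv", "fully_connected"],
    "filter_size": [3, 5, 7],
    "weight_init": ["gaussian", "uniform", "xavier"],
    "optimizer": ["sgd_momentum", "rmsprop", "adagrad"],
    "lr_schedule": ["constant", "step_decay"],
}


def sample_config(space, rng):
    """Draw one configuration by picking an option uniformly for each key."""
    return {name: rng.choice(options) for name, options in space.items()}


def step_decay(base_lr, epoch, drop=0.1, every=10):
    """Example schedule: multiply the learning rate by `drop` every `every` epochs."""
    return base_lr * (drop ** (epoch // every))


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so a run can be repeated
    for trial in range(3):
        print(trial, sample_config(SEARCH_SPACE, rng))
    print(step_decay(0.1, epoch=25))
```

Even this small space has 3 x 2 x 3 x 3 x 3 x 2 = 324 combinations, which hints at why tuning by hand is described as a dark art.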