14. 14
The Fastest Training Performance at Scale
NVIDIA GPUs Won All Six Accelerated Workloads.
IMAGE CLASSIFICATION OBJECT DETECTION
(LIGHT WEIGHT)
TRANSLATION
(RECURRENT)
OBJECT DETECTION
(HEAVY WEIGHT)
TRANSLATION
(NON-RECURRENT)
RECOMMENDATION
MLPerf Results
NVIDIA GPU up to 3.6X Faster
Multiple NVIDIA GPU Systems Multiple Google TPU Systems
TPU V3
No Result
TPU V3
No Results
1.2X 3.2X 3.6X
TPU V3
No Result
15. 15
Chip-to-Chip Performance Comparison
NVIDIA GPUs scaled further, faster and, on a chip-per-chip basis
IMAGE CLASSIFICATION TRANSLATION
(RECURRENT)
OBJECT DETECTION
(LIGHT WEIGHT)
NCF OBJECT DETECTION
(HEAVY WEIGHT)
TRANSLATION
(NON-RECURRENT)
MLPerf Chip-To-Chip Performance
NVIDIA V100 Google TPUv3
TPU V3
No Results
TPU V3
No Results
TPU V3
No Results
1.1X 1.2X 1.6X
Normalized chip comparison using reported performance on configurations that have similar number of chips .
For TPU: Best 20 chip TPUv3 submission.