This document overviews neuromorphic architectures and hardware trends in machine learning. It outlines the topics covered in a course on neuromorphic architectures: the hardware trends driving the emphasis on accelerators, commercial hardware for machine learning, the goals of neuromorphic hardware, and a taxonomy of neural network types and hardware implementations. Example hardware includes Google's TPU chips, neuromorphic implementations such as SpiNNaker and TrueNorth, and historical commercial neural network chips.
Hardware Trends
• Why the emphasis on accelerators?
– Stagnant single- and multi-thread performance with
general-purpose cores
• Dark silicon (emphasis on power-efficient throughput)
• End of scaling
• No low-hanging fruit
– Emergence of machine learning
Commercial Hardware for ML
Google TPU (inference and training)
Recent NVIDIA chips (Volta)
Microsoft Brainwave and Catapult
Intel Loihi and Nervana
Cambricon
Graphcore (training)
Cerebras (training)
Groq (inference)
ARM
Western Digital
Tesla (FSD)
Neuromorphic Hardware
• The holy grail: emulating the brain
– Low power – the brain consumes only 20 W
– Fault tolerant – the brain loses neurons all the time
– No programming required – the brain learns by itself
• Significant efforts in Europe, the US, and China (Human
Brain Project, DARPA SyNAPSE)
• Implementations: SpiNNaker, Spikey, TrueNorth
Taxonomy
• ANN vs. SNN: ML vs. Neuroscience, training via
back-prop or via biological processes
• Analog vs. Digital architectures
• Chips for inference vs. training
• Systolic-array vs. tensor-based designs
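The systolic-array entry in the taxonomy above can be made concrete with a minimal sketch: a grid of multiply-accumulate cells, one per output element, with one wavefront of operands streamed through per "cycle". This is an illustrative model, not any particular chip's design.

```python
# Minimal output-stationary systolic-array sketch (illustrative only).
# Each cell (i, j) holds an accumulator; on cycle k, operand a[i][k]
# flows in from the left and b[k][j] from the top, and the cell
# performs one MAC.

def systolic_matmul(A, B):
    n, m, k_dim = len(A), len(B[0]), len(B)
    acc = [[0] * m for _ in range(n)]          # one accumulator per PE cell
    for k in range(k_dim):                     # one operand wavefront per cycle
        for i in range(n):
            for j in range(m):
                acc[i][j] += A[i][k] * B[k][j]  # MAC in cell (i, j)
    return acc

A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
print(systolic_matmul(A, B))  # [[19, 22], [43, 50]]
```

The appeal of the layout is that each operand is read from memory once and reused across a whole row or column of cells, which is what makes the design power-efficient.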
ANN vs. SNN
• Perceptron-based ANNs: well understood, high accuracy
• Spiking neural networks: a relatively immature
technology, with potential for low-energy computation,
low-energy communication, and high information
content
• Potential convergence: brain-inspired ideas being
injected into machine learning (RNNs, few bits per
value), and machine learning techniques (back-prop)
being used to train brain models
Analog vs. Digital
• A single analog device can perform multiple multi-bit
operations
• But analog has challenges, e.g., noise and limited precision
• Cognitive applications amenable to analog because
of their noise tolerance
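The noise-tolerance point above can be sketched by injecting Gaussian noise into every multiply, as an analog crossbar might, and checking that the sign of a classifier-style decision usually survives. The noise level and weights are illustrative.

```python
import random

# Dot product where each multiply picks up Gaussian noise, mimicking
# an analog multiply-accumulate (noise magnitude is an assumption).
def noisy_dot(x, w, sigma=0.05, rng=random):
    return sum(xi * wi + rng.gauss(0.0, sigma) for xi, wi in zip(x, w))

random.seed(0)
x = [0.9, -0.2, 0.4]
w = [0.5, 0.3, 0.7]        # exact dot product = 0.67, clearly positive
agree = sum(noisy_dot(x, w) > 0 for _ in range(1000))
print(agree)  # with a 0.67 margin, nearly all noisy trials keep the sign
```

Because the decision margin (0.67) dwarfs the accumulated noise, essentially every trial agrees with the clean result; this is the sense in which cognitive workloads can absorb analog imprecision.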
Early Bust
Commercial NN chips fell out of favor in the early 2000s
• SVMs overtook ANNs
• General-purpose processors were getting faster every
year and quickly overtaking ASICs
• Limited market for machine learning
Note that none of the above applies today
Google TPU
• Version 1: a 15-month effort; basic design, inference
only; 92 TOPS peak, 15x faster than a contemporary
GPU; 40 W, 28 nm, 300 mm² chip
• Version 2: designed for training, a pod is a collection
of v2 chips connected with a torus topology
• Version 3: 8x higher throughput, liquid cooled
Ref: Google
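TPU v1 reached its 92 TOPS peak with 8-bit integer multiply-accumulate units. A minimal quantize / integer-matmul / dequantize sketch shows the idea; the per-tensor scales here are assumed for illustration, not taken from any TPU spec.

```python
# Quantize a float vector to int8 with a given scale (clamped to [-128, 127]).
def quantize(xs, scale):
    return [max(-128, min(127, round(x / scale))) for x in xs]

# Integer dot product, as an 8-bit MAC array would compute it.
def int8_dot(xq, wq):
    return sum(a * b for a, b in zip(xq, wq))

x, w = [0.5, -1.2, 0.8], [0.3, 0.4, -0.6]
sx, sw = 0.01, 0.01                                # per-tensor scales (assumed)
approx = int8_dot(quantize(x, sx), quantize(w, sw)) * sx * sw
exact = sum(a * b for a, b in zip(x, w))
print(round(approx, 3), round(exact, 3))
```

Doing the bulk of the arithmetic in narrow integers, then rescaling once at the end, is what lets an inference accelerator pack far more MACs per watt than a floating-point datapath.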
References
• “Neuromorphic Computing: The Machine of a New Soul”, The Economist, August 2013
• “Artificial Neural Networks: A Review of Commercial Hardware”, Dias et al., Elsevier, 2004
• “The Rebirth of Neural Networks”, Olivier Temam, keynote at ISCA ’10
• Chapter 1 of Nielsen’s book: http://neuralnetworksanddeeplearning.com/chap1.html