SlideShare a Scribd company logo
1 of 20
UNET: Massive Scale DNN on
Spark
Deep Neural Net
Input Layer Hidden Layer 1 Hidden Layer 2 Hidden Layer 3
Convolutional Neural Net
Overview
 Components: Solver, Parameter Server, Model Splits.
 Massive Scale: Data Parallel & Model Parallel.
 Train Method: Async and Sync
 Algorithms: RBM, DA, SGD, CNN, LSTM, AdaGrad,
L1/L2, L-BFGS. CG, etc.
 Extensibility: Can be extended to any algorithm that
can be modeled as data flow.
 Highly optimized with lock free implementation, and
software pipeline maximizing the performance.
 Highly flexible and modulized to support arbitrary
network.
Architecture: Data / Model Parallel
Solver
Model1_3
Model1_2
Model1_1
Q
PS_2
Q
PS_3
Q
PS_1
One Solver RDD
(1 partition)
One Parameter Server RDD
(3 Partitions)
Three Replicated Model RDD
(3 Partitions Each)
Data Parallel
Component: Models & Parameter server
Multiple models trained independently
Each model fits one splits of training data, and
calculates the sub-gradient
Asynchronously, each model update/retrieve
parameters to/from parameter server
Data Parallel
(2 replicated Models with 1 Parameter Server)
Parameter Server
Q
ModelYModelX
Parameter Sync
Model Parallel
Model is huge, and cannot be hold in one
machine.
Training is computational heavy
Model partitioned into multiple splits.
Each split may located in different physical
machines.
Model Parallel
(3 Partitions)
Data Communication:
• node-level
• group-level
Control RPC traffic
Netty based Data Traffic
Master
Executor
Executor
Executor
Data / Model Parallel
Solver
Model1_3
Model1_2
Model1_1
Q
PS_2
Q
PS_3
Q
PS_1
One Solver RDD
(1 partition)
One Parameter Server RDD
(3 Partitions)
Three Replicated Model RDD
(3 Partitions Each)
A Simple Network
Convolutional Fully Mesh Softmax Facility Master
Parameter Management
 ParamMgr.Node for fully meshed layer
Managed by individual node.
 ParamMgr.Group for convolutional layer
Shared by all nodes in the group, and managed by
the group. The group gather/scatter the
parameters from its members, which may locate in
different executors.
 ParamMgr.Const for softmax master layer
The parameters are constant.
qi,1
qi,2
qi,3
qi,4
Node
Params
Parameter Type (Link vs. Node)
q1,I
l
q2,I
l
q3,I
l
Left-link
Params
qi,1
l+1
qi,2
l+1
qi,3
l+1
Right-link
Params
1. Each parameter is associated with either a link or a node.
2. Each node/link may have multiple parameters associated.
3. Link parameters are managed by upstream.
4. Each category of parameters may be managed by either the node or the group.
Network Partitioning
• The DNN network is organized by layers
• Each layer is defined as three-dimension cube by (x, y, z).
• Each dimension can be arbitrarily partitioned, defined as (sx, sy,
sz), s specifies the number of partitions of one dimension.
• One layer can be in multiple executors, and one partition is the
basic unit to be distributed in executors.
x(sx=3)
z(sz=3)
y (sy=2)
Software Components
 Layer: logical group in deep neuron net.
 Group: logical unit having similar input/output topology and
functionality. A group can further have subgroups.
 Node: the basic computation unit provide neuron functionality.
 Connection: define the network topology between layers, such as
fully meshed, convolutional, tiled convolutional, etc.
 Adaptors: mapping the remote upstream/down stream neuron to
local neuron in the topology defined by connections.
 Function: define the activation of each neuron.
 Master: provide central aggregation and scatter for softmax neuron.
 Solver: central place to drive the model training and monitoring.
 Parameter Server: the server used by neuron to update/retrieve
parameters.
Memory Overhead
 Neuron does not need to keep the inputs from upstream,
but only keeps the aggregation record.
 The calculation is associative in both forward/backward path
(through function split trick).
 The link gradient is calculated and updated in the upstream
 Memory overhead is O(N + M), N is the neuron size and M
is the parameter size.
Network Overhead
 Neuron forwards same output to its upstream/downstream
neurons.
 Receiving neurons compute the input or update the gradient.
 Neuron forwards its output to the executors only if it hosts
neurons requesting it.
 Neuron forwards its output to an executor only once
regardless of the number of neurons requesting it.
Complexity
Memory: O(M+N) independent of network
partition mechanism.
M: the number of parameters
N: The number of nodes.
Communication: O(N)
Realized by
 Each node managing its outgoing link parameter
instead of incoming link parameter
 The trick to split the function across the layers
Distributed Pipeline
 MicroBatch: The number of training examples in one pipeline stage
 max_buf: the length of the pipleline.
 Batch algorithms: Significantly improve the performance when the
training data set is big enough to fully populate the pipeline.
 SGD: the improvement is limited, because the pipeline cannot be fully
populated if the miniBatch size is not big enough.
Executor 4
Executor 3
Executor 2
Executor 1 Micro Batch i +4
Micro Batch i +3
Micro Batch i +2
Micro Batch i +1
Micro Batch i +1
Micro Batch i +1
Micro Batch i +1
Micro Batch i +2
Micro Batch i +2 Micro Batch i +3
T1 T2 T3 T4
Connections
 Easy extensible through Adaptors.
 Adaptor is used to mapping global status to its local status.
 Fully Meshed
 (Tiled) Convolutional
 NonShared Convolutional

More Related Content

What's hot

Lecture 11 neural network principles
Lecture 11 neural network principlesLecture 11 neural network principles
Lecture 11 neural network principlesVajira Thambawita
 
All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations Syed Zaid Irshad
 
The Use of Wireless networks for Control Applications
The Use of Wireless networks for Control ApplicationsThe Use of Wireless networks for Control Applications
The Use of Wireless networks for Control ApplicationsNativ Levy
 
Analysis and design of a half hypercube interconnection network topology
Analysis and design of a half hypercube interconnection network topologyAnalysis and design of a half hypercube interconnection network topology
Analysis and design of a half hypercube interconnection network topologyAmir Masoud Sefidian
 
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and SegmentationPointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and SegmentationSEMINARGROOT
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Lecture 3 parallel programming platforms
Lecture 3   parallel programming platformsLecture 3   parallel programming platforms
Lecture 3 parallel programming platformsVajira Thambawita
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetSungminYou
 
Multilayer & Back propagation algorithm
Multilayer & Back propagation algorithmMultilayer & Back propagation algorithm
Multilayer & Back propagation algorithmswapnac12
 
Physical organization of parallel platforms
Physical organization of parallel platformsPhysical organization of parallel platforms
Physical organization of parallel platformsSyed Zaid Irshad
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashrisheetal katkar
 
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...IOSR Journals
 
Deep Belief Networks for Spam Filtering
Deep Belief Networks for Spam FilteringDeep Belief Networks for Spam Filtering
Deep Belief Networks for Spam Filteringbutest
 
A Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelA Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelJenny Liu
 

What's hot (20)

Lecture 11 neural network principles
Lecture 11 neural network principlesLecture 11 neural network principles
Lecture 11 neural network principles
 
All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations
 
The Use of Wireless networks for Control Applications
The Use of Wireless networks for Control ApplicationsThe Use of Wireless networks for Control Applications
The Use of Wireless networks for Control Applications
 
Lec 6-bp
Lec 6-bpLec 6-bp
Lec 6-bp
 
Analysis and design of a half hypercube interconnection network topology
Analysis and design of a half hypercube interconnection network topologyAnalysis and design of a half hypercube interconnection network topology
Analysis and design of a half hypercube interconnection network topology
 
Data Applied: Similarity
Data Applied: SimilarityData Applied: Similarity
Data Applied: Similarity
 
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and SegmentationPointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Lecture 3 parallel programming platforms
Lecture 3   parallel programming platformsLecture 3   parallel programming platforms
Lecture 3 parallel programming platforms
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
 
Multilayer & Back propagation algorithm
Multilayer & Back propagation algorithmMultilayer & Back propagation algorithm
Multilayer & Back propagation algorithm
 
Physical organization of parallel platforms
Physical organization of parallel platformsPhysical organization of parallel platforms
Physical organization of parallel platforms
 
Solution(1)
Solution(1)Solution(1)
Solution(1)
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashri
 
Lecture3
Lecture3Lecture3
Lecture3
 
Chapter 4 pc
Chapter 4 pcChapter 4 pc
Chapter 4 pc
 
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
 
Deep Belief Networks for Spam Filtering
Deep Belief Networks for Spam FilteringDeep Belief Networks for Spam Filtering
Deep Belief Networks for Spam Filtering
 
A Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelA Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in Parallel
 
Chap4 slides
Chap4 slidesChap4 slides
Chap4 slides
 

Similar to UNET: Massive Scale DNN on Spark

Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterYousef Fadila
 
Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...
Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...
Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...acijjournal
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networkIshaneeSharma
 
Scalable Deep Learning Using Apache MXNet
Scalable Deep Learning Using Apache MXNetScalable Deep Learning Using Apache MXNet
Scalable Deep Learning Using Apache MXNetAmazon Web Services
 
Web Spam Classification Using Supervised Artificial Neural Network Algorithms
Web Spam Classification Using Supervised Artificial Neural Network AlgorithmsWeb Spam Classification Using Supervised Artificial Neural Network Algorithms
Web Spam Classification Using Supervised Artificial Neural Network Algorithmsaciijournal
 
Web spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithmsWeb spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithmsaciijournal
 
Deep learning notes.pptx
Deep learning notes.pptxDeep learning notes.pptx
Deep learning notes.pptxPandi Gingee
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learningJunaid Bhat
 
A survey research summary on neural networks
A survey research summary on neural networksA survey research summary on neural networks
A survey research summary on neural networkseSAT Publishing House
 
Neural Networks in Data Mining - “An Overview”
Neural Networks  in Data Mining -   “An Overview”Neural Networks  in Data Mining -   “An Overview”
Neural Networks in Data Mining - “An Overview”Dr.(Mrs).Gethsiyal Augasta
 
Lab Seminar 2009 12 01 Message Drop Reduction And Movement
Lab Seminar 2009 12 01  Message Drop Reduction And MovementLab Seminar 2009 12 01  Message Drop Reduction And Movement
Lab Seminar 2009 12 01 Message Drop Reduction And Movementtharindanv
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networksmilad abbasi
 
Protocols for wireless sensor networks
Protocols for wireless sensor networks Protocols for wireless sensor networks
Protocols for wireless sensor networks DEBABRATASINGH3
 

Similar to UNET: Massive Scale DNN on Spark (20)

Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity Calorimeter
 
Final PPT.pptx
Final PPT.pptxFinal PPT.pptx
Final PPT.pptx
 
Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...
Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...
Enhancement of Improved Balanced LEACH for Heterogeneous Wireless Sensor Netw...
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Scalable Deep Learning Using Apache MXNet
Scalable Deep Learning Using Apache MXNetScalable Deep Learning Using Apache MXNet
Scalable Deep Learning Using Apache MXNet
 
Web Spam Classification Using Supervised Artificial Neural Network Algorithms
Web Spam Classification Using Supervised Artificial Neural Network AlgorithmsWeb Spam Classification Using Supervised Artificial Neural Network Algorithms
Web Spam Classification Using Supervised Artificial Neural Network Algorithms
 
D031202018023
D031202018023D031202018023
D031202018023
 
Web spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithmsWeb spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithms
 
Scene understanding
Scene understandingScene understanding
Scene understanding
 
Deep learning notes.pptx
Deep learning notes.pptxDeep learning notes.pptx
Deep learning notes.pptx
 
2nd fast.pptx
2nd fast.pptx2nd fast.pptx
2nd fast.pptx
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
A survey research summary on neural networks
A survey research summary on neural networksA survey research summary on neural networks
A survey research summary on neural networks
 
Neural Networks in Data Mining - “An Overview”
Neural Networks  in Data Mining -   “An Overview”Neural Networks  in Data Mining -   “An Overview”
Neural Networks in Data Mining - “An Overview”
 
F017533540
F017533540F017533540
F017533540
 
RNN-LSTM.pptx
RNN-LSTM.pptxRNN-LSTM.pptx
RNN-LSTM.pptx
 
Lab Seminar 2009 12 01 Message Drop Reduction And Movement
Lab Seminar 2009 12 01  Message Drop Reduction And MovementLab Seminar 2009 12 01  Message Drop Reduction And Movement
Lab Seminar 2009 12 01 Message Drop Reduction And Movement
 
Cnn
CnnCnn
Cnn
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
 
Protocols for wireless sensor networks
Protocols for wireless sensor networks Protocols for wireless sensor networks
Protocols for wireless sensor networks
 

Recently uploaded

MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfLivetecs LLC
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 

Recently uploaded (20)

MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdf
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 

UNET: Massive Scale DNN on Spark

  • 1. UNET: Massive Scale DNN on Spark
  • 2. Deep Neural Net Input Layer Hidden Layer 1 Hidden Layer 2 Hidden Layer 3
  • 4. Overview  Components: Solver, Parameter Server, Model Splits.  Massive Scale: Data Parallel & Model Parallel.  Train Method: Async and Sync  Algorithms: RBM, DA, SGD, CNN, LSTM, AdaGrad, L1/L2, L-BFGS. CG, etc.  Extensibility: Can be extended to any algorithm that can be modeled as data flow.  Highly optimized with lock free implementation, and software pipeline maximizing the performance.  Highly flexible and modulized to support arbitrary network.
  • 5. Architecture: Data / Model Parallel Solver Model1_3 Model1_2 Model1_1 Q PS_2 Q PS_3 Q PS_1 One Solver RDD (1 partition) One Parameter Server RDD (3 Partitions) Three Replicated Model RDD (3 Partitions Each)
  • 6. Data Parallel Component: Models & Parameter server Multiple models trained independently Each model fits one splits of training data, and calculates the sub-gradient Asynchronously, each model update/retrieve parameters to/from parameter server
  • 7. Data Parallel (2 replicated Models with 1 Parameter Server) Parameter Server Q ModelYModelX Parameter Sync
  • 8. Model Parallel Model is huge, and cannot be hold in one machine. Training is computational heavy Model partitioned into multiple splits. Each split may located in different physical machines.
  • 9. Model Parallel (3 Partitions) Data Communication: • node-level • group-level Control RPC traffic Netty based Data Traffic Master Executor Executor Executor
  • 10. Data / Model Parallel Solver Model1_3 Model1_2 Model1_1 Q PS_2 Q PS_3 Q PS_1 One Solver RDD (1 partition) One Parameter Server RDD (3 Partitions) Three Replicated Model RDD (3 Partitions Each)
  • 11. A Simple Network Convolutional Fully Mesh Softmax Facility Master
  • 12. Parameter Management  ParamMgr.Node for fully meshed layer Managed by individual node.  ParamMgr.Group for convolutional layer Shared by all nodes in the group, and managed by the group. The group gather/scatter the parameters from its members, which may locate in different executors.  ParamMgr.Const for softmax master layer The parameters are constant.
  • 13. qi,1 qi,2 qi,3 qi,4 Node Params Parameter Type (Link vs. Node) q1,I l q2,I l q3,I l Left-link Params qi,1 l+1 qi,2 l+1 qi,3 l+1 Right-link Params 1. Each parameter is associated with either a link or a node. 2. Each node/link may have multiple parameters associated. 3. Link parameters are managed by upstream. 4. Each category of parameters may be managed by either the node or the group.
  • 14. Network Partitioning • The DNN network is organized by layers • Each layer is defined as three-dimension cube by (x, y, z). • Each dimension can be arbitrarily partitioned, defined as (sx, sy, sz), s specifies the number of partitions of one dimension. • One layer can be in multiple executors, and one partition is the basic unit to be distributed in executors. x(sx=3) z(sz=3) y (sy=2)
  • 15. Software Components  Layer: logical group in deep neuron net.  Group: logical unit having similar input/output topology and functionality. A group can further have subgroups.  Node: the basic computation unit provide neuron functionality.  Connection: define the network topology between layers, such as fully meshed, convolutional, tiled convolutional, etc.  Adaptors: mapping the remote upstream/down stream neuron to local neuron in the topology defined by connections.  Function: define the activation of each neuron.  Master: provide central aggregation and scatter for softmax neuron.  Solver: central place to drive the model training and monitoring.  Parameter Server: the server used by neuron to update/retrieve parameters.
  • 16. Memory Overhead  Neuron does not need to keep the inputs from upstream, but only keeps the aggregation record.  The calculation is associative in both forward/backward path (through function split trick).  The link gradient is calculated and updated in the upstream  Memory overhead is O(N + M), N is the neuron size and M is the parameter size.
  • 17. Network Overhead  Neuron forwards same output to its upstream/downstream neurons.  Receiving neurons compute the input or update the gradient.  Neuron forwards its output to the executors only if it hosts neurons requesting it.  Neuron forwards its output to an executor only once regardless of the number of neurons requesting it.
  • 18. Complexity Memory: O(M+N) independent of network partition mechanism. M: the number of parameters N: The number of nodes. Communication: O(N) Realized by  Each node managing its outgoing link parameter instead of incoming link parameter  The trick to split the function across the layers
  • 19. Distributed Pipeline  MicroBatch: The number of training examples in one pipeline stage  max_buf: the length of the pipleline.  Batch algorithms: Significantly improve the performance when the training data set is big enough to fully populate the pipeline.  SGD: the improvement is limited, because the pipeline cannot be fully populated if the miniBatch size is not big enough. Executor 4 Executor 3 Executor 2 Executor 1 Micro Batch i +4 Micro Batch i +3 Micro Batch i +2 Micro Batch i +1 Micro Batch i +1 Micro Batch i +1 Micro Batch i +1 Micro Batch i +2 Micro Batch i +2 Micro Batch i +3 T1 T2 T3 T4
  • 20. Connections  Easy extensible through Adaptors.  Adaptor is used to mapping global status to its local status.  Fully Meshed  (Tiled) Convolutional  NonShared Convolutional