Diese Präsentation wurde erfolgreich gemeldet.
Die SlideShare-Präsentation wird heruntergeladen. ×

HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to enable the future of Embedded Machine Learning

Anzeige
Anzeige
Anzeige
Anzeige
Anzeige
Anzeige
Anzeige
Anzeige
Anzeige
Anzeige
Anzeige
Anzeige

Hier ansehen

1 von 15 Anzeige

HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to enable the future of Embedded Machine Learning

Herunterladen, um offline zu lesen

Session ID: HKG18-300K2
Session Name: HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to enable the future of Embedded Machine Learning
Speaker: Tomas Evensen
Track: Ecosystem Day


★ Session Summary ★
As Moore's law is slowing down, heterogeneous architectures are needed to keep up with the increasing compute requirements emerging from industry trends such as the use machine learning across a diverse range of markets and applications. These compute requirements require custom system architectures to suit the rapidly evolving demands of emerging algorithms, standards and trends.
Field Programmable hardware offers a unique capability to provide flexibility alongside advanced processor architectures to address this ever increasing multitude of applications. Development flows, programmability and flexibility are crucial to the enablement of these advancing algorithms and to enable the next generation of implementations in a world of advancing Artificial intelligence.
In this session we will introduce you to an all Programmable paradigm and low cost development platform to enable an ecosystem of flexibility and unparalleled programmability.
The future is now…...

---------------------------------------------------
★ Resources ★
Event Page: http://connect.linaro.org/resource/hkg18/hkg18-300k2/
Presentation: http://connect.linaro.org.s3.amazonaws.com/hkg18/presentations/hkg18-300k2.pdf
Video: http://connect.linaro.org.s3.amazonaws.com/hkg18/videos/hkg18-300k2.mp4
---------------------------------------------------
★ Event Details ★
Linaro Connect Hong Kong 2018 (HKG18)
19-23 March 2018
Regal Airport Hotel Hong Kong

---------------------------------------------------
Keyword: Ecosystem Day
'http://www.linaro.org'
'http://connect.linaro.org'
---------------------------------------------------
Follow us on Social Media
https://www.facebook.com/LinaroOrg
https://www.youtube.com/user/linaroorg?sub_confirmation=1
https://www.linkedin.com/company/1026961

Session ID: HKG18-300K2
Session Name: HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to enable the future of Embedded Machine Learning
Speaker: Tomas Evensen
Track: Ecosystem Day


★ Session Summary ★
As Moore's law is slowing down, heterogeneous architectures are needed to keep up with the increasing compute requirements emerging from industry trends such as the use machine learning across a diverse range of markets and applications. These compute requirements require custom system architectures to suit the rapidly evolving demands of emerging algorithms, standards and trends.
Field Programmable hardware offers a unique capability to provide flexibility alongside advanced processor architectures to address this ever increasing multitude of applications. Development flows, programmability and flexibility are crucial to the enablement of these advancing algorithms and to enable the next generation of implementations in a world of advancing Artificial intelligence.
In this session we will introduce you to an all Programmable paradigm and low cost development platform to enable an ecosystem of flexibility and unparalleled programmability.
The future is now…...

---------------------------------------------------
★ Resources ★
Event Page: http://connect.linaro.org/resource/hkg18/hkg18-300k2/
Presentation: http://connect.linaro.org.s3.amazonaws.com/hkg18/presentations/hkg18-300k2.pdf
Video: http://connect.linaro.org.s3.amazonaws.com/hkg18/videos/hkg18-300k2.mp4
---------------------------------------------------
★ Event Details ★
Linaro Connect Hong Kong 2018 (HKG18)
19-23 March 2018
Regal Airport Hotel Hong Kong

---------------------------------------------------
Keyword: Ecosystem Day
'http://www.linaro.org'
'http://connect.linaro.org'
---------------------------------------------------
Follow us on Social Media
https://www.facebook.com/LinaroOrg
https://www.youtube.com/user/linaroorg?sub_confirmation=1
https://www.linkedin.com/company/1026961

Anzeige
Anzeige

Weitere Verwandte Inhalte

Diashows für Sie (20)

Ähnlich wie HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to enable the future of Embedded Machine Learning (20)

Anzeige

Weitere von Linaro (20)

Aktuellste (20)

Anzeige

HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to enable the future of Embedded Machine Learning

  1. 1. All Programmable SoCs? Platforms to enable the future of Embedded Machine Learning Linaro Connect – Hong Kong March 2018 Tomas Evensen CTO Embedded Software, Xilinx
  2. 2. © Copyright 2018 Xilinx . Processor frequency scaling ended in 2007 Multicore architecture scaling has flattened CPU Architectures not Scaling with Workloads Workloads require higher performance, lower latency – Cloud: video, big data, AI… – Edge: auto, surveillance, AI…Andrew Danowitz, Kyle Kelley, James Mao, John P. Stevenson, Mark Horowitz Communications of the ACM, Vol. 55 No. 4 Page 2
  3. 3. © Copyright 2018 Xilinx . Application Processor 64-bit Dual/Quad-Core Zynq UltraScale+ MPSoC Real-Time Processors 32-bit Dual-Core Platform & Power Management Granular Power Control Functional Safety Configuration & Security Unit Anti-Tamper & Trust Industry Standards Fabric Acceleration Customizable Engines High Speed Connectivity Video Codec 8K4K (15fps) 4K2K (60fps) High Speed Peripherals Key Interfaces Graphics Processor ARM Mali-400MP2 Memory Subsystem High Bandwidth Low Latency Page 3
  4. 4. © Copyright 2018 Xilinx . Page 4 FPGA: The “Chameleon” Chip What is FPGA/Fabric/Programmable Logic: Is it glue logic? Is it a powerful parallel DSP engine? Is it an RTL simulator? Yes!!! And more… FPGA includes: Programmable logic (LUTs) Hardened DSP blocks Hardened memory (BRAM, URAM) FPGAs are great to implement: Parallel compute (e.g. MAC) – With variable precision Parallel, flexible dataflows – Build your own buses Flexible, multiport memory hierarchies
  5. 5. © Copyright 2018 Xilinx . Breakout in Programming Model Traditional HW design HDMI video proc. video enc. Development Productivity 15x productivity with HLS, IPI Ethernet IP Video decode C++ Video process C++ Video encode C++ HDMI IP SW Programmability Page 5
  6. 6. © Copyright 2018 Xilinx . SDSoC Example: Matrix Multiply + Add main(){ malloc(A,B,C); mmult(A,B,D); madd(C,D,E); printf(E); } madd(inA,inB,out){ } HLS C/C++ mmult(inA,inB,out){ } HLS C/C++ A,B datamovers AXI Bus Platform Application Driver mmult madd Generated D A B C E PS PL Page 6
  7. 7. © Copyright 2018 Xilinx . Supporting the Whole Stack Accelerated Open Frameworks Accelerated Libraries Development Environment Boards w/ HLx-based platform Machine learning Database Analytics Platform Development Stack VCU1525 Acceleration card Page 7
  8. 8. © Copyright 2018 Xilinx . Deep learning-based multi-object recognition for smart-city Live video object detection SSD @ 480x360 on Zynq MPSoC End-customer obtained: – 5x NVIDIA TX2 performance – Better accuracy Example of Embedded Vision Application at the Edge Object detection 5X Perf/watt for SSD vs GPU Page 8
  9. 9. © Copyright 2018 Xilinx . Xilinx and Avnet is partnering and is announcing the Ultra96 board – Equipped with Zync Ultrascale+ MPSoC (ZU3EG) – https://www.96boards.org/product/ultra96/ Ultra96 board makes ARM® based Xilinx SoCs available to developers at a low price point – Built to 96Boards standard, suitable for software prototyping with standardized expansion kits – Targeting a range of applications including Machine Learning, IoT, and compute Leverages an open-source software development platform – 96boards community: 12K actively contributing software engineers – Supports both self-hosted and cross development • Self-hosted: Compile on the board itself • Cross: Develop on your workstation/laptop • C to fabric/FPGA: SDSoC tools available later this year • Unboxing to coding in less than 2 minutes Page 9 $249 Ultra96 Board Targeted for Software Designers Available from Avnet in April
  10. 10. © Copyright 2018 Xilinx . Submit your most creative, most out-of-the box AI or ML application at the Xilinx or Avnet table during Demo Friday (12:00 – 14:00). The best 30 get a FREE Ultra96 board plus software to help you realize your vision. The 1st twenty to submit a working design by MAY 25th, 2018 get a $25 Amazon Gift Card. ONE Winner announced through Xilinx social media channels. If it’s you, you’re invited to present your design to your peers in industry at Xilinx Developer Forum 2018. Page 10 The Future is Ultra96 Xilinx Contest
  11. 11. © Copyright 2018 Xilinx . Page 11 Demo
  12. 12. © Copyright 2018 Xilinx . German Road Sign Database – 50,000+ 32x32 bit images for training – 44 classes (43 road signs, 1 background) – Training via Amazon Web Services • AWS: p2.xlarge Instance – 8 hours à $7.78 6,5e Binary Neural Network Characteristics – 6 convolutional layers – 2 max pool layers – 3 fully connected layers Page 12 Neural Network Example
  13. 13. © Copyright 2018 Xilinx . Page 13 Neural Network Performance Results Up to 8,600 times faster when accelerated with programmable logic Performance Metric Software Only Programmable Logic Accelerated Tiles per second 2.2 19,000 Scene rate (fps) 0.011 (92 sec per frame) 94 Overall Acceleration - 8,600X
  14. 14. © Copyright 2018 Xilinx . Dramatically Accelerate 96Board Software via an FPGA with Integrated Processors – Wednesday 16:00-16:55 Accelerating Neural Networks for Vision Systems via FPGAs – Thursday 11:00-11:25 Page 14 Learn More About FPGA’s and Software Acceleration
  15. 15. © Copyright 2018 Xilinx . Page 15 Questions?/ Thank You

×