SlideShare a Scribd company logo
1 of 12
Download to read offline
Whitepaper
NVIDIA® Tegra™ Multi-processor Architecture




V1.0
Page |2


Table of Contents
The Mobile Challenge - High Performance and Low Power ......................................................................... 3
NVIDIA Tegra Multi-Processor Architecture ................................................................................................. 4
   Use Case Examples.................................................................................................................................... 5
   Eight Purpose Optimized Processors ........................................................................................................ 6
NVIDIA Tegra Global Power Management System ....................................................................................... 9
Ground-Breaking Performance and Battery Life .......................................................................................... 9
Conclusions ................................................................................................................................................. 10
Document Revision History......................................................................................................................... 11




NVIDIA Tegra Multi-Processor Architecture                                                                                                    February 2010
Page |3




The Mobile Challenge - High Performance and Low Power
Processing capabilities of computers and the internet connection bandwidth available to users have
increased by several orders of magnitude over the last two decades. Consequently, there has been
explosive growth in the amount and variety of content that is available on the internet. Web pages now
include complex Java elements, high definition video content and Flash based games. There are over
hundred million websites1 on the internet and over six billion videos2 are being viewed on Youtube™
every month. Farmville, a flash based game has alone attracted over 78 million3 new players on
Facebook. Users today want to access and enjoy this rich content not only on their PCs but also on their
mobile devices such as tablets and smartphones. Users today expect their mobile device to not only
deliver long battery life but also provide the same full high definition web experience that they get from
a PC. In addition to web browsing capabilities, users expect their mobile devices to handle a variety of
use cases such as gaming, audio playback, navigation and photo/video capture. Current mobile devices
are unable to offer both the performance and the battery life required for users to enjoy a high
definition web experience

Most mobile device architectures are designed around a general purpose CPU. The CPU, primarily
designed to handle general purpose operations, is very inefficient in handling specialized audio-visual
tasks such as decoding HD video, 3D gaming and flash video playback. The CPU not only underperforms
but also consumes excessive power for today’s mobile use cases. Relying on a power hungry CPU either
compromises battery life to offer good performance or delivers substandard experiences. This is starkly
evident in the number of complaints seen in mobile gadget magazines, websites, and forums regarding
current mobile devices providing only a few hours of battery life. In fact, numerous articles have been
written to help consumers extend battery life, and these articles basically recommend users to turn off
many of the ‘smart’ features of their smartphones to get longer battery life. We call this the power-
performance paradox.

The NVIDIA Tegra processor resolves this paradox with a new purpose-optimized processor architecture
that delivers a responsive experience for full-resolution Web pages, full HD 1080p video playback, and
days of battery life for typical use.




1
  http://www.domaintools.com/internet-statistics/
2
  http://techcrunch.com/2009/06/09/youtube-video-streams-top-1-billionday/
3
  http://www.facebook.com/apps/application.php?id=102452128776


NVIDIA Tegra Multi-Processor Architecture                                                     February 2010
Page |4




         Smartphones
                                                                         Tablets




   Portable Media Players                                     Mobile Internet Devices




NVIDIA Tegra Multi-Processor Architecture
To overcome the power-performance paradox and the limitations associated with single-processor
architectures, a revolutionary ground-up approach was taken in the design of the NVIDIA Tegra
processor. Extensive architectural simulations and modeling of actual mobile and Web uses cases
showed that a heterogeneous multi-processor architecture was required to meet both performance and
battery life needs of current and future mobile use cases. A heterogeneous multi-processor by definition
consists of several processors that are different in structure and function with each processor optimized
for a specific purpose

The NVIDIA Tegra architecture is a heterogeneous multi-processor architecture that consists of eight
independent processors for graphics, video encode and decode, image processing, audio processing,
power management, and general-purpose functions. These processors are power managed
independently with local hardware control and system level control built into each processor. A system-
level power monitor allows the Tegra processor to turn on only those processors required for a specific
use case, while keeping all other processors turned off.



NVIDIA Tegra Multi-Processor Architecture                                                    February 2010
Page |5


Tegra’s purpose-optimized processors are analogous to a carpenter’s tool kit that has various tools such
as hammer, saw, screwdriver, and drill. A carpenter does not use a hammer to drive screws or a drill to
cut wood. Each tool is optimized for a specific task or function. Likewise, the Tegra processors are all
specialized for particular tasks

-     Graphics Processor (GPU): Delivers outstanding mobile 3D game playability and is also used for
      visually engaging, highly-responsive 3D touch user interfaces.

-     Video Decode Processor: runs video macro-block oriented algorithms including inverse discrete
      cosine transforms (IDCT), variable length decode (VLD), color space conversion (CSC), and bit stream
      processing to deliver smooth, full frame rate 1080p HD video playback and streaming HD Flash video
      playback without compromising battery life.

-     Video Encode Processor: runs video encode algorithms to deliver full 1080p HD video streams for
      video recording and conferencing capabilities.

-     Image Signal Processor (ISP): handles light balance, edge enhancement, and noise reduction
      algorithms to deliver real-time photo enhancement capabilities.

-     Audio Processor: handles analog signal audio processing to deliver over 140 hours of continuous
      128kbps mp3 audio playback on a single battery charge.4

-     Dual-Core ARM Cortex A9 CPU: for general-purpose computing delivers faster web browsing and
      snappier response times on Java enabled websites

-     ARM7 Processor: handles system management functions and several proprietary battery life
      extending features on NVIDIA Tegra.

Each processor adds instructions, caches, clocks, and circuits optimized for each specific task, with
performance monitors to track system activity. Each processor is designed to provide the performance
needed for demanding and heavy workload tasks such as browsing java enabled Web pages, HD video
playback, responsive 3D user interfaces, and gaming. The proprietary power management features in
the NVIDIA Tegra processor track system workload levels and continuously manage the frequency and
voltage of each processor to provide the demanded performance at the lowest possible power. Using a
set of dedicated processors, the system can easily handle multi-tasking loads that include a combination
of 3D, video, communications, and audio tasks. When loads do not require multi-tasking, only the
processor required for the task is used while all other processors in the system are turned off, thus
achieving the lowest power consumption possible.




4
    Using a standard 2000mAh battery


NVIDIA Tegra Multi-Processor Architecture                                                     February 2010
Page |6


Following are three example use cases to illustrate how the NVIDIA Tegra power management system
optimizes processor use:

Listening to Music: During audio playback, the dedicated audio processor and the low-power ARM7
processor are turned on and all other processors are turned off. Turning off all the other processors
allows the ARM7 processor to fetch a playlist and efficiently decode the audio using only 30 mW of
power on average. If the device is in camera mode, the audio processor can be turned off and the ISP is
turned on to grab camera sensor data at the click of a button. The ISP can then clean up the image and
hand it off to the core processor to save on storage.

Watching Videos: When watching movies, the video decode processor handles all of the H.264 decode
of 1080p content and can stream the video to the built-in HDMI port for viewing of HD content on a big
screen. In this case, the display processor and actual display screen can be turned off to save significant
power. For example, the Tegra-powered Zune HD player from Microsoft can provide eight hours of HD
playback with a cell phone-sized battery in this mode.

Playing Games: During game play only the CPUs and the highly optimized NVIDIA GPU are turned on,
and the video processors and ISP are turned off. This ensures that the high-performance processors
consume power only when demanded and consume zero power when not in use.



The diagram below shows the NVIDIA Tegra purpose-optimized processors. Each of these processors is
power managed at a global level through software and locally through hardware mechanisms.




NVIDIA Tegra Multi-Processor Architecture                                                      February 2010
Page |7




         Dual-Core ARM Cortex A9 CPU: NVIDIA Tegra features the world’s first dual core CPU for mobile
         applications in addition to support for Symmetric Multi-Processing (SMP). SMP enables tasks to
         be parallelized between the two ARM Cortex A9 processors, and delivers faster load times of
         Web pages, quicker UI responsiveness, and faster rendering of complex Web pages.

         NVIDIA has implemented aggressive dynamic voltage and frequency scaling to minimize the
         power used by the dual-core Cortex A9 CPU. A typical Web browsing activity consists of a short
         initial period of Web page load followed by a longer period for reading the content of the page.
         When a new Web page is opened the CPU springs to life at full speed to quickly load the page
         and process java elements in the page. The Global Power Management System monitors the
         workload presented to the CPU and tunes the CPU voltage and frequency to match the needs of
         the workload. When the CPU is not in use (for example after a Web page has loaded and the
         user is reading the content of the page), power to the CPU is completely turned off with full
         power gating. Even when Web pages have Flash visual elements, the CPU can be completely
         turned off because the Flash visual features are offloaded to the GPU and video processor in the
         NVIDIA Tegra architecture.

         ARM7 Processor: To avoid the use of the higher power Cortex A9 CPU for tasks that have lower
         performance requirements, an ARM7 processor is included in the Tegra architecture. This


NVIDIA Tegra Multi-Processor Architecture                                                    February 2010
Page |8


         processor is primarily used in concert with the audio and video processors to deliver ultra low-
         power audio and low-power video playback capabilities.

         Ultra Low-Power Graphics Processor (GPU): The NVIDIA Tegra processor includes a market-
         leading NVIDIA GPU with programmable vertex shading and pixel shading processors. The
         hyper-efficient 2D/ 3D processor renders user interfaces, pictures, fonts, and other graphical
         elements instantly using only battery-sipping amounts of power. The 3D processor can render
         Quake 3 at 1024x600 resolution at over 60 frames per second using only a few hundred
         milliwatts. The GPU in NVIDIA Tegra is also capable of handling the demanding workloads
         presented by modern game engines. The Unreal® game engine that is used by several hundred
         PC games runs effortlessly on the NVIDIA Tegra processor. This capability will now enable game
         developers to easily port PC games to NVIDIA Tegra based mobile platform with minimal effort.
         The GPU can also render full-screen Flash animations using only 150 mW. This is critical for long
         battery life when browsing the Web or playing Flash-based games.

         Unlike ARM CPUs, the GPU in the Tegra processor has dedicated hardware to handle video
         workloads. ARM CPUs are generic processors and the CPU hardware is not optimized for specific
         tasks. Thus, they struggle to handle Flash content and render Flash video and graphics elements
         at very low frame rates while consuming relatively large amounts of power. The GPU is
         specifically optimized to decode Flash video and graphics elements and is able to decode Flash
         content at full frame rates and render Flash visual elements at much lower power levels.

          HD Video Decode Processor: The video decode processor is used to decode video streams from
         files that are either played from the local disk or streamed off the network. Over 80% of today’s
         top Web sites include video, most of which is Flash-based video content. The dedicated video
         processor handles all three Flash video formats: H.264, Sorenson and VP6-E. As a result, Flash
         video runs on NVIDIA Tegra at full frame rates and consumes very low power. Other mobile
         solutions that use general-purpose CPUs for Flash video deliver stuttering images and drain
         battery life. The decode processor is based on several generations of NVIDIA hardware decode
         expertise and is able to deliver flawless 1080p video playback while consuming less than 400
         mW of power.

         HD Video Encode Processor: The video encode processor can be used to encode 1080p content
         coming in from an HD video image sensor at 30 fps. The encoded video stream may then be
         saved locally or transmitted live over a broadband network. The video encode and decode
         processors can be used together to deliver HD video conferencing.

         Audio Processor: The dedicated audio processor is used to deliver ultra low-power audio
         playback that can last for days. The audio processing pipeline is designed to consume less than
         30mW of power while playing back a 128 Kbps MP3 file.

         Image Signal Processor (ISP): Many mobile devices include high resolution sensors for taking
         pictures and lower resolution cameras for Web video. The ISP in the NVIDIA Tegra processor is
         capable of taking raw camera sensor input at up to 12 megapixel resolution and 30 frames per


NVIDIA Tegra Multi-Processor Architecture                                                      February 2010
Page |9


         second. The image processor can then apply real-time image enhancement algorithms like
         automatic white balance, edge enhancement, and noise reduction, and can even adjust for poor
         lighting conditions. The output of the ISP is then ideal to save as a picture or as a stream that
         can be compressed for live video conferencing or sent with HD quality across a broadband
         network.

The eight independent processors of the NVIDIA Tegra multi-processor architecture deliver
unprecedented performance and multimedia capabilities to mobile devices. NVIDIA Tegra implements a
Global Power Management System that utilizes hardware feedback and software algorithms to deliver
industry-leading battery life for today’s mobile devices.




NVIDIA Tegra Global Power Management System
Distributed across the NVIDIA Tegra multi-processor architecture are numerous hardware monitors that
collect data on activity levels, frequency, temperature, and incoming request patterns. The Global
Power Management System uses the data fed back from these monitors to determine the optimal
operating frequency and voltage for the active processors. This type of feedback control ensures that
the processors are always running at the lowest possible power level.

Additionally, a feed-forward control mechanism is used to instantaneously boost operating frequency
and voltage for specific requests that indicate upcoming high performance requirements. This ensures
that users do not experience any lag or performance degradation for activities such as 3D UI navigation
and gaming.

Ground-Breaking Performance and Battery Life
The table below illustrates battery life the user can expect from an NVIDIA Tegra-based tablet (with
2000 mAh battery and 400 mW display).




NVIDIA Tegra Multi-Processor Architecture                                                     February 2010
P a g e | 10


Conclusions
The purpose-oriented multi-processor architecture of NVIDIA Tegra is defined by the philosophy of “the
right tool for the job”. Dedicated, high-performance processors deliver snappy Web browsing, flawless
streaming HD Flash video, and an exceptional mobile gaming experience. Additional, processors deliver
ultra low-power audio and video capabilities that provide extended audio and video playback times.

The revolutionary multi-processor architecture and power management features of NVIDIA Tegra
deliver an exceptional mobile visual computing experience with uncompromised battery life. NVIDIA
Tegra processors will be used in a wide range of very capable and enjoyable mobile platforms including
smartphones, tablets, e-readers, media players, and mobile Internet devices (MIDs).

With a system-level design built upon more than 10 generations of proven NVIDIA GPU technology,
Tegra enables intuitive 3D user interfaces, advanced HD multimedia features, and access to full Web
browsing, all while delivering battery performance that leaves the competition far behind.




NVIDIA Tegra Multi-Processor Architecture                                                   February 2010
P a g e | 11


Document Revision History




NVIDIA Tegra Multi-Processor Architecture   February 2010
P a g e | 12

More Related Content

What's hot

Snapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 ArchitectureSnapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 ArchitectureSantosh Verma
 
Qualcomm Snapdragon Processor
Qualcomm Snapdragon ProcessorQualcomm Snapdragon Processor
Qualcomm Snapdragon ProcessorKrishna Gehlot
 
What is a smartphone-R1
What is a smartphone-R1What is a smartphone-R1
What is a smartphone-R1Yong Heui Cho
 
Apple A10 Series Application Processor
Apple A10 Series Application ProcessorApple A10 Series Application Processor
Apple A10 Series Application ProcessorJJ Wu
 
Introducing SAMSUNG Galaxy S9|S9+
Introducing SAMSUNG Galaxy S9|S9+Introducing SAMSUNG Galaxy S9|S9+
Introducing SAMSUNG Galaxy S9|S9+JJ Wu
 
mobile processors
mobile processorsmobile processors
mobile processorsAreticharan
 
Derek Aberle Presentation for Qualcomm Snapdragon 820
Derek Aberle Presentation for Qualcomm Snapdragon 820Derek Aberle Presentation for Qualcomm Snapdragon 820
Derek Aberle Presentation for Qualcomm Snapdragon 820Low Hong Chuan
 
mobile processors introduction..
mobile processors introduction..mobile processors introduction..
mobile processors introduction..Muhammad Sayam
 
Smartphone processors
Smartphone processorsSmartphone processors
Smartphone processorsMahzad Zahedi
 
Apple MacBook 2016
Apple MacBook 2016Apple MacBook 2016
Apple MacBook 2016JJ Wu
 
Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)
Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)
Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)JJ Wu
 
Qualcomm Snapdragon Processors: A Super Gaming Platform
Qualcomm Snapdragon Processors: A Super Gaming Platform Qualcomm Snapdragon Processors: A Super Gaming Platform
Qualcomm Snapdragon Processors: A Super Gaming Platform Qualcomm Developer Network
 
Survey on Mobile Processors
Survey on Mobile ProcessorsSurvey on Mobile Processors
Survey on Mobile ProcessorsAshutosh Singh
 
Brochure (2016-01-30)
Brochure (2016-01-30)Brochure (2016-01-30)
Brochure (2016-01-30)Jonah McLeod
 
Smartphone Hardware Architecture
Smartphone Hardware ArchitectureSmartphone Hardware Architecture
Smartphone Hardware ArchitectureYong Heui Cho
 

What's hot (20)

Snapdragon Processor
Snapdragon ProcessorSnapdragon Processor
Snapdragon Processor
 
Snapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 ArchitectureSnapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 Architecture
 
Imaging on embedded GPUs
Imaging on embedded GPUsImaging on embedded GPUs
Imaging on embedded GPUs
 
Qualcomm Snapdragon Processor
Qualcomm Snapdragon ProcessorQualcomm Snapdragon Processor
Qualcomm Snapdragon Processor
 
What is a smartphone-R1
What is a smartphone-R1What is a smartphone-R1
What is a smartphone-R1
 
Apple A10 Series Application Processor
Apple A10 Series Application ProcessorApple A10 Series Application Processor
Apple A10 Series Application Processor
 
Introducing SAMSUNG Galaxy S9|S9+
Introducing SAMSUNG Galaxy S9|S9+Introducing SAMSUNG Galaxy S9|S9+
Introducing SAMSUNG Galaxy S9|S9+
 
mobile processors
mobile processorsmobile processors
mobile processors
 
Derek Aberle Presentation for Qualcomm Snapdragon 820
Derek Aberle Presentation for Qualcomm Snapdragon 820Derek Aberle Presentation for Qualcomm Snapdragon 820
Derek Aberle Presentation for Qualcomm Snapdragon 820
 
mobile processors introduction..
mobile processors introduction..mobile processors introduction..
mobile processors introduction..
 
Smartphone processors
Smartphone processorsSmartphone processors
Smartphone processors
 
Apple MacBook 2016
Apple MacBook 2016Apple MacBook 2016
Apple MacBook 2016
 
AM437x Product on Module
AM437x Product on ModuleAM437x Product on Module
AM437x Product on Module
 
Product Brief- Nano SOM
Product Brief- Nano SOMProduct Brief- Nano SOM
Product Brief- Nano SOM
 
Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)
Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)
Qualcomm Snapdragon S4 Pro-based Smart Phone(Simple)
 
Qualcomm Snapdragon Processors: A Super Gaming Platform
Qualcomm Snapdragon Processors: A Super Gaming Platform Qualcomm Snapdragon Processors: A Super Gaming Platform
Qualcomm Snapdragon Processors: A Super Gaming Platform
 
Synrgic S8i
Synrgic   S8iSynrgic   S8i
Synrgic S8i
 
Survey on Mobile Processors
Survey on Mobile ProcessorsSurvey on Mobile Processors
Survey on Mobile Processors
 
Brochure (2016-01-30)
Brochure (2016-01-30)Brochure (2016-01-30)
Brochure (2016-01-30)
 
Smartphone Hardware Architecture
Smartphone Hardware ArchitectureSmartphone Hardware Architecture
Smartphone Hardware Architecture
 

Viewers also liked

Apple A7 Processor
Apple A7 ProcessorApple A7 Processor
Apple A7 ProcessorJJ Wu
 
The ARM Architecture: ARM : ARM Architecture
The ARM Architecture: ARM : ARM ArchitectureThe ARM Architecture: ARM : ARM Architecture
The ARM Architecture: ARM : ARM Architecturesreea4
 
Apple iOS Device 2013
Apple iOS Device 2013Apple iOS Device 2013
Apple iOS Device 2013JJ Wu
 
I2C Bus (Inter-Integrated Circuit)
I2C Bus (Inter-Integrated Circuit)I2C Bus (Inter-Integrated Circuit)
I2C Bus (Inter-Integrated Circuit)Varun Mahajan
 
Apple - Compare iPhone Models
Apple - Compare iPhone ModelsApple - Compare iPhone Models
Apple - Compare iPhone ModelsJJ Wu
 

Viewers also liked (6)

Apple A7 Processor
Apple A7 ProcessorApple A7 Processor
Apple A7 Processor
 
The ARM Architecture: ARM : ARM Architecture
The ARM Architecture: ARM : ARM ArchitectureThe ARM Architecture: ARM : ARM Architecture
The ARM Architecture: ARM : ARM Architecture
 
Apple iOS Device 2013
Apple iOS Device 2013Apple iOS Device 2013
Apple iOS Device 2013
 
SOC Design Challenges and Practices
SOC Design Challenges and PracticesSOC Design Challenges and Practices
SOC Design Challenges and Practices
 
I2C Bus (Inter-Integrated Circuit)
I2C Bus (Inter-Integrated Circuit)I2C Bus (Inter-Integrated Circuit)
I2C Bus (Inter-Integrated Circuit)
 
Apple - Compare iPhone Models
Apple - Compare iPhone ModelsApple - Compare iPhone Models
Apple - Compare iPhone Models
 

Similar to Tegra multiprocessor architecture_white_paper

TECHNICAL PAPER PRESENTATION copy
TECHNICAL PAPER PRESENTATION copyTECHNICAL PAPER PRESENTATION copy
TECHNICAL PAPER PRESENTATION copyBhargav Ramesh
 
Tegra3 comparision on Android and Windows presentation
Tegra3 comparision on Android and Windows  presentationTegra3 comparision on Android and Windows  presentation
Tegra3 comparision on Android and Windows presentationGagan019
 
VISION Technology from AMD Powered by AMD E-Series & C-Series APUs
VISION Technology from AMD Powered by AMD E-Series & C-Series APUsVISION Technology from AMD Powered by AMD E-Series & C-Series APUs
VISION Technology from AMD Powered by AMD E-Series & C-Series APUsAdditionalResources
 
High End Modeling & Imaging with Intel Iris Pro Graphics
High End Modeling & Imaging with Intel Iris Pro GraphicsHigh End Modeling & Imaging with Intel Iris Pro Graphics
High End Modeling & Imaging with Intel Iris Pro GraphicsIntel® Software
 
Yashi dealer meeting settembre 2016 tecnologie xeon intel italia
Yashi dealer meeting settembre 2016 tecnologie xeon intel italiaYashi dealer meeting settembre 2016 tecnologie xeon intel italia
Yashi dealer meeting settembre 2016 tecnologie xeon intel italiaYashi Italia
 
Live Data: For When Data is Greater than Memory
Live Data: For When Data is Greater than MemoryLive Data: For When Data is Greater than Memory
Live Data: For When Data is Greater than MemoryMemVerge
 
Intel 8th Core G Series with Radeon Vega M
Intel 8th Core G Series with Radeon Vega M Intel 8th Core G Series with Radeon Vega M
Intel 8th Core G Series with Radeon Vega M Low Hong Chuan
 
Cuda meetup presentation 5
Cuda meetup presentation 5Cuda meetup presentation 5
Cuda meetup presentation 5Rihards Gailums
 
Re-architecting the Datacenter to Deliver Better Experiences (Intel)
Re-architecting the Datacenter to Deliver Better Experiences (Intel)Re-architecting the Datacenter to Deliver Better Experiences (Intel)
Re-architecting the Datacenter to Deliver Better Experiences (Intel)COMPUTEX TAIPEI
 
Graphics Processing Unit by Saurabh
Graphics Processing Unit by SaurabhGraphics Processing Unit by Saurabh
Graphics Processing Unit by SaurabhSaurabh Kumar
 
Virtual Private Server Documentation
Virtual Private Server DocumentationVirtual Private Server Documentation
Virtual Private Server Documentationwebhostingguy
 
Virtual Private Server Documentation
Virtual Private Server DocumentationVirtual Private Server Documentation
Virtual Private Server Documentationwebhostingguy
 
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI Inferencing
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI InferencingInnovative Solutions for Cloud Gaming, Media, Transcoding, & AI Inferencing
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI InferencingRebekah Rodriguez
 
i.MX51 Multimedia Applications Processors
i.MX51 Multimedia Applications Processorsi.MX51 Multimedia Applications Processors
i.MX51 Multimedia Applications ProcessorsPremier Farnell
 
Noile soluţii Intel pentru afaceri eficiente-23apr2010
Noile soluţii Intel pentru afaceri eficiente-23apr2010Noile soluţii Intel pentru afaceri eficiente-23apr2010
Noile soluţii Intel pentru afaceri eficiente-23apr2010Agora Group
 
Noile soluţii Intel pentru afaceri eficiente-20apr2010
Noile soluţii Intel pentru afaceri eficiente-20apr2010Noile soluţii Intel pentru afaceri eficiente-20apr2010
Noile soluţii Intel pentru afaceri eficiente-20apr2010Agora Group
 
Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010
Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010
Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010Agora Group
 
Noile solutii Intel pentru afaceri eficiente-tm-20mai2010
Noile solutii Intel pentru afaceri eficiente-tm-20mai2010Noile solutii Intel pentru afaceri eficiente-tm-20mai2010
Noile solutii Intel pentru afaceri eficiente-tm-20mai2010Agora Group
 

Similar to Tegra multiprocessor architecture_white_paper (20)

TECHNICAL PAPER PRESENTATION copy
TECHNICAL PAPER PRESENTATION copyTECHNICAL PAPER PRESENTATION copy
TECHNICAL PAPER PRESENTATION copy
 
Tegra3 comparision on Android and Windows presentation
Tegra3 comparision on Android and Windows  presentationTegra3 comparision on Android and Windows  presentation
Tegra3 comparision on Android and Windows presentation
 
VISION Technology from AMD Powered by AMD E-Series & C-Series APUs
VISION Technology from AMD Powered by AMD E-Series & C-Series APUsVISION Technology from AMD Powered by AMD E-Series & C-Series APUs
VISION Technology from AMD Powered by AMD E-Series & C-Series APUs
 
High End Modeling & Imaging with Intel Iris Pro Graphics
High End Modeling & Imaging with Intel Iris Pro GraphicsHigh End Modeling & Imaging with Intel Iris Pro Graphics
High End Modeling & Imaging with Intel Iris Pro Graphics
 
Yashi dealer meeting settembre 2016 tecnologie xeon intel italia
Yashi dealer meeting settembre 2016 tecnologie xeon intel italiaYashi dealer meeting settembre 2016 tecnologie xeon intel italia
Yashi dealer meeting settembre 2016 tecnologie xeon intel italia
 
Live Data: For When Data is Greater than Memory
Live Data: For When Data is Greater than MemoryLive Data: For When Data is Greater than Memory
Live Data: For When Data is Greater than Memory
 
Intel 8th Core G Series with Radeon Vega M
Intel 8th Core G Series with Radeon Vega M Intel 8th Core G Series with Radeon Vega M
Intel 8th Core G Series with Radeon Vega M
 
Lenovo t550
Lenovo t550Lenovo t550
Lenovo t550
 
Cuda meetup presentation 5
Cuda meetup presentation 5Cuda meetup presentation 5
Cuda meetup presentation 5
 
Re-architecting the Datacenter to Deliver Better Experiences (Intel)
Re-architecting the Datacenter to Deliver Better Experiences (Intel)Re-architecting the Datacenter to Deliver Better Experiences (Intel)
Re-architecting the Datacenter to Deliver Better Experiences (Intel)
 
Graphics Processing Unit by Saurabh
Graphics Processing Unit by SaurabhGraphics Processing Unit by Saurabh
Graphics Processing Unit by Saurabh
 
Virtual Private Server Documentation
Virtual Private Server DocumentationVirtual Private Server Documentation
Virtual Private Server Documentation
 
Virtual Private Server Documentation
Virtual Private Server DocumentationVirtual Private Server Documentation
Virtual Private Server Documentation
 
APU in nepal 2
APU in nepal 2APU in nepal 2
APU in nepal 2
 
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI Inferencing
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI InferencingInnovative Solutions for Cloud Gaming, Media, Transcoding, & AI Inferencing
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI Inferencing
 
i.MX51 Multimedia Applications Processors
i.MX51 Multimedia Applications Processorsi.MX51 Multimedia Applications Processors
i.MX51 Multimedia Applications Processors
 
Noile soluţii Intel pentru afaceri eficiente-23apr2010
Noile soluţii Intel pentru afaceri eficiente-23apr2010Noile soluţii Intel pentru afaceri eficiente-23apr2010
Noile soluţii Intel pentru afaceri eficiente-23apr2010
 
Noile soluţii Intel pentru afaceri eficiente-20apr2010
Noile soluţii Intel pentru afaceri eficiente-20apr2010Noile soluţii Intel pentru afaceri eficiente-20apr2010
Noile soluţii Intel pentru afaceri eficiente-20apr2010
 
Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010
Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010
Noile tehnologii INTEL pentru infrastructuri IT eficiente-19mar2010
 
Noile solutii Intel pentru afaceri eficiente-tm-20mai2010
Noile solutii Intel pentru afaceri eficiente-tm-20mai2010Noile solutii Intel pentru afaceri eficiente-tm-20mai2010
Noile solutii Intel pentru afaceri eficiente-tm-20mai2010
 

Recently uploaded

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 

Recently uploaded (20)

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 

Tegra multiprocessor architecture_white_paper

  • 2. Page |2 Table of Contents The Mobile Challenge - High Performance and Low Power ......................................................................... 3 NVIDIA Tegra Multi-Processor Architecture ................................................................................................. 4 Use Case Examples.................................................................................................................................... 5 Eight Purpose Optimized Processors ........................................................................................................ 6 NVIDIA Tegra Global Power Management System ....................................................................................... 9 Ground-Breaking Performance and Battery Life .......................................................................................... 9 Conclusions ................................................................................................................................................. 10 Document Revision History......................................................................................................................... 11 NVIDIA Tegra Multi-Processor Architecture February 2010
  • 3. Page |3 The Mobile Challenge - High Performance and Low Power Processing capabilities of computers and the internet connection bandwidth available to users have increased by several orders of magnitude over the last two decades. Consequently, there has been explosive growth in the amount and variety of content that is available on the internet. Web pages now include complex Java elements, high definition video content and Flash based games. There are over hundred million websites1 on the internet and over six billion videos2 are being viewed on Youtube™ every month. Farmville, a flash based game has alone attracted over 78 million3 new players on Facebook. Users today want to access and enjoy this rich content not only on their PCs but also on their mobile devices such as tablets and smartphones. Users today expect their mobile device to not only deliver long battery life but also provide the same full high definition web experience that they get from a PC. In addition to web browsing capabilities, users expect their mobile devices to handle a variety of use cases such as gaming, audio playback, navigation and photo/video capture. Current mobile devices are unable to offer both the performance and the battery life required for users to enjoy a high definition web experience Most mobile device architectures are designed around a general purpose CPU. The CPU, primarily designed to handle general purpose operations, is very inefficient in handling specialized audio-visual tasks such as decoding HD video, 3D gaming and flash video playback. The CPU not only underperforms but also consumes excessive power for today’s mobile use cases. Relying on a power hungry CPU either compromises battery life to offer good performance or delivers substandard experiences. This is starkly evident in the number of complaints seen in mobile gadget magazines, websites, and forums regarding current mobile devices providing only a few hours of battery life. In fact, numerous articles have been written to help consumers extend battery life, and these articles basically recommend users to turn off many of the ‘smart’ features of their smartphones to get longer battery life. We call this the power- performance paradox. The NVIDIA Tegra processor resolves this paradox with a new purpose-optimized processor architecture that delivers a responsive experience for full-resolution Web pages, full HD 1080p video playback, and days of battery life for typical use. 1 http://www.domaintools.com/internet-statistics/ 2 http://techcrunch.com/2009/06/09/youtube-video-streams-top-1-billionday/ 3 http://www.facebook.com/apps/application.php?id=102452128776 NVIDIA Tegra Multi-Processor Architecture February 2010
  • 4. Page |4 Smartphones Tablets Portable Media Players Mobile Internet Devices NVIDIA Tegra Multi-Processor Architecture To overcome the power-performance paradox and the limitations associated with single-processor architectures, a revolutionary ground-up approach was taken in the design of the NVIDIA Tegra processor. Extensive architectural simulations and modeling of actual mobile and Web uses cases showed that a heterogeneous multi-processor architecture was required to meet both performance and battery life needs of current and future mobile use cases. A heterogeneous multi-processor by definition consists of several processors that are different in structure and function with each processor optimized for a specific purpose The NVIDIA Tegra architecture is a heterogeneous multi-processor architecture that consists of eight independent processors for graphics, video encode and decode, image processing, audio processing, power management, and general-purpose functions. These processors are power managed independently with local hardware control and system level control built into each processor. A system- level power monitor allows the Tegra processor to turn on only those processors required for a specific use case, while keeping all other processors turned off. NVIDIA Tegra Multi-Processor Architecture February 2010
  • 5. Page |5 Tegra’s purpose-optimized processors are analogous to a carpenter’s tool kit that has various tools such as hammer, saw, screwdriver, and drill. A carpenter does not use a hammer to drive screws or a drill to cut wood. Each tool is optimized for a specific task or function. Likewise, the Tegra processors are all specialized for particular tasks - Graphics Processor (GPU): Delivers outstanding mobile 3D game playability and is also used for visually engaging, highly-responsive 3D touch user interfaces. - Video Decode Processor: runs video macro-block oriented algorithms including inverse discrete cosine transforms (IDCT), variable length decode (VLD), color space conversion (CSC), and bit stream processing to deliver smooth, full frame rate 1080p HD video playback and streaming HD Flash video playback without compromising battery life. - Video Encode Processor: runs video encode algorithms to deliver full 1080p HD video streams for video recording and conferencing capabilities. - Image Signal Processor (ISP): handles light balance, edge enhancement, and noise reduction algorithms to deliver real-time photo enhancement capabilities. - Audio Processor: handles analog signal audio processing to deliver over 140 hours of continuous 128kbps mp3 audio playback on a single battery charge.4 - Dual-Core ARM Cortex A9 CPU: for general-purpose computing delivers faster web browsing and snappier response times on Java enabled websites - ARM7 Processor: handles system management functions and several proprietary battery life extending features on NVIDIA Tegra. Each processor adds instructions, caches, clocks, and circuits optimized for each specific task, with performance monitors to track system activity. Each processor is designed to provide the performance needed for demanding and heavy workload tasks such as browsing java enabled Web pages, HD video playback, responsive 3D user interfaces, and gaming. The proprietary power management features in the NVIDIA Tegra processor track system workload levels and continuously manage the frequency and voltage of each processor to provide the demanded performance at the lowest possible power. Using a set of dedicated processors, the system can easily handle multi-tasking loads that include a combination of 3D, video, communications, and audio tasks. When loads do not require multi-tasking, only the processor required for the task is used while all other processors in the system are turned off, thus achieving the lowest power consumption possible. 4 Using a standard 2000mAh battery NVIDIA Tegra Multi-Processor Architecture February 2010
  • 6. Page |6 Following are three example use cases to illustrate how the NVIDIA Tegra power management system optimizes processor use: Listening to Music: During audio playback, the dedicated audio processor and the low-power ARM7 processor are turned on and all other processors are turned off. Turning off all the other processors allows the ARM7 processor to fetch a playlist and efficiently decode the audio using only 30 mW of power on average. If the device is in camera mode, the audio processor can be turned off and the ISP is turned on to grab camera sensor data at the click of a button. The ISP can then clean up the image and hand it off to the core processor to save on storage. Watching Videos: When watching movies, the video decode processor handles all of the H.264 decode of 1080p content and can stream the video to the built-in HDMI port for viewing of HD content on a big screen. In this case, the display processor and actual display screen can be turned off to save significant power. For example, the Tegra-powered Zune HD player from Microsoft can provide eight hours of HD playback with a cell phone-sized battery in this mode. Playing Games: During game play only the CPUs and the highly optimized NVIDIA GPU are turned on, and the video processors and ISP are turned off. This ensures that the high-performance processors consume power only when demanded and consume zero power when not in use. The diagram below shows the NVIDIA Tegra purpose-optimized processors. Each of these processors is power managed at a global level through software and locally through hardware mechanisms. NVIDIA Tegra Multi-Processor Architecture February 2010
  • 7. Page |7 Dual-Core ARM Cortex A9 CPU: NVIDIA Tegra features the world’s first dual core CPU for mobile applications in addition to support for Symmetric Multi-Processing (SMP). SMP enables tasks to be parallelized between the two ARM Cortex A9 processors, and delivers faster load times of Web pages, quicker UI responsiveness, and faster rendering of complex Web pages. NVIDIA has implemented aggressive dynamic voltage and frequency scaling to minimize the power used by the dual-core Cortex A9 CPU. A typical Web browsing activity consists of a short initial period of Web page load followed by a longer period for reading the content of the page. When a new Web page is opened the CPU springs to life at full speed to quickly load the page and process java elements in the page. The Global Power Management System monitors the workload presented to the CPU and tunes the CPU voltage and frequency to match the needs of the workload. When the CPU is not in use (for example after a Web page has loaded and the user is reading the content of the page), power to the CPU is completely turned off with full power gating. Even when Web pages have Flash visual elements, the CPU can be completely turned off because the Flash visual features are offloaded to the GPU and video processor in the NVIDIA Tegra architecture. ARM7 Processor: To avoid the use of the higher power Cortex A9 CPU for tasks that have lower performance requirements, an ARM7 processor is included in the Tegra architecture. This NVIDIA Tegra Multi-Processor Architecture February 2010
  • 8. Page |8 processor is primarily used in concert with the audio and video processors to deliver ultra low- power audio and low-power video playback capabilities. Ultra Low-Power Graphics Processor (GPU): The NVIDIA Tegra processor includes a market- leading NVIDIA GPU with programmable vertex shading and pixel shading processors. The hyper-efficient 2D/ 3D processor renders user interfaces, pictures, fonts, and other graphical elements instantly using only battery-sipping amounts of power. The 3D processor can render Quake 3 at 1024x600 resolution at over 60 frames per second using only a few hundred milliwatts. The GPU in NVIDIA Tegra is also capable of handling the demanding workloads presented by modern game engines. The Unreal® game engine that is used by several hundred PC games runs effortlessly on the NVIDIA Tegra processor. This capability will now enable game developers to easily port PC games to NVIDIA Tegra based mobile platform with minimal effort. The GPU can also render full-screen Flash animations using only 150 mW. This is critical for long battery life when browsing the Web or playing Flash-based games. Unlike ARM CPUs, the GPU in the Tegra processor has dedicated hardware to handle video workloads. ARM CPUs are generic processors and the CPU hardware is not optimized for specific tasks. Thus, they struggle to handle Flash content and render Flash video and graphics elements at very low frame rates while consuming relatively large amounts of power. The GPU is specifically optimized to decode Flash video and graphics elements and is able to decode Flash content at full frame rates and render Flash visual elements at much lower power levels. HD Video Decode Processor: The video decode processor is used to decode video streams from files that are either played from the local disk or streamed off the network. Over 80% of today’s top Web sites include video, most of which is Flash-based video content. The dedicated video processor handles all three Flash video formats: H.264, Sorenson and VP6-E. As a result, Flash video runs on NVIDIA Tegra at full frame rates and consumes very low power. Other mobile solutions that use general-purpose CPUs for Flash video deliver stuttering images and drain battery life. The decode processor is based on several generations of NVIDIA hardware decode expertise and is able to deliver flawless 1080p video playback while consuming less than 400 mW of power. HD Video Encode Processor: The video encode processor can be used to encode 1080p content coming in from an HD video image sensor at 30 fps. The encoded video stream may then be saved locally or transmitted live over a broadband network. The video encode and decode processors can be used together to deliver HD video conferencing. Audio Processor: The dedicated audio processor is used to deliver ultra low-power audio playback that can last for days. The audio processing pipeline is designed to consume less than 30mW of power while playing back a 128 Kbps MP3 file. Image Signal Processor (ISP): Many mobile devices include high resolution sensors for taking pictures and lower resolution cameras for Web video. The ISP in the NVIDIA Tegra processor is capable of taking raw camera sensor input at up to 12 megapixel resolution and 30 frames per NVIDIA Tegra Multi-Processor Architecture February 2010
  • 9. Page |9 second. The image processor can then apply real-time image enhancement algorithms like automatic white balance, edge enhancement, and noise reduction, and can even adjust for poor lighting conditions. The output of the ISP is then ideal to save as a picture or as a stream that can be compressed for live video conferencing or sent with HD quality across a broadband network. The eight independent processors of the NVIDIA Tegra multi-processor architecture deliver unprecedented performance and multimedia capabilities to mobile devices. NVIDIA Tegra implements a Global Power Management System that utilizes hardware feedback and software algorithms to deliver industry-leading battery life for today’s mobile devices. NVIDIA Tegra Global Power Management System Distributed across the NVIDIA Tegra multi-processor architecture are numerous hardware monitors that collect data on activity levels, frequency, temperature, and incoming request patterns. The Global Power Management System uses the data fed back from these monitors to determine the optimal operating frequency and voltage for the active processors. This type of feedback control ensures that the processors are always running at the lowest possible power level. Additionally, a feed-forward control mechanism is used to instantaneously boost operating frequency and voltage for specific requests that indicate upcoming high performance requirements. This ensures that users do not experience any lag or performance degradation for activities such as 3D UI navigation and gaming. Ground-Breaking Performance and Battery Life The table below illustrates battery life the user can expect from an NVIDIA Tegra-based tablet (with 2000 mAh battery and 400 mW display). NVIDIA Tegra Multi-Processor Architecture February 2010
  • 10. P a g e | 10 Conclusions The purpose-oriented multi-processor architecture of NVIDIA Tegra is defined by the philosophy of “the right tool for the job”. Dedicated, high-performance processors deliver snappy Web browsing, flawless streaming HD Flash video, and an exceptional mobile gaming experience. Additional, processors deliver ultra low-power audio and video capabilities that provide extended audio and video playback times. The revolutionary multi-processor architecture and power management features of NVIDIA Tegra deliver an exceptional mobile visual computing experience with uncompromised battery life. NVIDIA Tegra processors will be used in a wide range of very capable and enjoyable mobile platforms including smartphones, tablets, e-readers, media players, and mobile Internet devices (MIDs). With a system-level design built upon more than 10 generations of proven NVIDIA GPU technology, Tegra enables intuitive 3D user interfaces, advanced HD multimedia features, and access to full Web browsing, all while delivering battery performance that leaves the competition far behind. NVIDIA Tegra Multi-Processor Architecture February 2010
  • 11. P a g e | 11 Document Revision History NVIDIA Tegra Multi-Processor Architecture February 2010
  • 12. P a g e | 12