Presentation delivered at LinuxCon China 2017.
Containers are a popular cloud technology with the merits of fast provisioning, high density, and near-native performance. New requirements are arising to use GPUs to accelerate various applications in containers, e.g. media transcoding and machine learning. However, there are still gaps in managing GPUs for such container usages. This presentation first reviews the status of GPU support in both native containers and Intel Clear Containers, then provides an anatomy of the technical gaps and enabling plans in the major areas (orchestration layer, resource isolation, and QoS/performance isolation), and finally reviews our work on GPU virtualization in Clear Containers and our prototype work extending the Docker plugin, cgroups, and the GPU scheduler to close those gaps.
What Makes a Container?
[Diagram: two container architectures compared.
Bare metal containers (e.g. Docker): apps and their libs run directly on the host OS, isolated by namespaces and control groups, on shared platform hardware.
VM containers (e.g. Clear Container): each app and its libs run on a dedicated guest OS inside a lightweight VM on a hypervisor.]
Path to GPU Containers
Bare metal containers: GPU namespaces, GPU control groups
VM containers: GPU virtualization technology
Both: integration with the container runtime
GPU Namespace:
DRM Render Node
[Diagram: the DRM/i915 driver exposes two device nodes per GPU.
Card# node, used by the X server: privileged, mode setting, global gem flink.
RenderD# node, used by compute and media transcode: unprivileged, no mode-setting, off-screen 3D.]
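For reference, this is how the two node types appear on a typical Intel graphics host (minor numbers, timestamps, and group names vary by distribution):

  $ ls -l /dev/dri
  crw-rw---- 1 root video 226,   0 Jun 19 10:00 card0
  crw-rw---- 1 root video 226, 128 Jun 19 10:00 renderD128

Major 226 is the DRM device class; card0 is the privileged node, renderD128 the unprivileged render node.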
GPU Namespace:
DRM Render Node
[Diagram: media transcode, off-screen 3D, and compute library stacks each open the RenderD# node of DRM/i915, with per-stack GPU contexts underneath.]
Sharing happens only via dmabuf (no global gem objects on card0)
No need to create another namespace
Expose DRM render nodes to provide isolated views through the device cgroup (see the sketch below),
e.g. "docker run --device=/dev/dri/renderD128 -it debian"
Per-process hardware contexts in the GPU
Hardware-managed render context switching
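To illustrate the device-cgroup mechanism that docker --device relies on here, a minimal sketch against the cgroup v1 device controller (the Docker cgroup path and container ID are placeholders):

  # Drop the whitelist entirely, then allow only the render node
  # (char device, major 226, minor 128).
  CG=/sys/fs/cgroup/devices/docker/<container-id>
  echo a > $CG/devices.deny
  echo "c 226:128 rwm" > $CG/devices.allow
  # Tasks in this cgroup can now open /dev/dri/renderD128 but not /dev/dri/card0.

Docker performs the equivalent setup internally when --device is passed.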
GPU CGroup
[Diagram: a container resource manager drives per-subsystem cgroup controllers (CPU, memory, blkio, net, and a new GPU controller) across all containers.]
Container-granularity GPU resource control
GPU CGroup
New subsystem: gpu_cgrp_subsys
Control knobs (usage sketch below):
gpu.shares (% share of GPU cycles, work in progress)
gpu.priority (workload priority in the system)
gpu.memory (maximum GPU memory size)
DRM/i915:
Honors 'shares/priority' in the workload scheduler
Enforces the memory limit (optional for UMA graphics)
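A hypothetical usage sketch of these knobs through cgroupfs, assuming the PoC controller is named gpu and mounts like any v1 controller (knob names are from the slide; the values and <pid> are illustrative):

  # Mount the PoC gpu controller and create a per-container group.
  mount -t cgroup -o gpu none /sys/fs/cgroup/gpu
  mkdir /sys/fs/cgroup/gpu/mycontainer
  # Cap GPU memory at 256 MiB and set a workload priority.
  echo $((256*1024*1024)) > /sys/fs/cgroup/gpu/mycontainer/gpu.memory
  echo 2 > /sys/fs/cgroup/gpu/mycontainer/gpu.priority
  # Attach the container's init process.
  echo <pid> > /sys/fs/cgroup/gpu/mycontainer/tasks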
Container Runtime Integration
{
  "linux": {
    "devices": [
      {
        "path": "/dev/dri/renderD128",
        "type": "c",
        "major": 226,
        "minor": 128,
        "fileMode": 432,
        "uid": 0,
        "gid": 0
      }
    ],
    "resources": {
      …
      "gpu": {
        "memory": <max_mem_in_bytes>,
        "prio": <workload_priority>
      }
    }
  }
}
runC: a universal container runtime
Understands the new gpu cgroup
Adds new Linux resources options to config.json for gpu (shown above)
Runtime-tools helper to generate config stubs
New Docker GPU control parameters,
e.g. docker --gpu-mem=xxx --gpu-priority=xxx --gpu-share=xxx …
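To see where this configuration lands end to end, a minimal sketch of driving the extended runC directly (the gpu block is the proposal above, not part of the upstream OCI spec; the bundle path is illustrative):

  # Create an OCI bundle; rootfs must be populated, e.g. from a
  # Docker image export.
  mkdir -p /tmp/bundle/rootfs && cd /tmp/bundle
  runc spec
  # Edit config.json: add the renderD128 device entry and the
  # proposed "resources.gpu" block shown above.
  runc run gpu-demo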
Image Portability
Main challenge: library compatibility
User-level libraries MUST match the underlying kernel/hardware
Introduce a Docker helper for image inspection (sketched below):
Compare the image's libraries with the host environment
Refuse to launch the container on any mismatch
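A minimal sketch of the inspection idea, assuming a Debian-based image and checking one representative library (libdrm); a real helper would walk the whole user-level graphics stack (the image name is a placeholder):

  # Compare the libdrm version inside the image against the host.
  img_ver=$(docker run --rm myimage dpkg-query -W -f '${Version}' libdrm2)
  host_ver=$(dpkg-query -W -f '${Version}' libdrm2)
  if [ "$img_ver" != "$host_ver" ]; then
      echo "libdrm mismatch: image=$img_ver host=$host_ver" >&2
      exit 1   # refuse to launch the container
  fi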
VM GPU Containers
[Diagram: Intel GVT-g in the host DRM/i915 driver carves the physical GPU into multiple vGPUs; each Clear Container (CC) VM runs its own guest OS and media transcode stack on an assigned vGPU.]
Clear Container (CC) with Intel graphics virtualization technology (Intel GVT-g)
Assign a vGPU to each VM container (creation sketch below)
In upstream Linux since v4.10
https://01.org/igvt-g
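For context, GVT-g vGPUs are created through the kernel's mediated-device (mdev) sysfs interface; a sketch, assuming an Intel GPU at PCI address 0000:00:02.0 (the supported type names vary by platform):

  # List the vGPU types this GPU supports.
  ls /sys/bus/pci/devices/0000:00:02.0/mdev_supported_types/
  # Create a vGPU instance with a fresh UUID; it can then be passed
  # to the VM via vfio, e.g. QEMU's -device vfio-pci,sysfsdev=...
  UUID=$(uuidgen)
  echo $UUID > /sys/bus/pci/devices/0000:00:02.0/mdev_supported_types/i915-GVTg_V5_4/create

The setup guide linked below walks through the full Clear Container flow.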
VM GPU Containers
Nested containers inside the VM also have GPU access
Needs improvement:
Scalability
Only 8 vGPUs are supported today due to static resource partitioning
Need a more enlightened way to scale further in the short term
Boot time
A full guest graphics driver load takes >1.5s
Optimization to <0.5s is in progress
Status
GPU cgroup PoC
https://github.com/zhenyw/linux
https://github.com/zhenyw/runc
https://github.com/zhenyw/moby
https://github.com/zhenyw/cli
GPU virtualization for Clear Containers
https://clearlinux.org/features/intel%C2%AE-clear-containers
https://github.com/01org/gvt-linux/wiki/Clear-Container-with-GVTg-Setup-Guide