SDK Overview
Dr David Brown, Microsoft Technology Centre
Agenda
  • Installing and Using the Kinect Sensor
  • Setting up your Development Environment
  • Camera Fundamentals
  • Working with Depth Data
  • Skeletal Tracking Fundamentals
  • Audio Fundamentals
Hardware
  • Computer with a dual-core 2.66-GHz or faster processor
  • 2 GB RAM
  • Windows 7-compatible graphics card that supports DirectX 9.0c
Kinect Sensor
  • 3D depth sensors
  • RGB camera
  • Motorized tilt
  • Multi-array mic
Development Environment
  • Microsoft Visual Studio 2010 Express or other Visual Studio 2010 edition
  • .NET Framework 4.0
  • SDK, http://research.microsoft.com/kinectsdk
  • DirectX samples
      • Microsoft DirectX® SDK, June 2010 or later version
      • Current runtime for Microsoft DirectX® 9
  • Speech samples
      • Microsoft Speech Platform Runtime, version 10.2 (x86 edition)
      • Microsoft Kinect Speech Platform (US-English version)
      • Microsoft Speech Platform Software Development Kit, version 10.2 (x86 edition)
Image API
Demo
Depth Image
  • Array of bytes (ImageFrame.Image.Bits)
  • Ordered left to right, top to bottom
  • Each value represents the distance for that pixel in mm (850 to 4,000 mm)
  • 0 means unknown; shadows, low reflectivity, and high reflectivity are among the reasons
  • Player index
      • 0: no player
      • 1: skeleton 0
      • 2: skeleton 1
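As a rough illustration of the layout described on this slide, the sketch below locates a pixel in a row-major depth buffer and labels a raw reading. It is not SDK code: the function names and the 320-pixel frame width in the usage lines are assumptions made for the example, and the two-bytes-per-pixel stride is taken from the Depth Data slide.

```python
# Illustrative helpers only; none of these names come from the Kinect SDK.

BYTES_PER_PIXEL = 2          # each depth pixel is 16 bits
MIN_MM, MAX_MM = 850, 4000   # usable range quoted on the slide


def pixel_offset(x, y, width):
    """Byte offset of pixel (x, y) in a left-to-right, top-to-bottom buffer."""
    return (y * width + x) * BYTES_PER_PIXEL


def describe_depth(mm):
    """Label a depth reading given in millimetres."""
    if mm == 0:
        return "unknown"       # e.g. shadow, low or high reflectivity
    if MIN_MM <= mm <= MAX_MM:
        return "in range"
    return "out of range"


def describe_player(index):
    """Map a player index to the meaning given on the slide."""
    return "no player" if index == 0 else f"skeleton {index - 1}"


print(pixel_offset(0, 0, 320))    # first pixel of the frame
print(pixel_offset(5, 2, 320))    # sixth pixel of the third row
print(describe_depth(0), "/", describe_depth(1500))
print(describe_player(2))
```

The offset returned by `pixel_offset` points at the first of the two bytes for that pixel; the second byte follows immediately after it.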
Depth Data
  • 2 bytes per pixel (16 bits)
  • Depth (distance per pixel)
      • Bit-shift the second byte left by 8
      • Distance (0,0) = (int)(Bits[0] | Bits[1] << 8);
  • DepthAndPlayerIndex (includes the player index)
      • Bit-shift the first byte right by 3 (to drop the player index) and the second byte left by 5
      • Distance (0,0) = (int)(Bits[0] >> 3 | Bits[1] << 5);
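The two bit-shift formulas on this slide can be tried out on plain integers. The following is a minimal sketch in Python rather than the C# used in the demos; the function names are hypothetical, and the `& 0x07` mask for the player index is an inference from the shift by 3 (three low bits of the first byte), not something the slide states.

```python
# Illustrative only: decodes one depth pixel from its two bytes (lo, hi).

def depth_only(lo, hi):
    """Depth in mm from a plain depth pixel: second byte shifted left by 8."""
    return lo | (hi << 8)


def depth_and_player(lo, hi):
    """Return (depth_mm, player_index) from a depth-and-player-index pixel."""
    depth_mm = (lo >> 3) | (hi << 5)
    player = lo & 0x07   # assumption: player index sits in the low 3 bits
    return depth_mm, player


def pack_depth_and_player(depth_mm, player):
    """Inverse of depth_and_player, used here only to build test data."""
    value = (depth_mm << 3) | player
    return value & 0xFF, value >> 8


lo, hi = pack_depth_and_player(2000, 1)
print(depth_and_player(lo, hi))    # (2000, 1)
print(depth_only(0x34, 0x12))      # 4660
```

Packing a known depth and player index and decoding them again is a quick way to confirm that the two shifts reassemble the 13 depth bits correctly.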
Demo
Skeleton API
Skeleton Data
Joint Data
  • Maximum of two players tracked at once
  • Six player proposals
  • Each player has a set of joints, <x, y, z> in meters
  • Tracking state
      • Tracked
      • Inferred: occluded, clipped, or low-confidence joints
      • Not tracked: rare, but your code must check for this state
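The point about checking the tracking state can be sketched as a simple filter. This is a hypothetical model in Python, not the SDK's types: `TrackingState`, `Joint`, `usable_joints`, the enum values, and the sample coordinates are all invented for the illustration.

```python
from dataclasses import dataclass
from enum import Enum


class TrackingState(Enum):
    # The three states named on the slide; the numeric values are arbitrary.
    NOT_TRACKED = 0
    INFERRED = 1
    TRACKED = 2


@dataclass
class Joint:
    name: str
    x: float   # metres
    y: float
    z: float
    state: TrackingState


def usable_joints(joints, allow_inferred=True):
    """Keep joints whose position can be used; NOT_TRACKED is always dropped."""
    accepted = {TrackingState.TRACKED}
    if allow_inferred:
        accepted.add(TrackingState.INFERRED)
    return [j for j in joints if j.state in accepted]


skeleton = [
    Joint("head", 0.0, 0.6, 2.1, TrackingState.TRACKED),
    Joint("hand_left", -0.4, 0.1, 2.0, TrackingState.INFERRED),
    Joint("foot_right", 0.2, -0.9, 2.2, TrackingState.NOT_TRACKED),
]
print([j.name for j in usable_joints(skeleton)])
print([j.name for j in usable_joints(skeleton, allow_inferred=False)])
```

Whether inferred joints are acceptable depends on the application: drawing a skeleton can usually tolerate them, while gesture detection may prefer tracked joints only.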
Demo
Audio Processing
  • Four-microphone array with hardware-based audio processing
      • Multichannel echo cancellation (MEC)
      • Sound position tracking
      • Other digital signal processing (noise suppression and reduction)
Audio API
Speech Recognition
  • Grammar: what we are listening for
      • Code: GrammarBuilder, Choices
      • Speech Recognition Grammar Specification (SRGS)
      • C:\Program Files (x86)\Microsoft Speech Platform SDK\Samples\Sample Grammars\
  • Set AutomaticGainControl = false
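To give a feel for the SRGS option, the sketch below generates a minimal SRGS XML grammar that accepts any one of a few phrases. It is written in Python purely to produce the file text; it does not use the Speech Platform, and the function name, the rule id `command`, and the colour words are assumptions for the example. In the demos the same kind of choice list would be built in code with GrammarBuilder and Choices.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

SRGS_NS = "http://www.w3.org/2001/06/grammar"   # W3C SRGS namespace


def build_srgs(phrases, rule_id="command", lang="en-US"):
    """Return an SRGS XML grammar whose root rule matches any one phrase."""
    items = "\n".join(f"      <item>{escape(p)}</item>" for p in phrases)
    return (
        f'<grammar version="1.0" xml:lang="{lang}" root="{rule_id}"\n'
        f'         xmlns="{SRGS_NS}">\n'
        f'  <rule id="{rule_id}">\n'
        f"    <one-of>\n{items}\n    </one-of>\n"
        f"  </rule>\n"
        f"</grammar>\n"
    )


grammar_xml = build_srgs(["red", "green", "blue"])
print(grammar_xml)

# Parse the result back to confirm it is well-formed XML.
root = ET.fromstring(grammar_xml)
print(len(root.findall(f".//{{{SRGS_NS}}}item")))
```

A grammar this small is easier to express in code; an SRGS file becomes more attractive once the phrase list is long or needs to be edited without recompiling.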
Demo
Samples
  • NUI
      • Skeletal viewer, C++, C#
      • Shape Game demo, C#
  • Audio
      • Raw capture, C++
      • Audio filtering, C++
      • Echo cancellation, C++
      • Recording, C#
      • Speech, C#
Resources
  • SDK, http://research.microsoft.com/kinectsdk
  • Channel 9 quick-starts, http://channel9.msdn.com/series/KinectSDKQuickstarts/
  • Coding4Fun gallery & blog, http://channel9.msdn.com/coding4fun/kinect
Architecture

Kinect for Windows SDK Dr David Brown

Editor's notes

  1. The Speech Platform Runtime and SDK must be the x86 editions. The Microsoft Kinect Speech Platform uses the same speech recognition (acoustic model) as Xbox. The .NET 4.0 System.Speech namespace can also be used, but it is not as up to date.
  2. Sensor specifications:
       • Colour and depth stream range: 4 to 11.5 feet (1.2 to 3.5 meters)
       • Skeletal tracking range: 4 to 11.5 feet (1.2 to 3.5 meters)
       • Viewing angle: 43° vertical by 57° horizontal field of view
       • Mechanized tilt range (vertical): ±28°
       • Frame rate (depth and colour stream): 30 frames per second (FPS)
       • Resolution, depth stream: QVGA (320 × 240)
       • Resolution, colour stream: VGA (640 × 480)
       • Audio format: 16-kHz, 16-bit mono pulse code modulation (PCM)
       • Audio input characteristics: a four-microphone array with 24-bit analogue-to-digital converter (ADC) and Kinect-resident signal processing such as echo cancellation and noise suppression
  3. Demos: WPF, event-driven RGB and depth frames, camera tilt.
  4. Samples:
       • Skeletal Viewer (C++ and C#): The Kinect sensor includes two cameras: one delivers depth information and the other delivers color data. The NUI API enables applications to access and manipulate this data. The SkeletalViewer sample uses the NUI API to render data from the Kinect sensor's cameras as images on the screen. The managed sample uses WPF to render captured images, and the native application uses DirectX.
       • ShapeGame: Creating a Game with Audio and Skeletal Tracking. Displays the tracked skeletons of two players together with shapes falling from the sky. Players can control the shapes by moving and speaking commands.
       • Audio Capture Raw (C++): The Kinect sensor's audio component is a four-element microphone array. The AudioCaptureRaw sample uses the Windows Audio Session API (WASAPI) to capture the raw audio stream from the Kinect sensor's microphone array and write it to a .wav file.
       • MicArrayEchoCancellation: Acoustic Echo Cancellation, Beam Forming, and Source Localization (C++). The primary way for C++ applications to access the Kinect sensor's microphone array is through the MSRKinectAudio DirectX Media Object (DMO). The MSRKinectAudio DMO supports all standard microphone array functionality, and adds support for beamforming and source localization. The MicArrayEchoCancellation sample shows how to use the KinectAudio DMO in a DirectShow graph. It uses acoustic echo cancellation to record a high-quality audio stream, and beamforming and source localization to determine the selected beam and the direction to the sound source.
       • MFAudioFilter: Media Foundation Audio Filter (C++). Shows how to capture an audio stream from the Kinect sensor's microphone array by using the MSRKinectAudio DMO in filter mode in a Windows Media Foundation topology.
       • RecordAudio: Recording an Audio Stream and Monitoring Direction (C#). Demonstrates how to capture an audio stream from the Kinect sensor's microphone array and monitor the currently selected beam and sound source direction.
       • Speech: Recognizing Voice Commands (C#). Demonstrates how to use the Kinect sensor's microphone array with the Microsoft.Speech API to recognize voice commands.
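The sensor figures in note 2 lend themselves to a little back-of-the-envelope arithmetic. The sketch below is only that: it assumes an ideal pinhole-style field of view (no lens distortion), uses the two bytes per depth pixel from the Depth Data slide, and the function names are invented for the example.

```python
import math


def view_extent_m(distance_m, fov_deg):
    """Width (or height) covered at a distance for a given field-of-view angle."""
    return 2 * distance_m * math.tan(math.radians(fov_deg) / 2)


def video_bytes_per_second(width, height, bytes_per_pixel, fps):
    """Uncompressed data rate of an image stream."""
    return width * height * bytes_per_pixel * fps


def audio_bytes_per_second(sample_rate_hz, bits_per_sample, channels):
    """Uncompressed data rate of a PCM audio stream."""
    return sample_rate_hz * (bits_per_sample // 8) * channels


# Area covered at the near and far ends of the quoted range (57° by 43°).
for d in (1.2, 3.5):
    w = view_extent_m(d, 57)
    h = view_extent_m(d, 43)
    print(f"{d} m: about {w:.2f} m wide by {h:.2f} m high")

print(video_bytes_per_second(320, 240, 2, 30))   # QVGA depth stream at 30 FPS
print(audio_bytes_per_second(16_000, 16, 1))     # 16-kHz, 16-bit mono PCM
```

The depth stream works out to 153,600 bytes per frame, which gives a sense of how much data each frame event hands to the application.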