Additional information on #datatuesday: http://data-tuesday.com/
Additional information on Hadoop on Azure: http://www.hadooponazure.com, http://aka.ms/benjguinhadoop
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
2012-01-10-data tuesday
1. Data Tuesday – 10 janvier 2012
Pierre Lagarde (DPE) – pierlag@microsoft.com
Benjamin Guinebertière (DPE) – www.benjguin.com
2. Microsoft Distribution of Hadoop [MDH]
• Code name : Isotope
• Leveraging the Hadoop data-driven
community
– OnPremise – Cloud
– Windows Server integration [AD – Secure HDFS]
– Connection with SQL Server / Excel
– Developer Framework [JavaScript, .NET, F#, …]
– Hadoop as a Service through Azure [eMDH]
3. Structural Overview
ISOTOPE
[Azure and Enterprise]
Java - JavaScript Streaming OM HiveQL PigLatin .NET/C#/F# (T)SQL
NOSQL OCEAN OF DATA ETL
[unstructured, semi-structured, structured]
HDFS
A SEAMLESS OCEAN OF INFORMATION PROCESSING AND ANALYTICS
EIS / ERP RDBMS File System OData [RSS] Azure Storage
Isotope is the all-up effort around Microsoft and Hadoop. It includes several components:A full distribution of Apache Hadoop that runs on standard windows hardware.A full version of Apache Hadoop that runs on the Azure cloudConnectors from Hadoop (any Hadoop, not just Microsoft’s) to Microsoft’s key products – SQL, Excel, PDW, etc.Jscript shell for live scripting of Hadoop from the browserAdmin, monitoring, and authoring tools to make Microsoft Hadoop best-in-class