Impetus Technologies Inc. 
1 © 2014 Impetus Technologies 
Big Data 
Presented By: 
Sanjay Sharma
Outline 
• About 
• Big Data: Recap 
• Big Data Technologies Landscape 
• Hadoop Overview 
• Other Big Data Tools Overview 
• Impact on IT and us 
About 
• Big Data Solution Architect 
• Work for Impetus Technologies 
• Based out of San Jose, Atlanta & India (1300+ Engineers)
• Thought Leaders in Big Data consulting 
• Started Big Data Labs 4 years ago 
Big Data 
• Volume
• Velocity
• Variety
Big Data Opportunity 
• Visualization
• Personalization
• Optimization
• Advanced Predictive Analytics
• Time to Market
• Business Opportunity
Big Data Users – 2009
Big Data Users – 2010 
Big Data Job Trends 
Top Job Trends (Indeed.com, July 2012)
• HTML5
• MongoDB
• iOS
• Android
• Mobile app
• Puppet
• Hadoop
• jQuery
• PaaS
• Social Media
Big Data Everywhere
Search: Atlanta, GA (Date: 10/16/2012)
• “Big Data” = 82
• “Hadoop” = 78
• “NoSQL” = 117
• MPP DBs = 192
Search: USA (Date: 10/16/2012)
• “Big Data” = 5169
• “Hadoop” = 5174
• “NoSQL” = 3820
• MPP DBs = 4581
Big Data Future 
Source: McKinsey, http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation
Big Data Landscapes 
Hadoop, MPP, NewSQL, NoSQL
• Scalability Limits 
• Online vs. Batch 
• Open Source 
• Writes/Reads 
• Petabyte Scale 
• Commodity 
• DW Vendors 
• Appliances
Big Data Vendor Galore 
Hadoop: Glory to the Elephant 
Hadoop 
Distributed File System
• Petabyte Scale
• Thousands of Commodity Servers
• High Availability
• High Fault Tolerance
Distributed Processing System
• Simple, easy-to-code Algorithm
• Code once, Run on PBs
• High Fault Tolerance
• Data Locality
Map Concept: RDBMS 
BIG COMBINED TABLE
id             Name      Other Columns
1              Scott     ..
2              Bob       ..
3              Lisa      ..
4              Sanjay    ..
….             ..        ..
256 million    Bob       ..
Select count(*), 'Bob' from table where name='Bob'; -> 12,'Bob'
Select count(*), 'Scott' from table where name='Scott'; -> 4,'Scott'
TABLE 1- on m/c 1
Id                   Name
1                    Scott
2                    Bob
3                    Lisa
4                    Sanjay
……
64 million           Bob
Select 'Bob', count(*) from table where name='Bob'; <- same queries for 'Scott' & 'Sanjay'
"Bob",3
"Scott",1
"Sanjay",2
TABLE 2- on m/c 2
Id                   Name
64 million and 1     Scott
64 million and 2     Bob
64 million and 3     Lisa
64 million and 4     Sanjay
……
128 million          Bob
Select count(*), 'Bob' from table where name='Bob';
"Bob",3
"Scott",1
"Sanjay",0
TABLE 3- on m/c 3
Id                   Name
128 million and 1    Scott
128 million and 2    Bob
128 million and 3    Lisa
128 million and 4    Sanjay
……
192 million          Bob
Select count(*), 'Bob' from table where name='Bob';
"Bob",3
"Scott",1
"Sanjay",1
TABLE 4- on m/c 4
Id                   Name
192 million and 1    Scott
192 million and 2    Bob
192 million and 3    Lisa
192 million and 4    Lisa
……
256 million          Bob
Select count(*), 'Bob' from table where name='Bob';
"Bob",3
"Scott",1
"Sanjay",1
Reduce Concept: RDBMS 
“Scott”, list([1,1,1,1]): List[1,1,1,1].iterate -> Sum(EACH)
“Bob”, list([3,3,3,3]): List[3,3,3,3].iterate -> Sum(EACH)
“Sanjay”, list([2,0,1,1]): List[2,0,1,1].iterate -> Sum(EACH)
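The map and reduce concept slides can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Hadoop code: the four short lists are hypothetical stand-ins for the four tables on four machines, chosen so that the per-machine counts match the numbers on the slides, and the helper names `map_partition`, `group_by_name` and `reduce_counts` are made up for this example.

```python
from collections import Counter, defaultdict

# Hypothetical stand-ins for the Name column of the four tables (one list per machine).
PARTITIONS = [
    ["Scott", "Bob", "Lisa", "Sanjay", "Bob", "Sanjay", "Bob"],  # m/c 1
    ["Scott", "Bob", "Lisa", "Bob", "Bob"],                      # m/c 2
    ["Scott", "Bob", "Lisa", "Sanjay", "Bob", "Bob"],            # m/c 3
    ["Scott", "Bob", "Lisa", "Lisa", "Sanjay", "Bob", "Bob"],    # m/c 4
]
NAMES = ["Bob", "Scott", "Sanjay"]


def map_partition(rows, names):
    """Map step: count each requested name within one partition only."""
    counts = Counter(rows)
    return [(name, counts[name]) for name in names]


def group_by_name(mapped):
    """Shuffle step: collect the partial counts for each name into a list."""
    grouped = defaultdict(list)
    for partial in mapped:
        for name, count in partial:
            grouped[name].append(count)
    return dict(grouped)


def reduce_counts(grouped):
    """Reduce step: sum the list of partial counts for each name."""
    return {name: sum(counts) for name, counts in grouped.items()}


mapped = [map_partition(rows, NAMES) for rows in PARTITIONS]
grouped = group_by_name(mapped)
totals = reduce_counts(grouped)
print(grouped)  # per-name lists of partial counts, one entry per machine
print(totals)   # per-name totals across all machines
```

Each machine only scans its own rows, so the expensive part runs in parallel; the reduce step only has to add up four small numbers per name.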
Hadoop DFS 
Hadoop Map Reduce 
• map (k1, v1) → list(k2, v2)
• reduce (k2, list(v2)) → list(v2)
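To make the two signatures concrete, here is a minimal single-process sketch in Python. It is not the Hadoop API: `run_mapreduce`, `word_mapper` and `sum_reducer` are hypothetical names, and the grouping that a real cluster performs across machines is simulated with an in-memory dictionary.

```python
from collections import defaultdict


def run_mapreduce(records, mapper, reducer):
    """Apply mapper to every (k1, v1), group values by k2, then reduce each group."""
    groups = defaultdict(list)
    for k1, v1 in records:
        # map (k1, v1) -> list of (k2, v2)
        for k2, v2 in mapper(k1, v1):
            groups[k2].append(v2)
    output = {}
    for k2 in sorted(groups):
        # reduce (k2, list(v2)) -> list(v2)
        output[k2] = reducer(k2, groups[k2])
    return output


def word_mapper(line_number, line):
    """Emit (word, 1) for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def sum_reducer(word, counts):
    """Collapse all the 1s emitted for a word into a single total."""
    return [sum(counts)]


lines = [(1, "big data big tools"), (2, "Big elephant")]
word_counts = run_mapreduce(lines, word_mapper, sum_reducer)
print(word_counts)
```

The mapper and reducer never see the whole data set, only one record or one key's values at a time, which is what allows the same code to run unchanged on a much larger input.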
Hadoop Map Reduce 
Hadoop Ecosystem 
NoSQL: “No to SQL” OR “Not Only SQL”
NoSQL Overview 
NoSQL Models
Volatile Storage: Memcached, Ehcache
Persistent Storage
• Key / Value Databases: Voldemort, Redis, Scalaris
• Columnar Databases: HBase, Cassandra, Hypertable
• Document Databases: MongoDB, CouchDB
• Graph Databases: InfoGrid, Neo4j
• Other Databases: Kyoto Cabinet, Berkeley DB
NoSQL Characteristics
TYPICAL BENEFITS
• Scalability
• Availability
• Near-real-time Performance
• Modeling flexibility
• Deployment flexibility
Auto-Sharding
Failover
Schema-less
Intelligent client
In-memory flush to disk
Dynamic clustering
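As a rough illustration of what auto-sharding, an intelligent client and schema-less storage mean in practice, the sketch below routes each key to one of several in-memory shards. Everything here is an assumption made for the example: `ShardedClient` is a hypothetical class, plain dictionaries stand in for storage nodes, and simple modulo hashing is used, which is not necessarily how any product named in this deck distributes data.

```python
import hashlib


class ShardedClient:
    """Hypothetical 'intelligent client' that routes each key to one shard."""

    def __init__(self, shard_count):
        # Each dict stands in for one storage node.
        self.shards = [{} for _ in range(shard_count)]

    def shard_for(self, key):
        # A stable hash makes the same key always map to the same shard.
        digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
        return int(digest, 16) % len(self.shards)

    def put(self, key, value):
        self.shards[self.shard_for(key)][key] = value

    def get(self, key):
        return self.shards[self.shard_for(key)].get(key)


client = ShardedClient(shard_count=4)
# Schema-less values: each record can carry different fields.
client.put("user:1", {"name": "Scott"})
client.put("user:2", {"name": "Bob", "city": "Atlanta"})
print(client.get("user:2"))
```

Because the client computes the shard itself, no central lookup is needed on the read or write path; the trade-off of modulo hashing is that changing the shard count remaps most keys.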
MPP: Massively Parallel Processing DW
MPP/ Columnar Stores 
• Oracle Exadata 
• IBM Netezza 
• Teradata 
• EMC Greenplum 
• HP Vertica 
• ParAccel 
• Microsoft SQL Server PDW 
Big Data: Microsoft 
NewSQL: New Generation DB 
New SQL / Cloud DB 
• SimpleDB 
• DynamoDB 
• NuoDB 
• Tokutek
• VoltDB 
• NimbusDB 
• Clustrix 
• Xeround
ETL, BI & Reporting 
ETL, BI & Reporting 
• Hadoop/MPP/NoSQL support in:
• Informatica, DataStage
• Talend, Pentaho
• MicroStrategy, SAS
• Tableau, QlikView, Intellicus
Big Data & Cloud
Big Data & Cloud 
• Marriage made in heaven 
• Big data demands met by Cloud scalability 
• IaaS, PaaS and DaaS offerings
• AWS EMR, SimpleDB, RDS 
• Azure SQL Server, Hadoop 
• Google 
Real Time Analytics 
Real Time Analytics 
• Storm 
• HStreaming, StreamBase 
• Microsoft StreamInsight 
• IBM Streams 
• Oracle SQLstream 
• Complex Event Processing engines: Esper, etc.
Big Data Impact on us 
Big Data Careers 
• ETL Developers 
• Database Administrators 
• Database SQL Developers 
• Solution/Technical Architects
• Data Scientists
Enhance OR Extend, NOT Replace
Some Big Data Careers
• Hadoop/Hive Developers: Java, Hive
• Hadoop Architects: Java, DW, ETL
• Hadoop Administrators: Linux, Java
• NoSQL Developers: Java/Python/Ruby
• MPP DW Developers: SQL, Data Modeling
• MPP DW Admin: Linux, SQL
• Data Scientist: Machine Learning
• Big Data Architect: Solution/Technical Architecture
Typical Big Data Architecture 
Credits & Acknowledgements 
• Company Logos: Creative Commons/ Company Copyrighted/ Trademarked
• Hadoop Elephant Images: Apache Trademarked
• Cloudera.com, hadoop.apache.org, Oracle big data web site, Indeed.com, McKinsey report, dzone.org, microsoft.com
• The Awesome at Impetus: Team of Big Data architects and practitioners
Thank You 
Write to us at inquiry@impetus.com 
Follow us on Twitter @impetustech

Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
 
Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trends
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph lab
 
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
 
Performance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastPerformance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus Webcast
 


Big Data Landscape and Technologies Overview

  • 1. Impetus Technologies Inc. (© 2014 Impetus Technologies) Big Data. Presented By: Sanjay Sharma
  • 2. Outline • About • Big Data: Recap • Big Data Technologies Landscape • Hadoop Overview • Other Big Data Tools Overview • Impact on IT and us
  • 3. About • Big Data Solution Architect • Work for Impetus Technologies • Based out of San Jose, Atlanta & India (1300+ Engineers) • Thought Leaders in Big Data consulting • Started Big Data Labs 4 years ago
  • 4. Big Data: Velocity, Volume, Variety
  • 5. Big Data Opportunity: Visualization, Personalization, Optimization, Advanced Predictive Analytics, Time to Market, Business Opportunity
  • 6. Big Data Users – 2009
  • 7. Big Data Users – 2010
  • 8. Big Data Job Trends • Top Job Trends (Indeed.com, July 2012): HTML5, MongoDB, iOS, Android, Mobile app, Puppet, Hadoop, jQuery, PaaS, Social Media
  • 9. Big Data Everywhere • Search: Atlanta, GA; Date: 10/16/2012; “Big Data” = 82, “Hadoop” = 78, “NoSQL” = 117, MPP DBs = 192 • Search: USA; Date: 10/16/2012; “Big Data” = 5169, “Hadoop” = 5174, “NoSQL” = 3820, MPP DBs = 4581
  • 10. Big Data Future • Source: McKinsey, http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation
  • 11. Big Data Landscapes • Hadoop, MPP, NewSQL, NoSQL • Scalability Limits • Online vs. Batch • Open Source • Writes/Reads • Petabyte Scale • Commodity • DW Vendors • Appliances
  • 12. Big Data Vendor Galore
  • 13. Hadoop: Glory to the Elephant
  • 14. Hadoop • Distributed File System: Petabyte Scale; Thousands of Commodity Servers; High Availability; High Fault Tolerance • Distributed Processing System: Simple, easy-to-code algorithm; Code once, run on PBs; High Fault Tolerance; Data Locality
  • 15. Map Concept: RDBMS • BIG COMBINED TABLE (Id, Name, Other Columns): 1 Scott, 2 Bob, 3 Lisa, 4 Sanjay, …, 256 million Bob. Select count(*), 'Bob' from table where name='Bob'; -> 12, 'Bob'. Select count(*), 'Scott' from table where name='Scott'; -> 4, 'Scott' • TABLE 1 - on m/c 1 (Id, Name): 1 Scott, 2 Bob, 3 Lisa, 4 Sanjay, …, 64 million Bob. Select 'Bob', count(*) from table where name='Bob'; (same queries for 'Scott' & 'Sanjay') -> "Bob",3 "Scott",1 "Sanjay",2 • TABLE 2 - on m/c 2 (Id, Name): 64 million and 1 Scott, 64 million and 2 Bob, 64 million and 3 Lisa, 64 million and 4 Sanjay, …, 128 million Bob. Select count(*), 'Bob' from table where name='Bob'; -> "Bob",3 "Scott",1 "Sanjay",0 • TABLE 3 - on m/c 3 (Id, Name): 128 million and 1 Scott, 128 million and 2 Bob, 128 million and 3 Lisa, 128 million and 4 Sanjay, …, 192 million Bob. Select count(*), 'Bob' from table where name='Bob'; -> "Bob",3 "Scott",1 "Sanjay",1 • TABLE 4 - on m/c 4 (Id, Name): 192 million and 1 Scott, 192 million and 2 Bob, 192 million and 3 Lisa, 192 million and 4 Lisa, …, 256 million Bob. Select count(*), 'Bob' from table where name='Bob'; -> "Bob",3 "Scott",1 "Sanjay",1
  • 16. Reduce Concept: RDBMS • "Scott", list([1,1,1,1]): List[1,1,1,1].iterate -> Sum(EACH) • "Bob", list([3,3,3,3]): List[3,3,3,3].iterate -> Sum(EACH) • "Sanjay", list([2,0,1,1]): List[2,0,1,1].iterate -> Sum(EACH)
  • 17. Hadoop DFS
  • 18. Hadoop Map Reduce • map (k1, v1) → list(k2, v2) • reduce (k2, list(v2)) → list(v2)
  • 19. Hadoop Map Reduce
  • 20. Hadoop Ecosystem
  • 21. NoSQL: “No to SQL” OR “Not Only SQL”
  • 22. NoSQL Overview • NoSQL Models • Volatile Storage: Memcached, Ehcache • Persistent Storage • Key / Value Databases: Voldemort, Redis, Scalaris • Columnar Databases: HBase, Cassandra, Hypertable • Document Databases: MongoDB, CouchDB • Graph Databases: InfoGrid, Neo4j • Other Databases: Kyoto Cabinet, Berkeley DB
  • 23. NoSQL Characteristics • TYPICAL BENEFITS: Scalability; Availability; Near-real-time Performance; Modeling flexibility; Deployment flexibility • Auto-Sharding • Failover • Schema-less • Intelligent client • In-memory flush to disk • Dynamic clustering
  • 24. MPP: Massively Parallel Processing DW
  • 25. MPP/ Columnar Stores • Oracle Exadata • IBM Netezza • Teradata • EMC Greenplum • HP Vertica • ParAccel • Microsoft SQL Server PDW
  • 26.
  • 27. Big Data: Microsoft
  • 28. NewSQL: New Generation DB
  • 29. NewSQL / Cloud DB • SimpleDB • DynamoDB • NuoDB • Tokutek • VoltDB • NimbusDB • Clustrix • Xeround
  • 30. ETL, BI & Reporting
  • 31. ETL, BI & Reporting • Hadoop/ MPP/ NoSQL support in: • Informatica, DataStage • Talend, Pentaho • MicroStrategy, SAS • Tableau, QlikView, Intellicus
  • 32. Big Data & Cloud
  • 33. Big Data & Cloud • Marriage made in heaven • Big data demands met by Cloud scalability • IaaS, PaaS and DaaS offerings • AWS EMR, SimpleDB, RDS • Azure SQL Server, Hadoop • Google
  • 34. Real Time Analytics
  • 35. Real Time Analytics • Storm • HStreaming, StreamBase • Microsoft StreamInsight • IBM Streams • Oracle SQLstream • Complex Event Processing engines: Esper etc.
  • 36. Big Data Impact on us
  • 37. Big Data Careers • ETL Developers • Database Administrators • Database SQL Developers • Solution/ Technical Architects • Data Scientists • Enhance OR Extend - NOT Replace
  • 38. Some Big Data Careers • Hadoop/Hive Developers - Java, Hive • Hadoop Architects - Java, DW, ETL • Hadoop Administrators - Linux, Java • NoSQL Developers - Java/ Python/ Ruby • MPP DW Developers - SQL, Data Modeling • MPP DW Admin - Linux, SQL • Data Scientist - Machine Learning • Big Data Architect - Solution/ Technical Architecture
  • 39. Typical Big Data Architecture
  • 40. Credits & Acknowledgements • Company Logos – Creative Commons/ Company Copyrighted/ Trademarked • Hadoop Elephant Images – Apache Trademarked • Cloudera.com, hadoop.apache.org, Oracle big data web site, Indeed.com, McKinsey report, dzone.org, microsoft.com • The Awesome at Impetus - Team of Big Data architects and practitioners
  • 41. Thank You • Write to us at inquiry@impetus.com • Follow us on Twitter @impetustech
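
The map and reduce idea from slides 15, 16 and 18 can be tried out in a few lines of plain Python. This is only a minimal sketch under stated assumptions, not Hadoop code: the function names (`map_partition`, `shuffle`, `reduce_counts`) and the tiny in-memory partitions are hypothetical stand-ins for the four 64-million-row tables on machines 1 to 4, chosen so that the per-machine counts match the ones shown on the slides.

```python
from collections import defaultdict

# Names counted on the slides; every partition emits a count for each of them.
NAMES = ("Bob", "Scott", "Sanjay")


def map_partition(rows, names=NAMES):
    """Map step: run the same count query on one machine's slice of the table.

    Emits a list of (name, local_count) pairs, i.e. list(k2, v2).
    """
    return [(name, sum(1 for row in rows if row == name)) for name in names]


def shuffle(mapped):
    """Group emitted values by key: name -> list of per-machine counts."""
    grouped = defaultdict(list)
    for pairs in mapped:
        for key, value in pairs:
            grouped[key].append(value)
    return dict(grouped)


def reduce_counts(key, values):
    """Reduce step: iterate over the list for one key and sum each value."""
    return key, sum(values)


# Hypothetical miniature stand-ins for TABLE 1..4 (one list per machine).
partitions = [
    ["Scott", "Bob", "Lisa", "Sanjay", "Bob", "Sanjay", "Bob"],
    ["Scott", "Bob", "Lisa", "Bob", "Bob"],
    ["Scott", "Bob", "Lisa", "Sanjay", "Bob", "Bob"],
    ["Scott", "Bob", "Lisa", "Lisa", "Sanjay", "Bob", "Bob"],
]

mapped = [map_partition(rows) for rows in partitions]
grouped = shuffle(mapped)
totals = dict(reduce_counts(key, values) for key, values in grouped.items())

print(grouped)
print(totals)
```

In this toy version the grouping step is an explicit function; in a real cluster the framework performs it between the map and reduce phases, and each partition would be processed on the machine that holds it.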