SlideShare ist ein Scribd-Unternehmen logo
1 von 44
Photo Credit: http://www.crosseyedlife.com/teaching-resources/
Menu
Who am I?
Early adopters of Hadoop
Next generation use cases
Changing big data architectures
Art of the possible
My request
Questions
Appetiz
er
Main
Dessert
Who am I?
Google, Software Engineer
Personalized Search
Personalized Recommendations
WibiData, CTO
Real-time Personalization Platform
Customer Use Cases
EARLY ADOPTERS OF HADOOP
Early AdopterEarly Majority
Collect Everything
Keep Everything
Ask Anything
Collect Everything
Collect Everything
Collect Everything
Collect Everything
Collect Everything
Maybe I
should, too?
Keep Everything
Blind Spots
1. New, high-value use cases
1. Architectural changes to
support broader use cases
1. The ultimate strategic
goals of early adopters
NEXT GENERATION USE CASES
Blind Spot Number 1
Recommendations
Recommendations
Search
Prediction and Prevention
Targeted Offers
Customer Experience Optimization
Clearly, early adopters have
moved beyond ETL.
Life After ETL
Understanding
360-degree customer views
Visualization
Graphs
Exploration
Trends
Customer segmentation
ROI
Prediction
Action
Recommendations
Prevention
Mobile
Offers
Recommendations
Localization
Search
Personalization
Evolution of Enterprise Data
Collect Organize Understand ActUnderstandUnderstand
CHANGING ARCHITECTURE
Blind Spot Number 2
Sometimes, supporting a new use case
requires a different architecture.
Evolution of Enterprise Data
Collect Organize Understand Act
Collect Organize Understand
Key Ingredients
Data
Consolidation
Organization
Experimentation
Try something!
Rapid iteration
Tuning
Deployment
Evaluation
Real time
Required to Understand
Required to Act
Web Web Web
HDFS
Logs
Txns
POS
Third
Party
Data
1. Collect
MapReduce
Web Web Web
HDFS
Logs
Txns
POS
Third
Party
Data
1. Collect
2. Organize
Data Warehouse
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
MapReduce
Data Warehouse
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
4. Act
MapReduceHBase
Data Warehouse
Key Ingredients
Data
Consolidation
Organization
Experimentation
Try something!
Rapid iteration
Tuning
Deployment
Evaluation
Real time
Required to Understand
Required to Act
Did we get any
of these?
Early Adopter Migration Strategies
Add serving capability
Key-value store
Indexing
Add stream processing
Storm
Samza
Lambda architecture
Add both
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
4. Act
MapReduceHBase
Data Warehouse
HBaseStorm
Query
BatchServingSpeed
Key Ingredients
Data
Consolidation
Organization
Experimentation
Try something!
Rapid iteration
Tuning
Deployment
Evaluation
Real time
Required to Understand
Required to Act
Did we get any
of these?
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
4. Act
MapReduce
Data Warehouse
HBaseStorm
Query
BatchServingSpeed
ART OF THE POSSIBLE
Blind Spot Number 3
Photo credit: http://mediahub.olive.co.uk/blog/the-art-of-the-possible
You can’t build a data platform to solve
a problem you haven’t identified yet.
What’s Next?
Collect Organize Understand Act ?
What’s Next?
Collect
OrganizeUnderstand
Act
Where is the Value?
Collect Organize Understand Act
0%
20%
40%
60%
80%
100%
Collect Organize Understand Act
“As the amount of data goes up,
the importance of human judgment
should go down”
- Andrew McAfee
HBR Blog
Question
Hypothesis
PredictionTesting
Analysis
Hire smarter people
Faster EDW
Hire smarter peopleFaster Deployment
Faster EDW
Testing
What does this all mean?
The real value is in next generation “action”
use cases
The architecture for “action” is different
Design for your problem, since you don’t know
the art of the possible.
Requirements first, then technology
My Request
Stop building faster data warehouses.
You already understand your data.
Turn your understanding into action.
Questions?
Garrett Wu
http://www.wibidata.com
gwu@wibidata.com

Weitere ähnliche Inhalte

Was ist angesagt?

Activating Governance for End Users in Office 365 and SharePoint - SPS Vanc...
Activating Governance for End Users in Office 365 and SharePoint -   SPS Vanc...Activating Governance for End Users in Office 365 and SharePoint -   SPS Vanc...
Activating Governance for End Users in Office 365 and SharePoint - SPS Vanc...Heather Newman
 
Google Survivor Tips at 2011 SMX Advanced
Google Survivor Tips at 2011 SMX AdvancedGoogle Survivor Tips at 2011 SMX Advanced
Google Survivor Tips at 2011 SMX AdvancedMicah Fisher-Kirshner
 
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...Heather Newman
 
How to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
How to Build a Successful Data Team - Florian Douetteau @ PAPIs ConnectHow to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
How to Build a Successful Data Team - Florian Douetteau @ PAPIs ConnectPAPIs.io
 
The Road to Awesome SharePoint Adoption - SPTechCon June 2016
The Road to Awesome SharePoint Adoption - SPTechCon June 2016The Road to Awesome SharePoint Adoption - SPTechCon June 2016
The Road to Awesome SharePoint Adoption - SPTechCon June 2016Heather Newman
 
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...Heather Newman
 
Staffing your analytics team: 6 skill sets
Staffing your analytics team:  6 skill setsStaffing your analytics team:  6 skill sets
Staffing your analytics team: 6 skill setsDavid Stephenson, Ph.D.
 
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...Heather Newman
 
Introduction to Mahout with HDInsight
Introduction to Mahout with HDInsightIntroduction to Mahout with HDInsight
Introduction to Mahout with HDInsightChris Price
 
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...Heather Newman
 
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...Heather Newman
 
The Limitations of Web Scraping Tools
The Limitations of Web Scraping ToolsThe Limitations of Web Scraping Tools
The Limitations of Web Scraping ToolsPromptCloud
 
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016Heather Newman
 
BigData Meets the Federal Data Center
BigData Meets the Federal Data CenterBigData Meets the Federal Data Center
BigData Meets the Federal Data CenterAbe Usher
 
Christoph Luetke Schelhowe - Data for Everyone
Christoph Luetke Schelhowe  - Data for EveryoneChristoph Luetke Schelhowe  - Data for Everyone
Christoph Luetke Schelhowe - Data for EveryoneCXL
 
O'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustO'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustPeter Skomoroch
 
RightScale Webinar: Introducing Cloud Analytics
RightScale Webinar: Introducing Cloud AnalyticsRightScale Webinar: Introducing Cloud Analytics
RightScale Webinar: Introducing Cloud AnalyticsRightScale
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science SummitPeter Skomoroch
 
Publishers presentation ucl
Publishers presentation uclPublishers presentation ucl
Publishers presentation uclStephen Morgan
 

Was ist angesagt? (20)

Activating Governance for End Users in Office 365 and SharePoint - SPS Vanc...
Activating Governance for End Users in Office 365 and SharePoint -   SPS Vanc...Activating Governance for End Users in Office 365 and SharePoint -   SPS Vanc...
Activating Governance for End Users in Office 365 and SharePoint - SPS Vanc...
 
Google Survivor Tips at 2011 SMX Advanced
Google Survivor Tips at 2011 SMX AdvancedGoogle Survivor Tips at 2011 SMX Advanced
Google Survivor Tips at 2011 SMX Advanced
 
Belvilla
BelvillaBelvilla
Belvilla
 
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
 
How to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
How to Build a Successful Data Team - Florian Douetteau @ PAPIs ConnectHow to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
How to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
 
The Road to Awesome SharePoint Adoption - SPTechCon June 2016
The Road to Awesome SharePoint Adoption - SPTechCon June 2016The Road to Awesome SharePoint Adoption - SPTechCon June 2016
The Road to Awesome SharePoint Adoption - SPTechCon June 2016
 
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
 
Staffing your analytics team: 6 skill sets
Staffing your analytics team:  6 skill setsStaffing your analytics team:  6 skill sets
Staffing your analytics team: 6 skill sets
 
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
 
Introduction to Mahout with HDInsight
Introduction to Mahout with HDInsightIntroduction to Mahout with HDInsight
Introduction to Mahout with HDInsight
 
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
 
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
 
The Limitations of Web Scraping Tools
The Limitations of Web Scraping ToolsThe Limitations of Web Scraping Tools
The Limitations of Web Scraping Tools
 
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
 
BigData Meets the Federal Data Center
BigData Meets the Federal Data CenterBigData Meets the Federal Data Center
BigData Meets the Federal Data Center
 
Christoph Luetke Schelhowe - Data for Everyone
Christoph Luetke Schelhowe  - Data for EveryoneChristoph Luetke Schelhowe  - Data for Everyone
Christoph Luetke Schelhowe - Data for Everyone
 
O'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustO'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data Exhaust
 
RightScale Webinar: Introducing Cloud Analytics
RightScale Webinar: Introducing Cloud AnalyticsRightScale Webinar: Introducing Cloud Analytics
RightScale Webinar: Introducing Cloud Analytics
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science Summit
 
Publishers presentation ucl
Publishers presentation uclPublishers presentation ucl
Publishers presentation ucl
 

Andere mochten auch

Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...DataWorks Summit
 
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...Innovative Management Services
 
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...DataWorks Summit
 
Тематическое планирование 8 класс
Тематическое планирование 8 классТематическое планирование 8 класс
Тематическое планирование 8 классkoneqq
 
Doctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & RocíoDoctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & RocíoAndrea Izzo
 
Aggietarium slideshow final
Aggietarium slideshow finalAggietarium slideshow final
Aggietarium slideshow finalAna Monzon
 
Pure Storage Customer Business and IT Transformation
Pure Storage Customer Business and IT TransformationPure Storage Customer Business and IT Transformation
Pure Storage Customer Business and IT TransformationPure Storage
 
План самообразования
План самообразованияПлан самообразования
План самообразованияkoneqq
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageHortonworks
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersDataWorks Summit
 
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin BuzzwordsDataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin BuzzwordsDataiku
 
Кроссворд "Грибы"
Кроссворд "Грибы"Кроссворд "Грибы"
Кроссворд "Грибы"koneqq
 
Earth science pptx
Earth science pptxEarth science pptx
Earth science pptxsihellyay
 

Andere mochten auch (20)

Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...
 
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
 
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
 
Practical Placement
Practical PlacementPractical Placement
Practical Placement
 
Oxleas_Review_2012_website_version_1
Oxleas_Review_2012_website_version_1Oxleas_Review_2012_website_version_1
Oxleas_Review_2012_website_version_1
 
Тематическое планирование 8 класс
Тематическое планирование 8 классТематическое планирование 8 класс
Тематическое планирование 8 класс
 
Doctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & RocíoDoctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & Rocío
 
Aggietarium slideshow final
Aggietarium slideshow finalAggietarium slideshow final
Aggietarium slideshow final
 
Pay periods use show
Pay periods use showPay periods use show
Pay periods use show
 
Pure Storage Customer Business and IT Transformation
Pure Storage Customer Business and IT TransformationPure Storage Customer Business and IT Transformation
Pure Storage Customer Business and IT Transformation
 
План самообразования
План самообразованияПлан самообразования
План самообразования
 
Active and passive voice
Active and passive voice Active and passive voice
Active and passive voice
 
Step by step essay
Step by step essayStep by step essay
Step by step essay
 
Zachatie Bulgaria
Zachatie BulgariaZachatie Bulgaria
Zachatie Bulgaria
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble Storage
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
 
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin BuzzwordsDataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
 
Auto loans
Auto loansAuto loans
Auto loans
 
Кроссворд "Грибы"
Кроссворд "Грибы"Кроссворд "Грибы"
Кроссворд "Грибы"
 
Earth science pptx
Earth science pptxEarth science pptx
Earth science pptx
 

Ähnlich wie Move Beyond ETL: Tapping the True Business Value of Hadoop

Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Jonathan Seidman
 
Gartner peer forum sept 2011 orbitz
Gartner peer forum sept 2011   orbitzGartner peer forum sept 2011   orbitz
Gartner peer forum sept 2011 orbitzRaghu Kashyap
 
Search Analytics at Enterprise Search Summit Fall 2011
Search Analytics at Enterprise Search Summit Fall 2011Search Analytics at Enterprise Search Summit Fall 2011
Search Analytics at Enterprise Search Summit Fall 2011Sematext Group, Inc.
 
Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011Raghu Kashyap
 
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...Rackspace
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.Jennifer Walker
 
Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304James Kenney
 
Embracing Hadoop with a musical touch!
Embracing Hadoop with a musical touch!Embracing Hadoop with a musical touch!
Embracing Hadoop with a musical touch!DataWorks Summit
 
5 Tips to Bulletproof Your Analytics Implementation
5 Tips to Bulletproof Your Analytics Implementation5 Tips to Bulletproof Your Analytics Implementation
5 Tips to Bulletproof Your Analytics ImplementationObservePoint
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunitiesBigdata Meetup Kochi
 
Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010Cloudera, Inc.
 
Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010Cloudera, Inc.
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data AnalyticsVMware Tanzu
 
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...Cloudera, Inc.
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big DataInfochimps, a CSC Big Data Business
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...Hortonworks
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterThe Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterInside Analysis
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big DataJames Serra
 

Ähnlich wie Move Beyond ETL: Tapping the True Business Value of Hadoop (20)

Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
 
Gartner peer forum sept 2011 orbitz
Gartner peer forum sept 2011   orbitzGartner peer forum sept 2011   orbitz
Gartner peer forum sept 2011 orbitz
 
Search Analytics at Enterprise Search Summit Fall 2011
Search Analytics at Enterprise Search Summit Fall 2011Search Analytics at Enterprise Search Summit Fall 2011
Search Analytics at Enterprise Search Summit Fall 2011
 
Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011
 
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
 
Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304
 
Embracing Hadoop with a musical touch!
Embracing Hadoop with a musical touch!Embracing Hadoop with a musical touch!
Embracing Hadoop with a musical touch!
 
5 Tips to Bulletproof Your Analytics Implementation
5 Tips to Bulletproof Your Analytics Implementation5 Tips to Bulletproof Your Analytics Implementation
5 Tips to Bulletproof Your Analytics Implementation
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Big data and Hadoop Training Brochure
Big data and Hadoop Training Brochure Big data and Hadoop Training Brochure
Big data and Hadoop Training Brochure
 
Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010
 
Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
 
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterThe Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value Thereafter
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big Data
 

Mehr von DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Mehr von DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Kürzlich hochgeladen

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 

Kürzlich hochgeladen (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

Move Beyond ETL: Tapping the True Business Value of Hadoop

Hinweis der Redaktion

  1. This talk is really about blind spots. I believe there are three that are ultimately keeping many of you from “tapping the true value of Hadoop.”
  2. How are we going to store all the information on the internet? Google File System (GFS) How are we going to analyze is? MapReduce (MR) How are we going to do something with it? BigTable (BT)
  3. “I, too, need to store large amounts of data!” These are technology companies The followers on this wave are in other businesses, but need to use technology to move forward They waited to see if these technologies would really work
  4. Three things the follower does not see: New use cases from a few early adopters What changes about the architectures to support new use cases Where the early adopters are ultimately going
  5. I don’t actually mean the use cases that are way out there. I mean the very next ones that you early adopters are doing now, and you should be doing next (this year or next year)
  6. We all know product recommendations
  7. Recommendations are not just for products. Recommend content Recommend people Recommend actions
  8. Auto-complete Recommendations within search Personalized search results Search within the enterprise
  9. Predict energy usage (Opower) Predict weather (Climate Corp) Predict device returns (Motorola)
  10. Deals to tablet and mobile devices
  11. Optimizing experiences on each channel The key ingredients here are: Data consolidation (get everything in one place so it is accessible) Experimentation (try different things on live traffic) Rapid iteration (optimize by making changes quickly)
  12. You should, too. At the very least, you should start doing “traditional BI” on big data.
  13. Next generation use cases are in two categories: Analysis: Now that we have data, and it is consolidated, let’s ask more questions. Action: Now that we have data, and it is consolidated, let’s put it to work.
  14. Followers (early majority) are at the Understand phase. Early adopters are going deep into Understand, or moving on to Act. I really want to talk about the last phase. What are the key ingredients?
  15. Early adopters are changing their system architectures: They are adding new-age tools They are removing and replacing outdated systems They are restructuring and shuffling components
  16. Review the difference between building upon understanding versus moving into action.
  17. You got data delivered back into the application, but did you include any of the key ingredients?
  18. Let’s focus on the early adopters who migrated into action. What have they done? We have already added the KVStore, HBase, to connect data back to the frontends. We can add a stream processing engine to get real-time. We can use the Lambda architecture to get all sorts of nice properties like immutable data sources, and make only incremental additions.
  19. What does it look like to go through this process of “going deep” into action? Add room for a stream processing system (Storm, Samza) Add a query layer on top to join the results from the batch layer from the speed layer
  20. You got data delivered back into the application, but did you include any of the key ingredients?
  21. To make a change to something you need to edit the batch layer, the speed layer, and potentially the query that joins the two.
  22. You don’t have enough data to see the future of where people are going.
  23. What’s next?
  24. What’s next?
  25. I don’t know how to quantify the business value. I’ll leave that to Gartner. But I hope that I can convince you that: The intrinsic value of each phase is greater than the previous. What good is collecting data if you don’t do anything with it? What good is it if you don’t understand it? The realized value to the business at each phase is even more extreme that what I’ve shown here. What good is understanding unless you do something with it? You can do something with it as a human being, but many more decisions now are made by machines, not humans.
  26. How long does this take? The testing, aka experiment design, development, and deployment is the bottleneck. Why are you spending so much money working on increasing the speed of these other phases?
  27. What you would design to solve the first three phases (up to understanding) is different from what you would build to solve “action.” We don’t know what’s coming next. Design for your problem. And do so without just blindly following the early adopters. Instead, start with your requirements, and design with purpose.