SlideShare ist ein Scribd-Unternehmen logo
1 von 30
1© Cloudera, Inc. All rights reserved.
Your customer’s journey to success AND
Your journey to quicker and larger expansions
Making Data Real
2© Cloudera, Inc. All rights reserved.
What we are going to discuss here
1. How advanced analytics is better than data
warehousing
2. Being data-driven is a journey, not a project
3. Our most successful customers do five key things
3© Cloudera, Inc. All rights reserved.
Innovation around the world driven by data
Using data about bets per second
and machine learning to promote
responsible gambling by
customising offers to minimise the
customer's vulnerability.
A "smart business" application for
small businesses that enables them
to see patterns in an anonymised
data generated by the bank's other
customers.
CONNECT PRODUCT & SERVICES (IoT) DRIVE CUSTOMER INSIGHTSPROTECT LIVES
Analyzing acoustic data coming
from turbines in real-time to
monitor the health of and predict
failures in turbines for hydro power
stations.
4© Cloudera, Inc. All rights reserved.
Advanced analytics is better than data warehousing
Build your data asset economically and at scale
1. Collect data in native format – enables agility
2. Build history by collecting data prior to its use
Securely share on-prem, in cloud, anywhere
3. Security at the data layer increases flexibility and ability to protect privacy
4. Create community data and drive innovation by sharing across your business
Innovate with analytics and operationalize the insights
5. Analyze data in near real-time
6. Build and deploy machine learning models and other advanced analytics
7. Deliver insights via enterprise, mobile and web applications
5© Cloudera, Inc. All rights reserved.
Think Big.
Start small.
Iterate to success.
Being data-driven is a journey, not a project.
6© Cloudera, Inc. All rights reserved.
Customers ask: what does ‘think big’ mean?
Determine your strategic initiatives.
Read your annual report.
Define a reasonable timeframe and goals.
Typically 3-5 strategic initiatives in parallel
Often segmented by business unit
At maturity initiatives cross business units.
7© Cloudera, Inc. All rights reserved.
Customers ask: what does ‘start small’ mean?
Embrace the familiar.
Enhance something you know.
Make a report better with more data.
Then go get a shiny object (new data).
Bring in and integrate.
Showcase your results with a visualization.
8© Cloudera, Inc. All rights reserved.
Customers ask: what does ‘iterate often’ mean?
Break strategic initiatives into quarterly objectives.
Break your quarterly objectives into sprints.
Deliver and visualize outcomes at every sprint exit.
Continuously learning and adapting.
Outcomes can be both positive and negative.
Outcomes can be about the business and data.
9© Cloudera, Inc. All rights reserved.
Our Most Successful Customers iterate these Five Things
1. Build a Big Data Culture
Led by an enabled executive sponsor(s). Communication methodologies. Advocating change.
2. Assemble the right team
Tightly aligned team. Mix of seasoned experts and innovators
3. Adopt an agile approach for data engineering, data science, analysis
Successful projects start small, are hypothesis driven and iterate to success approach.
Roadmaps: Document expected direction, yet expect insights to create change
4. Efficiently operationalize insights
Analytics -> Reports, Big Data -> Actions. Create a bridge between Dev and Ops
5. Rightsize your data governance
Rightsize and iteratively building towards maturity.
10© Cloudera, Inc. All rights reserved.
Description
Executive
Sponsorship
 Executive Sponsor for the overall Big Data mission including advocacy for
creating/collecting data and business stakeholders for individual use cases. Align to
strategic initiatives.
Community
 Build community through communications about vision, insights, data and platform
and technology.
 Make communications more programmatic across the entire organization with
meet ups, big data days and hackathons.
Foster a culture that iteratively and continuous builds a strong
sharing community. Enable many in the organization – over time –
to become evangelists
1. Build a data-driven culture
Visualizations  Visualize EVERYTHING! Use visualizations to tell stories about the data asset itself
(how big is it, how fast is it growing) as well as insights found for the business.
11© Cloudera, Inc. All rights reserved.
Logical Information Architecture
An environment that supports new ways of working
Ingestion Zone Discovery Zone Integrated Zone Production
Trusted power users have broad
access & new tools.
The new data engineering
team partners with
data stewards to ingest
full fidelity raw data
Continuous deployment concepts move
data, models, etc from exploratory
environment to production
Business users have
narrowed access with
traditional BI tools
and applications
12© Cloudera, Inc. All rights reserved.
Ingestion Zone Discovery Zone Integrated Zone Production
Raw Trusted
Ingest
Validation &
Verification
Enrichment Transform Routing
Logical Information Architecture
An environment that supports new ways of working
13© Cloudera, Inc. All rights reserved.
2. Assemble the right team 1
Executive
Architecture &
Operations
Data
Engineering
Data Science,
SQL & app
development
Vision and Goals
14© Cloudera, Inc. All rights reserved.
Description
An essential key to success is having a strong executive sponsor for the
overall Big Data mission including advocacy for creating/collecting data
and business stakeholders for individual use cases.
Profile
 An executive focused on change, and willing to take risk to ensure the
success of the business via the Big Data initiatives.
Education
 Use every opportunity to bring the topic in front of potential
sponsors and stakeholders. Share industry and business potential
ROI models (heeding the warning not to overstate).
Advocacy
 Build big data success stories from within the business. Advocate for
the use of data in new ways. Support the proactive collection of data
and lead the charge to assign value to data.
The Important Role of the Executive Sponsor
15© Cloudera, Inc. All rights reserved.
Hadoop and the Big Data technology ecosystems change rapidly
– infrastructure architecture is a critical component of your
team. Architects need to balance tactical and strategic needs.
Communication
 The software and hardware infrastructure is often physically operated
by an different group. The Architect needs close collaboration.
Education
 Continually explore new technologies, including 3rd party tools –
architects need to stay ahead of the curve. Training is essential:
admin, developer.
Leadership
 Be the infrastructure expert and advise on new projects and new
requirements from the data management team and the business.
Know when to call in the experts on Hadoop and Big Data.
Description
Your Infrastructure Team & Architect
16© Cloudera, Inc. All rights reserved.
Data is only useful if users can employ it in a meaningful way.
Data engineers have to be committed to making your company’s
data the utmost strategic asset, from acquisition to advocacy.
Communication
 Document, secure, audit the data. Create simple schemas and search
indexes for each data set. Create common profiles, and continually
advocate for new data and for improved data.
Education
 Get trained and certified with Cloudera Administrator, Developer,
Data Analyst courses and become an expert with Navigator, Sentry
Leadership
 Promote and evangelize to educate on the value of Big Data, take the
lead on data governance – love the data
Description
Your Data Engineering Team
17© Cloudera, Inc. All rights reserved.
Curiosity
Math &
Statistical
Knowledge
Hacking
skills
Subject
Matter
Expertise
The hybrid data scientist
• Subject Matter Expertise lies
in the business
• Hacking skills can come from
existing IT staff or new hires
• Staff at least one true Ph.D
statistician for model
oversight across all teams
Important character trait
Data Science
A luxury is finding one or more
data scientists that cross these
disciplines
Your Data Scientist Team(s)
18© Cloudera, Inc. All rights reserved.
Often a centralized Data Science team can partner with the
business to identify data that differentiates, explore use cases to
solve, and help to jumpstart business teams. Be mindful not to
overbuild centrally.
Agility  The team must be able to learn quickly and adapt
Skills
 Hybrid skills of computer science (hacking), domain expertise and
at least one true statistician. Data Science training.
Teams
 Often businesses find the domain expertise in-house, add in MS/Ph.D.
candidates from local universities and hire that one true statistician
Experts
 This team must be the “data experts” for the entire company in order
to fulfil the vision of sharing data for maximum innovation
Description
Staff for Success: Data Science-as-a-Service
19© Cloudera, Inc. All rights reserved.
Lower risk
 Risk of funding long-running projects with limited business value is
small. Use daily results to improve the process or change course.
Lower costs
 Can run infrastructure, data and insights workstreams in parallel.
Avoids large build-out of infrastructure and data before insights.
Communication
 With clear short-term results, enables a continuous communications
stream showcasing results or failures
Team
 Can start with small team, and add additional scrum teams as value is
determined and investment is available
Agile methodology provides actionable results more rapidly and
measures the value gained at each step, in small iterations. Agile
should be applied to data and insights project workstreams.
Description
3. Adopt an agile approach to data engineering & science
20© Cloudera, Inc. All rights reserved.
Use Case Development
EDH Buildout
Data Governance & Common Profile Development
Data Engineering
Agile Methodology Enables Iterative Workstreams
Use Case Development/App development/Data science
21© Cloudera, Inc. All rights reserved.
Agile Use Case Development
Scrum Team Release 1 Release 2 Release 3
Production
Ready
Scrum Team Release 1 Release 2
Production
Ready
Release 4
Release 3
Agile Data Ingestion/Management
Scrum Team Release 1 Release 2 Release 3
Production
Ready
Scrum Team Release 1 Release 2
Production
Ready
Release 4
Release 3
Agile Methodology Enables Iterative Workstreams
Agile Data Governance & Common Profile Development
Scrum Team Release 1 Release 2 Release 3
Production
Ready
Scrum Team Release 1 Release 2
Production
Ready
Release 4
Release 3
EDH Buildout
22© Cloudera, Inc. All rights reserved.
Description
DevOps  Operationalizing data and insights from analytics is more like digital and mobile
development cycles, than traditional ERP or RDBMS applications
Communication
 Start by finding those people who understood both sides of the agile development
and IT deployment cycle – and bridge the communication gap between both sides
 Initially most DevOps processes were manual – making sure web code was unit and
functionally tested, ensuring source code was under a control system, etc.
Apply DevOps concepts from the world of application / web
development to the world of data and analytics – from
managing data that needs to move to production, as well as the
models used to create insights from that data.
4. Efficiently operationalize your insights
Continuous
Delivery
 Move towards automation of the processes needed to move new analytical
code/models/data from development into production. Eventually get to complete
automation of those processes, allowing for continuous deployment of new
analytical models and data into production
23© Cloudera, Inc. All rights reserved.
Data Stewards
Owners and/or creators
of the data
Responsibilities
 Providing knowledge
about the data (e.g.
privacy, use case
concerns)
 Documenting and
improving the raw data,
with focus on link-ability
Data Engineers
Implement the data
governance policies
Responsibilities
 Defining and driving the
governance
 Organizing and hosting
the Governance Council
 Delivering and utilizing
tools (e.g. Navigator) to
enforce governance
Data Governance
Council
Business owners of the
Data Governance
Responsibilities
 Communication about
and enforcement of
data governance
 Assigning data steward
roles
 Improving the link-
ability of data
5. Rightsize Your Big Data Governance
24© Cloudera, Inc. All rights reserved.
Rightsize your data governance: Iterate to maturity
Chaos: “We don’t
know what’s in our
data hub”
CYA: Basic
governance artifact
capture
Self-service: Data
curation automation
Automation: Data
stewardship and
lifecycle automation
Continuous
improvement:
ongoing
optimization
1
2
3
4
5Initial
Managed
Standardized
Measured
Optimized
25© Cloudera, Inc. All rights reserved.
Data can make what is
impossible today,
possible tomorrow
26© Cloudera, Inc. All rights reserved.
Changing our relationship with the
products and services we consume
27© Cloudera, Inc. All rights reserved.
Improving reliability, quality,
& sustainability
28© Cloudera, Inc. All rights reserved.
Reaching for the stars
29© Cloudera, Inc. All rights reserved.
Protecting the most vulnerable
30© Cloudera, Inc. All rights reserved.
Thank you

Weitere ähnliche Inhalte

Was ist angesagt?

Strategies for Enterprise Grade Azure-based Analytics
Strategies for Enterprise Grade Azure-based AnalyticsStrategies for Enterprise Grade Azure-based Analytics
Strategies for Enterprise Grade Azure-based Analytics
Cloudera, Inc.
 

Was ist angesagt? (20)

Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data Platform
 
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1

 
IoT-Enabled Predictive Maintenance
IoT-Enabled Predictive MaintenanceIoT-Enabled Predictive Maintenance
IoT-Enabled Predictive Maintenance
 
Advanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine LearningAdvanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine Learning
 
Driving Better Products with Customer Intelligence

Driving Better Products with Customer Intelligence
Driving Better Products with Customer Intelligence

Driving Better Products with Customer Intelligence

 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural Change
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
 
How to Lower TCO and Avoid Cloud Lock-in

How to Lower TCO and Avoid Cloud Lock-in
How to Lower TCO and Avoid Cloud Lock-in

How to Lower TCO and Avoid Cloud Lock-in

 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your Organization
 
Customer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWSCustomer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWS
 
Deep Learning with Cloudera
Deep Learning with ClouderaDeep Learning with Cloudera
Deep Learning with Cloudera
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Preparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity RenaissancePreparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity Renaissance
 
Strategies for Enterprise Grade Azure-based Analytics
Strategies for Enterprise Grade Azure-based AnalyticsStrategies for Enterprise Grade Azure-based Analytics
Strategies for Enterprise Grade Azure-based Analytics
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache Impala
 
Optimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsOptimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analytics
 
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
 

Ähnlich wie The Five Markers on Your Big Data Journey

Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
Julian Tong
 
Hu Yoshida's Point of View: Competing In An Always On World
Hu Yoshida's Point of View: Competing In An Always On WorldHu Yoshida's Point of View: Competing In An Always On World
Hu Yoshida's Point of View: Competing In An Always On World
Hitachi Vantara
 
151116 Sedania Cloudera BDA Profile
151116 Sedania Cloudera BDA Profile151116 Sedania Cloudera BDA Profile
151116 Sedania Cloudera BDA Profile
Zarul Zaabah
 

Ähnlich wie The Five Markers on Your Big Data Journey (20)

Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use cases
 
Standing Up an Effective Enterprise Data Hub -- Technology and Beyond
Standing Up an Effective Enterprise Data Hub -- Technology and BeyondStanding Up an Effective Enterprise Data Hub -- Technology and Beyond
Standing Up an Effective Enterprise Data Hub -- Technology and Beyond
 
The Journey to Success with Big Data
The Journey to Success with Big DataThe Journey to Success with Big Data
The Journey to Success with Big Data
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
 
Data Analytics.pptx
Data Analytics.pptxData Analytics.pptx
Data Analytics.pptx
 
Accelerating Innovation in Energy
Accelerating Innovation in EnergyAccelerating Innovation in Energy
Accelerating Innovation in Energy
 
MongoDB World 2019: Data Digital Decoupling
MongoDB World 2019: Data Digital DecouplingMongoDB World 2019: Data Digital Decoupling
MongoDB World 2019: Data Digital Decoupling
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information Access
 
Hu Yoshida's Point of View: Competing In An Always On World
Hu Yoshida's Point of View: Competing In An Always On WorldHu Yoshida's Point of View: Competing In An Always On World
Hu Yoshida's Point of View: Competing In An Always On World
 
151116 Sedania Cloudera BDA Profile
151116 Sedania Cloudera BDA Profile151116 Sedania Cloudera BDA Profile
151116 Sedania Cloudera BDA Profile
 
CIS 499 Final
CIS 499 FinalCIS 499 Final
CIS 499 Final
 
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingAnalyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
 
Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17
 
CSC Conversations
CSC ConversationsCSC Conversations
CSC Conversations
 
Tdwi austin simplifying big data delivery to drive new insights final
Tdwi austin   simplifying big data delivery to drive new insights finalTdwi austin   simplifying big data delivery to drive new insights final
Tdwi austin simplifying big data delivery to drive new insights final
 
The 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: ExposedThe 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: Exposed
 
Leverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for InnovationLeverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for Innovation
 
Modernizing IT in the Platform Era
Modernizing IT in the Platform EraModernizing IT in the Platform Era
Modernizing IT in the Platform Era
 

Mehr von Cloudera, Inc.

Mehr von Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 

Kürzlich hochgeladen

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

The Five Markers on Your Big Data Journey

  • 1. 1© Cloudera, Inc. All rights reserved. Your customer’s journey to success AND Your journey to quicker and larger expansions Making Data Real
  • 2. 2© Cloudera, Inc. All rights reserved. What we are going to discuss here 1. How advanced analytics is better than data warehousing 2. Being data-driven is a journey, not a project 3. Our most successful customers do five key things
  • 3. 3© Cloudera, Inc. All rights reserved. Innovation around the world driven by data Using data about bets per second and machine learning to promote responsible gambling by customising offers to minimise the customer's vulnerability. A "smart business" application for small businesses that enables them to see patterns in an anonymised data generated by the bank's other customers. CONNECT PRODUCT & SERVICES (IoT) DRIVE CUSTOMER INSIGHTSPROTECT LIVES Analyzing acoustic data coming from turbines in real-time to monitor the health of and predict failures in turbines for hydro power stations.
  • 4. 4© Cloudera, Inc. All rights reserved. Advanced analytics is better than data warehousing Build your data asset economically and at scale 1. Collect data in native format – enables agility 2. Build history by collecting data prior to its use Securely share on-prem, in cloud, anywhere 3. Security at the data layer increases flexibility and ability to protect privacy 4. Create community data and drive innovation by sharing across your business Innovate with analytics and operationalize the insights 5. Analyze data in near real-time 6. Build and deploy machine learning models and other advanced analytics 7. Deliver insights via enterprise, mobile and web applications
  • 5. 5© Cloudera, Inc. All rights reserved. Think Big. Start small. Iterate to success. Being data-driven is a journey, not a project.
  • 6. 6© Cloudera, Inc. All rights reserved. Customers ask: what does ‘think big’ mean? Determine your strategic initiatives. Read your annual report. Define a reasonable timeframe and goals. Typically 3-5 strategic initiatives in parallel Often segmented by business unit At maturity initiatives cross business units.
  • 7. 7© Cloudera, Inc. All rights reserved. Customers ask: what does ‘start small’ mean? Embrace the familiar. Enhance something you know. Make a report better with more data. Then go get a shiny object (new data). Bring in and integrate. Showcase your results with a visualization.
  • 8. 8© Cloudera, Inc. All rights reserved. Customers ask: what does ‘iterate often’ mean? Break strategic initiatives into quarterly objectives. Break your quarterly objectives into sprints. Deliver and visualize outcomes at every sprint exit. Continuously learning and adapting. Outcomes can be both positive and negative. Outcomes can be about the business and data.
  • 9. 9© Cloudera, Inc. All rights reserved. Our Most Successful Customers iterate these Five Things 1. Build a Big Data Culture Led by an enabled executive sponsor(s). Communication methodologies. Advocating change. 2. Assemble the right team Tightly aligned team. Mix of seasoned experts and innovators 3. Adopt an agile approach for data engineering, data science, analysis Successful projects start small, are hypothesis driven and iterate to success approach. Roadmaps: Document expected direction, yet expect insights to create change 4. Efficiently operationalize insights Analytics -> Reports, Big Data -> Actions. Create a bridge between Dev and Ops 5. Rightsize your data governance Rightsize and iteratively building towards maturity.
  • 10. 10© Cloudera, Inc. All rights reserved. Description Executive Sponsorship  Executive Sponsor for the overall Big Data mission including advocacy for creating/collecting data and business stakeholders for individual use cases. Align to strategic initiatives. Community  Build community through communications about vision, insights, data and platform and technology.  Make communications more programmatic across the entire organization with meet ups, big data days and hackathons. Foster a culture that iteratively and continuous builds a strong sharing community. Enable many in the organization – over time – to become evangelists 1. Build a data-driven culture Visualizations  Visualize EVERYTHING! Use visualizations to tell stories about the data asset itself (how big is it, how fast is it growing) as well as insights found for the business.
  • 11. 11© Cloudera, Inc. All rights reserved. Logical Information Architecture An environment that supports new ways of working Ingestion Zone Discovery Zone Integrated Zone Production Trusted power users have broad access & new tools. The new data engineering team partners with data stewards to ingest full fidelity raw data Continuous deployment concepts move data, models, etc from exploratory environment to production Business users have narrowed access with traditional BI tools and applications
  • 12. 12© Cloudera, Inc. All rights reserved. Ingestion Zone Discovery Zone Integrated Zone Production Raw Trusted Ingest Validation & Verification Enrichment Transform Routing Logical Information Architecture An environment that supports new ways of working
  • 13. 13© Cloudera, Inc. All rights reserved. 2. Assemble the right team 1 Executive Architecture & Operations Data Engineering Data Science, SQL & app development Vision and Goals
  • 14. 14© Cloudera, Inc. All rights reserved. Description An essential key to success is having a strong executive sponsor for the overall Big Data mission including advocacy for creating/collecting data and business stakeholders for individual use cases. Profile  An executive focused on change, and willing to take risk to ensure the success of the business via the Big Data initiatives. Education  Use every opportunity to bring the topic in front of potential sponsors and stakeholders. Share industry and business potential ROI models (heeding the warning not to overstate). Advocacy  Build big data success stories from within the business. Advocate for the use of data in new ways. Support the proactive collection of data and lead the charge to assign value to data. The Important Role of the Executive Sponsor
  • 15. 15© Cloudera, Inc. All rights reserved. Hadoop and the Big Data technology ecosystems change rapidly – infrastructure architecture is a critical component of your team. Architects need to balance tactical and strategic needs. Communication  The software and hardware infrastructure is often physically operated by an different group. The Architect needs close collaboration. Education  Continually explore new technologies, including 3rd party tools – architects need to stay ahead of the curve. Training is essential: admin, developer. Leadership  Be the infrastructure expert and advise on new projects and new requirements from the data management team and the business. Know when to call in the experts on Hadoop and Big Data. Description Your Infrastructure Team & Architect
  • 16. 16© Cloudera, Inc. All rights reserved. Data is only useful if users can employ it in a meaningful way. Data engineers have to be committed to making your company’s data the utmost strategic asset, from acquisition to advocacy. Communication  Document, secure, audit the data. Create simple schemas and search indexes for each data set. Create common profiles, and continually advocate for new data and for improved data. Education  Get trained and certified with Cloudera Administrator, Developer, Data Analyst courses and become an expert with Navigator, Sentry Leadership  Promote and evangelize to educate on the value of Big Data, take the lead on data governance – love the data Description Your Data Engineering Team
  • 17. 17© Cloudera, Inc. All rights reserved. Curiosity Math & Statistical Knowledge Hacking skills Subject Matter Expertise The hybrid data scientist • Subject Matter Expertise lies in the business • Hacking skills can come from existing IT staff or new hires • Staff at least one true Ph.D statistician for model oversight across all teams Important character trait Data Science A luxury is finding one or more data scientists that cross these disciplines Your Data Scientist Team(s)
  • 18. 18© Cloudera, Inc. All rights reserved. Often a centralized Data Science team can partner with the business to identify data that differentiates, explore use cases to solve, and help to jumpstart business teams. Be mindful not to overbuild centrally. Agility  The team must be able to learn quickly and adapt Skills  Hybrid skills of computer science (hacking), domain expertise and at least one true statistician. Data Science training. Teams  Often businesses find the domain expertise in-house, add in MS/Ph.D. candidates from local universities and hire that one true statistician Experts  This team must be the “data experts” for the entire company in order to fulfil the vision of sharing data for maximum innovation Description Staff for Success: Data Science-as-a-Service
  • 19. 19© Cloudera, Inc. All rights reserved. Lower risk  Risk of funding long-running projects with limited business value is small. Use daily results to improve the process or change course. Lower costs  Can run infrastructure, data and insights workstreams in parallel. Avoids large build-out of infrastructure and data before insights. Communication  With clear short-term results, enables a continuous communications stream showcasing results or failures Team  Can start with small team, and add additional scrum teams as value is determined and investment is available Agile methodology provides actionable results more rapidly and measures the value gained at each step, in small iterations. Agile should be applied to data and insights project workstreams. Description 3. Adopt an agile approach to data engineering & science
  • 20. 20© Cloudera, Inc. All rights reserved. Use Case Development EDH Buildout Data Governance & Common Profile Development Data Engineering Agile Methodology Enables Iterative Workstreams Use Case Development/App development/Data science
  • 21. 21© Cloudera, Inc. All rights reserved. Agile Use Case Development Scrum Team Release 1 Release 2 Release 3 Production Ready Scrum Team Release 1 Release 2 Production Ready Release 4 Release 3 Agile Data Ingestion/Management Scrum Team Release 1 Release 2 Release 3 Production Ready Scrum Team Release 1 Release 2 Production Ready Release 4 Release 3 Agile Methodology Enables Iterative Workstreams Agile Data Governance & Common Profile Development Scrum Team Release 1 Release 2 Release 3 Production Ready Scrum Team Release 1 Release 2 Production Ready Release 4 Release 3 EDH Buildout
  • 22. 22© Cloudera, Inc. All rights reserved. Description DevOps  Operationalizing data and insights from analytics is more like digital and mobile development cycles, than traditional ERP or RDBMS applications Communication  Start by finding those people who understood both sides of the agile development and IT deployment cycle – and bridge the communication gap between both sides  Initially most DevOps processes were manual – making sure web code was unit and functionally tested, ensuring source code was under a control system, etc. Apply DevOps concepts from the world of application / web development to the world of data and analytics – from managing data that needs to move to production, as well as the models used to create insights from that data. 4. Efficiently operationalize your insights Continuous Delivery  Move towards automation of the processes needed to move new analytical code/models/data from development into production. Eventually get to complete automation of those processes, allowing for continuous deployment of new analytical models and data into production
  • 23. 23© Cloudera, Inc. All rights reserved. Data Stewards Owners and/or creators of the data Responsibilities  Providing knowledge about the data (e.g. privacy, use case concerns)  Documenting and improving the raw data, with focus on link-ability Data Engineers Implement the data governance policies Responsibilities  Defining and driving the governance  Organizing and hosting the Governance Council  Delivering and utilizing tools (e.g. Navigator) to enforce governance Data Governance Council Business owners of the Data Governance Responsibilities  Communication about and enforcement of data governance  Assigning data steward roles  Improving the link- ability of data 5. Rightsize Your Big Data Governance
  • 24. 24© Cloudera, Inc. All rights reserved. Rightsize your data governance: Iterate to maturity Chaos: “We don’t know what’s in our data hub” CYA: Basic governance artifact capture Self-service: Data curation automation Automation: Data stewardship and lifecycle automation Continuous improvement: ongoing optimization 1 2 3 4 5Initial Managed Standardized Measured Optimized
  • 25. 25© Cloudera, Inc. All rights reserved. Data can make what is impossible today, possible tomorrow
  • 26. 26© Cloudera, Inc. All rights reserved. Changing our relationship with the products and services we consume
  • 27. 27© Cloudera, Inc. All rights reserved. Improving reliability, quality, & sustainability
  • 28. 28© Cloudera, Inc. All rights reserved. Reaching for the stars
  • 29. 29© Cloudera, Inc. All rights reserved. Protecting the most vulnerable
  • 30. 30© Cloudera, Inc. All rights reserved. Thank you