SlideShare a Scribd company logo
1 of 20
Confidential Think Big AnalyticsConfidential Think Big Analytics
Big Analytics Best Practices
An Executive Guide
September 26, 2012
Ron Bodkin
Founder and CEO
ron.bodkin@thinkbiganalytics.com
@ronbodkin
Confidential Think Big Analytics
Introduction
• One of Silicon Valley’s Fastest Growing Big Data start ups
• 100% Focus on Big Data consulting & Data Science solution services
• Management Background:
 Cambridge Technology, C-bridge, Oracle, Sun Microsystems, Quantcast, Accenture
 C-bridge Internet Solutions (CBIS) founder 1996 & executives, IPO 1999
• Clients: 40+
– Focuses: Technology, Financial Services, Retail, Advertising
• North America Locations
• US East: Boston, New York, Miami
• US Central: Chicago, Austin
• US West: HQ Mountain View, San Diego, Salt Lake City
Think Big is the leading professional services firm that’s purpose built for Big Data.
28/17/2013
Confidential Think Big Analytics
Big Analytics, enabled by Big Data
Big Data invented to solve
web scale data challenges.
Opportunity and mandate
for enterprises to compete
with advanced analytics.
Now enabling new
businesses and products.
38/17/2013
Confidential Think Big Analytics
1. It’s just a new name for Business Intelligence.
2. The packaged applications are about to emerge.
3. The enterprise can wait.
4. Low cost, low skill staffing will work.
5. It’s simple to get results.
6. You can automate all the intelligence.
7. You can buy it all from a single vendor “stack.”
The 7 Myths of Big Data
48/17/2013
Confidential Think Big Analytics
Incremental Adoption
58/17/2013
Confidential Think Big Analytics
Real World Results
68/17/2013
Confidential Think Big Analytics
360 Customer View Analytics
Trends
• Compute model scores faster
• Analyze full data sets
• Incorporate new data
• Build new services from data
Basic Reporting
Data Ingestion
Batch
Processing
Fast Analytics
Data
Enrichment
Data Science
78/17/2013
Confidential Think Big Analytics
Social Media
“The digital transformation occurring at American Express cuts across many business units,
and it has to because of the breadth and depth of our business,” Leslie Berland SVP of
Digital Partnerships and Development explains. “From customer service to merchant
services to our entertainment and travel business units, to corporate affairs, as well as our
newly formed digital partnerships and development team, social media is a company-wide
initiative.”
Source: http://mashable.com/2012/03/28/american-express-social-media/
March 28, 2012
88/17/2013
Confidential Think Big Analytics
Think Big
98/17/2013
Confidential Think Big Analytics
Envision
Current State
Future State
Prioritized
Initiatives
Key
Decisions &
Impact
Analysis
Reference
Architecture
Design
Patterns
Technology
Rankings
Organization
& Training
Optimized
Projects
Selection
Big Analytics
Roadmap &
RecommendationsGap Analysis
Big Data Strategy Readiness Analysis
Technology
Recommendations
Big Data Roadmap
Big Analytics Roadmap Methodology
Analytic
Platform
Decision Tree
Data
Strategy
108/17/2013
Confidential Think Big Analytics
Data Strategy: Value from Integration
Ad Server
Mobile
Social
Web Site
Devices & Enterprise Applications
Outside Data (new)
118/17/2013
Confidential Think Big Analytics
Start Smart
128/17/2013
Confidential Think Big Analytics
Organizing for Success
• Driven by collaboration between
data scientists, engineers and
business
• Leverages the manifest and latent
signal of multi-structured data
• Emphasizes exploratory analysis to
uncover novel topologies in the
data
• Boosts power with diverse
multivariate models and holistic
data sets
• Triangulates truth with multiple
approaches when problems are
intractable
138/17/2013
Confidential Think Big Analytics
Need for New Skills
Database
Administrator
Big Data
Administrator
Business
Analyst
Data Science
Math Modeler
Data
Architect
Data Architect
Big Data Modeling
Developers
Big Data
Engineer
Invest and scale complementary skills to move to a data-centric organizational model.
• Include expert training, mentoring and joint solution development
148/17/2013
Confidential Think Big Analytics
Scale Fast
158/17/2013
Confidential Think Big Analytics
An Integrated Approach
Creating value with nimble, incremental innovation
Brainstorm
POC
Pilot
Deploy
Training
GTM
Partners
Clients
Industry
Analysts
Strategic
Technology
Business&TechnologyRequirements
Data Science & Analytics
Center of Excellence
InternalSolutions
ExternalSolutions
QA TestEngineer
Risk Management
Big Data
Lab
Technology
Experts
Best in Class Analytics Sand Box
Monitoring
Open Source
Innovation
Business SMEs
Envision Education Engineering
Strategy Management, Development & Operations
Support & Performance Measurement
BUSINESS VELOCITY
Administration & Optimization
Big Data Strategy
Readiness Analysis
Technology
Recommendations
Big Data
Roadmap
168/17/2013
Confidential Think Big Analytics
• Develop data and analytics
platforms that bridge the old
and new.
• Understand integration
patterns and use cases to
effectively guide new
initiatives.
• Partner with business on
opportunities for innovation.
• Build organizational maturity
along a number of dimensions
(platform, architecture, data
engineering, data science).
17
New IT Platforms
Data Mining
(R, Mahout)
Query
(Hive/Pig)
MapReduce
Parallel
Export
Parallel
Export
Messaging
Replication
Hadoop Cluster
Management, Monitoring, and Security
Landing Zone
External Data Sources
Event
Ingest
Realtime to Seconds Minutes and Up
Interactions
Analysis
Source: Think Big Analytics
MPP EDW:
structured
summary data
Fast Unstruct-
ured DB
Prod
Cycle
(Min's)
Science
Cycle
(Days)
Scheduler &
Dependency
Engine
DFS
Data Science
Tools
Tradtional
BI Tools
Scale out
DB
Scale out
DB
Relational
DBMS
Serving
Engine
Secondary
Index
low vol
ACID
Read /
Write
Distributed
SearchDistributed
Search
DB Sync
8/17/2013
Confidential Think Big Analytics
Data Science
A New Role Exists – the Data Scientist
Focused on data not models
Works with analysts to create business value
• One Part Scientist/Statistician
• One Part Sleuth
• One Part Artist
• One Part Programmer
188/17/2013
Confidential Think Big Analytics
1. Big Analytics is a critical capability.
2. Your organization can create value now.
3. Get help to get off on the right foot.
4. Adopt incrementally.
Conclusions
Think Big Start Smart Scale Fast
198/17/2013
Confidential Think Big Analytics
Rick
Ron.Bodkin@thinkbiganalytics.com
@ronbodkin

More Related Content

What's hot

Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big DataDataWorks Summit
 
News UK Transformed Its Data Center To Become More Agile
News UK Transformed Its Data Center To Become More AgileNews UK Transformed Its Data Center To Become More Agile
News UK Transformed Its Data Center To Become More AgileHCL Technologies
 
3 джозеп курто превращаем вашу организацию в big data компанию
3 джозеп курто превращаем вашу организацию в big data компанию3 джозеп курто превращаем вашу организацию в big data компанию
3 джозеп курто превращаем вашу организацию в big data компаниюantishmanti
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forumbigdatawf
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationDenodo
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsRackspace
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersDataWorks Summit
 
Integrating Hadoop Into the Enterprise
Integrating Hadoop Into the EnterpriseIntegrating Hadoop Into the Enterprise
Integrating Hadoop Into the EnterpriseDataWorks Summit
 
Hitachi data systems and tsys success story
Hitachi data systems and tsys success storyHitachi data systems and tsys success story
Hitachi data systems and tsys success storyHitachi Vantara
 
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your life
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your lifeTop 10 ways BigInsights BigIntegrate and BigQuality will improve your life
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your lifeIBM Analytics
 
DCD Big Discussion Guide
DCD Big Discussion GuideDCD Big Discussion Guide
DCD Big Discussion GuideJames Laker
 
THE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEED
THE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEEDTHE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEED
THE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEEDwebwinkelvakdag
 
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...Denodo
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationDenodo
 
Data Virtualization for Data Architects (New Zealand)
Data Virtualization for Data Architects (New Zealand)Data Virtualization for Data Architects (New Zealand)
Data Virtualization for Data Architects (New Zealand)Denodo
 
Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Denodo
 

What's hot (20)

10g db grid
10g db grid10g db grid
10g db grid
 
Ibm big data
Ibm big dataIbm big data
Ibm big data
 
Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big Data
 
News UK Transformed Its Data Center To Become More Agile
News UK Transformed Its Data Center To Become More AgileNews UK Transformed Its Data Center To Become More Agile
News UK Transformed Its Data Center To Become More Agile
 
3 джозеп курто превращаем вашу организацию в big data компанию
3 джозеп курто превращаем вашу организацию в big data компанию3 джозеп курто превращаем вашу организацию в big data компанию
3 джозеп курто превращаем вашу организацию в big data компанию
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forum
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High Expectations
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
 
Integrating Hadoop Into the Enterprise
Integrating Hadoop Into the EnterpriseIntegrating Hadoop Into the Enterprise
Integrating Hadoop Into the Enterprise
 
Hitachi data systems and tsys success story
Hitachi data systems and tsys success storyHitachi data systems and tsys success story
Hitachi data systems and tsys success story
 
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your life
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your lifeTop 10 ways BigInsights BigIntegrate and BigQuality will improve your life
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your life
 
DCD Big Discussion Guide
DCD Big Discussion GuideDCD Big Discussion Guide
DCD Big Discussion Guide
 
THE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEED
THE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEEDTHE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEED
THE FUTURE OF DATA: PROVISIONING ANALYTICS-READY DATA AT SPEED
 
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...
Customer Keynote: Data Service and Security at an Enterprise Scale with Logic...
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
 
Data Virtualization for Data Architects (New Zealand)
Data Virtualization for Data Architects (New Zealand)Data Virtualization for Data Architects (New Zealand)
Data Virtualization for Data Architects (New Zealand)
 
[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh
 
Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)
 

Similar to Big analytics best practices @ PARC

How to get started in extracting business value from big data 1 of 2 oct 2013
How to get started in extracting business value from big data 1 of 2 oct 2013How to get started in extracting business value from big data 1 of 2 oct 2013
How to get started in extracting business value from big data 1 of 2 oct 2013Jaime Nistal
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?SAS Canada
 
Value of data in digital transformation
Value of data in digital transformationValue of data in digital transformation
Value of data in digital transformationLoihde Advisory
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewUC San Diego Rady School of Management
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyArcadia Data
 
2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in PracticeAlejandro Jaramillo
 
Self-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-finalSelf-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-finalstelligence
 
SpigitEngage - The latest release of our Enterprise Innovation Platform
SpigitEngage - The latest release of our Enterprise Innovation PlatformSpigitEngage - The latest release of our Enterprise Innovation Platform
SpigitEngage - The latest release of our Enterprise Innovation PlatformMilind Pansare
 
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...Grid Dynamics
 
BigInsights BigData Study 2013 - Exec Summary
BigInsights BigData Study 2013  - Exec SummaryBigInsights BigData Study 2013  - Exec Summary
BigInsights BigData Study 2013 - Exec SummaryBigInsights
 
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201... It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...Edgar Alejandro Villegas
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy Hussain Sultan
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics CapabilityBala Iyer
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonSocietyConsulting
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Capgemini
 
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...DATAVERSITY
 
Big agendas for big data analytics projects
Big agendas for big data analytics projectsBig agendas for big data analytics projects
Big agendas for big data analytics projectsThe Marketing Distillery
 
The Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impactThe Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impactPaul Laughlin
 

Similar to Big analytics best practices @ PARC (20)

How to get started in extracting business value from big data 1 of 2 oct 2013
How to get started in extracting business value from big data 1 of 2 oct 2013How to get started in extracting business value from big data 1 of 2 oct 2013
How to get started in extracting business value from big data 1 of 2 oct 2013
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?
 
Value of data in digital transformation
Value of data in digital transformationValue of data in digital transformation
Value of data in digital transformation
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program Overview
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics Strategy
 
2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice
 
Self-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-finalSelf-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-final
 
SpigitEngage - The latest release of our Enterprise Innovation Platform
SpigitEngage - The latest release of our Enterprise Innovation PlatformSpigitEngage - The latest release of our Enterprise Innovation Platform
SpigitEngage - The latest release of our Enterprise Innovation Platform
 
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
 
BigInsights BigData Study 2013 - Exec Summary
BigInsights BigData Study 2013  - Exec SummaryBigInsights BigData Study 2013  - Exec Summary
BigInsights BigData Study 2013 - Exec Summary
 
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201... It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics Capability
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad Richeson
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
 
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
 
Big agendas for big data analytics projects
Big agendas for big data analytics projectsBig agendas for big data analytics projects
Big agendas for big data analytics projects
 
The Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impactThe Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impact
 

More from Jim Kaskade

Jim kaskade biography (updated)
Jim kaskade biography (updated)Jim kaskade biography (updated)
Jim kaskade biography (updated)Jim Kaskade
 
Woodside Residential Design Guidelines
Woodside Residential Design GuidelinesWoodside Residential Design Guidelines
Woodside Residential Design GuidelinesJim Kaskade
 
Woodside Glens Neighborhood Plan - Amended 1999
Woodside Glens Neighborhood Plan - Amended 1999Woodside Glens Neighborhood Plan - Amended 1999
Woodside Glens Neighborhood Plan - Amended 1999Jim Kaskade
 
Infochimps Hadoop Summit 2013
Infochimps Hadoop Summit 2013Infochimps Hadoop Summit 2013
Infochimps Hadoop Summit 2013Jim Kaskade
 
Infochimps TieCon 2013
Infochimps TieCon 2013Infochimps TieCon 2013
Infochimps TieCon 2013Jim Kaskade
 
Infochimps Cloudcon 2012
Infochimps Cloudcon 2012Infochimps Cloudcon 2012
Infochimps Cloudcon 2012Jim Kaskade
 
Vmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanVmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanJim Kaskade
 
Infochimps CxO Seminar @ PARC
Infochimps CxO Seminar @ PARCInfochimps CxO Seminar @ PARC
Infochimps CxO Seminar @ PARCJim Kaskade
 
Big Data & Cloud - Infinite Monkey Theorem
Big Data & Cloud - Infinite Monkey TheoremBig Data & Cloud - Infinite Monkey Theorem
Big Data & Cloud - Infinite Monkey TheoremJim Kaskade
 
Marketing & Sales
Marketing & SalesMarketing & Sales
Marketing & SalesJim Kaskade
 
Outsourcing Class
Outsourcing ClassOutsourcing Class
Outsourcing ClassJim Kaskade
 
Online Video and Next-gen Storage
Online Video and Next-gen StorageOnline Video and Next-gen Storage
Online Video and Next-gen StorageJim Kaskade
 
Rapid Social Game Development & Deployment
Rapid Social Game Development & DeploymentRapid Social Game Development & Deployment
Rapid Social Game Development & DeploymentJim Kaskade
 
Application Model for Cloud Deployment
Application Model for Cloud DeploymentApplication Model for Cloud Deployment
Application Model for Cloud DeploymentJim Kaskade
 
Next-Gen Security (using Cloud)
Next-Gen Security (using Cloud)Next-Gen Security (using Cloud)
Next-Gen Security (using Cloud)Jim Kaskade
 
CISCO Visual Networking Index Forecast and Methodology, 2009-14
CISCO Visual Networking Index Forecast and Methodology, 2009-14CISCO Visual Networking Index Forecast and Methodology, 2009-14
CISCO Visual Networking Index Forecast and Methodology, 2009-14Jim Kaskade
 
Jim Kaskade Biography
Jim Kaskade BiographyJim Kaskade Biography
Jim Kaskade BiographyJim Kaskade
 
CISCO\'s Take On Internet Video
CISCO\'s Take On Internet VideoCISCO\'s Take On Internet Video
CISCO\'s Take On Internet VideoJim Kaskade
 
Private Cloud Platform as a Service
Private Cloud Platform as a ServicePrivate Cloud Platform as a Service
Private Cloud Platform as a ServiceJim Kaskade
 
Advertising Exchange Whitepaper
Advertising Exchange WhitepaperAdvertising Exchange Whitepaper
Advertising Exchange WhitepaperJim Kaskade
 

More from Jim Kaskade (20)

Jim kaskade biography (updated)
Jim kaskade biography (updated)Jim kaskade biography (updated)
Jim kaskade biography (updated)
 
Woodside Residential Design Guidelines
Woodside Residential Design GuidelinesWoodside Residential Design Guidelines
Woodside Residential Design Guidelines
 
Woodside Glens Neighborhood Plan - Amended 1999
Woodside Glens Neighborhood Plan - Amended 1999Woodside Glens Neighborhood Plan - Amended 1999
Woodside Glens Neighborhood Plan - Amended 1999
 
Infochimps Hadoop Summit 2013
Infochimps Hadoop Summit 2013Infochimps Hadoop Summit 2013
Infochimps Hadoop Summit 2013
 
Infochimps TieCon 2013
Infochimps TieCon 2013Infochimps TieCon 2013
Infochimps TieCon 2013
 
Infochimps Cloudcon 2012
Infochimps Cloudcon 2012Infochimps Cloudcon 2012
Infochimps Cloudcon 2012
 
Vmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanVmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps Ironfan
 
Infochimps CxO Seminar @ PARC
Infochimps CxO Seminar @ PARCInfochimps CxO Seminar @ PARC
Infochimps CxO Seminar @ PARC
 
Big Data & Cloud - Infinite Monkey Theorem
Big Data & Cloud - Infinite Monkey TheoremBig Data & Cloud - Infinite Monkey Theorem
Big Data & Cloud - Infinite Monkey Theorem
 
Marketing & Sales
Marketing & SalesMarketing & Sales
Marketing & Sales
 
Outsourcing Class
Outsourcing ClassOutsourcing Class
Outsourcing Class
 
Online Video and Next-gen Storage
Online Video and Next-gen StorageOnline Video and Next-gen Storage
Online Video and Next-gen Storage
 
Rapid Social Game Development & Deployment
Rapid Social Game Development & DeploymentRapid Social Game Development & Deployment
Rapid Social Game Development & Deployment
 
Application Model for Cloud Deployment
Application Model for Cloud DeploymentApplication Model for Cloud Deployment
Application Model for Cloud Deployment
 
Next-Gen Security (using Cloud)
Next-Gen Security (using Cloud)Next-Gen Security (using Cloud)
Next-Gen Security (using Cloud)
 
CISCO Visual Networking Index Forecast and Methodology, 2009-14
CISCO Visual Networking Index Forecast and Methodology, 2009-14CISCO Visual Networking Index Forecast and Methodology, 2009-14
CISCO Visual Networking Index Forecast and Methodology, 2009-14
 
Jim Kaskade Biography
Jim Kaskade BiographyJim Kaskade Biography
Jim Kaskade Biography
 
CISCO\'s Take On Internet Video
CISCO\'s Take On Internet VideoCISCO\'s Take On Internet Video
CISCO\'s Take On Internet Video
 
Private Cloud Platform as a Service
Private Cloud Platform as a ServicePrivate Cloud Platform as a Service
Private Cloud Platform as a Service
 
Advertising Exchange Whitepaper
Advertising Exchange WhitepaperAdvertising Exchange Whitepaper
Advertising Exchange Whitepaper
 

Recently uploaded

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Big analytics best practices @ PARC

  • 1. Confidential Think Big AnalyticsConfidential Think Big Analytics Big Analytics Best Practices An Executive Guide September 26, 2012 Ron Bodkin Founder and CEO ron.bodkin@thinkbiganalytics.com @ronbodkin
  • 2. Confidential Think Big Analytics Introduction • One of Silicon Valley’s Fastest Growing Big Data start ups • 100% Focus on Big Data consulting & Data Science solution services • Management Background:  Cambridge Technology, C-bridge, Oracle, Sun Microsystems, Quantcast, Accenture  C-bridge Internet Solutions (CBIS) founder 1996 & executives, IPO 1999 • Clients: 40+ – Focuses: Technology, Financial Services, Retail, Advertising • North America Locations • US East: Boston, New York, Miami • US Central: Chicago, Austin • US West: HQ Mountain View, San Diego, Salt Lake City Think Big is the leading professional services firm that’s purpose built for Big Data. 28/17/2013
  • 3. Confidential Think Big Analytics Big Analytics, enabled by Big Data Big Data invented to solve web scale data challenges. Opportunity and mandate for enterprises to compete with advanced analytics. Now enabling new businesses and products. 38/17/2013
  • 4. Confidential Think Big Analytics 1. It’s just a new name for Business Intelligence. 2. The packaged applications are about to emerge. 3. The enterprise can wait. 4. Low cost, low skill staffing will work. 5. It’s simple to get results. 6. You can automate all the intelligence. 7. You can buy it all from a single vendor “stack.” The 7 Myths of Big Data 48/17/2013
  • 5. Confidential Think Big Analytics Incremental Adoption 58/17/2013
  • 6. Confidential Think Big Analytics Real World Results 68/17/2013
  • 7. Confidential Think Big Analytics 360 Customer View Analytics Trends • Compute model scores faster • Analyze full data sets • Incorporate new data • Build new services from data Basic Reporting Data Ingestion Batch Processing Fast Analytics Data Enrichment Data Science 78/17/2013
  • 8. Confidential Think Big Analytics Social Media “The digital transformation occurring at American Express cuts across many business units, and it has to because of the breadth and depth of our business,” Leslie Berland SVP of Digital Partnerships and Development explains. “From customer service to merchant services to our entertainment and travel business units, to corporate affairs, as well as our newly formed digital partnerships and development team, social media is a company-wide initiative.” Source: http://mashable.com/2012/03/28/american-express-social-media/ March 28, 2012 88/17/2013
  • 9. Confidential Think Big Analytics Think Big 98/17/2013
  • 10. Confidential Think Big Analytics Envision Current State Future State Prioritized Initiatives Key Decisions & Impact Analysis Reference Architecture Design Patterns Technology Rankings Organization & Training Optimized Projects Selection Big Analytics Roadmap & RecommendationsGap Analysis Big Data Strategy Readiness Analysis Technology Recommendations Big Data Roadmap Big Analytics Roadmap Methodology Analytic Platform Decision Tree Data Strategy 108/17/2013
  • 11. Confidential Think Big Analytics Data Strategy: Value from Integration Ad Server Mobile Social Web Site Devices & Enterprise Applications Outside Data (new) 118/17/2013
  • 12. Confidential Think Big Analytics Start Smart 128/17/2013
  • 13. Confidential Think Big Analytics Organizing for Success • Driven by collaboration between data scientists, engineers and business • Leverages the manifest and latent signal of multi-structured data • Emphasizes exploratory analysis to uncover novel topologies in the data • Boosts power with diverse multivariate models and holistic data sets • Triangulates truth with multiple approaches when problems are intractable 138/17/2013
  • 14. Confidential Think Big Analytics Need for New Skills Database Administrator Big Data Administrator Business Analyst Data Science Math Modeler Data Architect Data Architect Big Data Modeling Developers Big Data Engineer Invest and scale complementary skills to move to a data-centric organizational model. • Include expert training, mentoring and joint solution development 148/17/2013
  • 15. Confidential Think Big Analytics Scale Fast 158/17/2013
  • 16. Confidential Think Big Analytics An Integrated Approach Creating value with nimble, incremental innovation Brainstorm POC Pilot Deploy Training GTM Partners Clients Industry Analysts Strategic Technology Business&TechnologyRequirements Data Science & Analytics Center of Excellence InternalSolutions ExternalSolutions QA TestEngineer Risk Management Big Data Lab Technology Experts Best in Class Analytics Sand Box Monitoring Open Source Innovation Business SMEs Envision Education Engineering Strategy Management, Development & Operations Support & Performance Measurement BUSINESS VELOCITY Administration & Optimization Big Data Strategy Readiness Analysis Technology Recommendations Big Data Roadmap 168/17/2013
  • 17. Confidential Think Big Analytics • Develop data and analytics platforms that bridge the old and new. • Understand integration patterns and use cases to effectively guide new initiatives. • Partner with business on opportunities for innovation. • Build organizational maturity along a number of dimensions (platform, architecture, data engineering, data science). 17 New IT Platforms Data Mining (R, Mahout) Query (Hive/Pig) MapReduce Parallel Export Parallel Export Messaging Replication Hadoop Cluster Management, Monitoring, and Security Landing Zone External Data Sources Event Ingest Realtime to Seconds Minutes and Up Interactions Analysis Source: Think Big Analytics MPP EDW: structured summary data Fast Unstruct- ured DB Prod Cycle (Min's) Science Cycle (Days) Scheduler & Dependency Engine DFS Data Science Tools Tradtional BI Tools Scale out DB Scale out DB Relational DBMS Serving Engine Secondary Index low vol ACID Read / Write Distributed SearchDistributed Search DB Sync 8/17/2013
  • 18. Confidential Think Big Analytics Data Science A New Role Exists – the Data Scientist Focused on data not models Works with analysts to create business value • One Part Scientist/Statistician • One Part Sleuth • One Part Artist • One Part Programmer 188/17/2013
  • 19. Confidential Think Big Analytics 1. Big Analytics is a critical capability. 2. Your organization can create value now. 3. Get help to get off on the right foot. 4. Adopt incrementally. Conclusions Think Big Start Smart Scale Fast 198/17/2013
  • 20. Confidential Think Big Analytics Rick Ron.Bodkin@thinkbiganalytics.com @ronbodkin

Editor's Notes

  1. New Data Sources, Innovative Use Cases, Data Science & Predictive AnalyticsA new class of big data technologies were invented to address data management challenges at Web scale. These technologies enabled new approaches to solve analytic questions that were too complex or did not fit into traditional systems:Reduce cycle time developing new analytic modelsRun analyses that were previously impossible Simpler modeling approaches by utilizing larger datasetsAnalysis conducted at a far lower costFlexibility for future unknowns +Compute Processing $ & Time ex. 26 Days 2 minex. 42 Hours  40 minex. 18 Hours  16 min=Business Innovation VelocityBig Data is Changing the Game. Organizations need to get smarter, leveraging substantial untapped data assets for sustained competitive advantage. reduce cycle time -> 1. much lower effort to work with new datasets; 2. parallel distributed infrastructure processes data much faster 3. compute approximate answers before investing in projects to automateMore detailed example on reduced cycle time - Hive allows you to define the underlying structure of the raw data just enough to let you run SQL-esque queries against it. run new types of analysis->1) model across complex datasets that did not match relational database model2) work with larger datasets and compute intensive algorithms simpler modeling->1) google whitepaper2) fewer assumptions, simpler models required when looking at entire customer dbvs 3% and extrapolating lower cost->- shorter cycle times- lower infrastructure costs for storage and processing utilizing commodity hw and open source sw- reduced processing time Flexibility -> promise, that by storing everything, you have source data to continue to generate and model new hypothesis, reduced cycle times for experiments to increase value, now have the ability to store 10 years of full data for self and suppliersInnovations in commodity hardware, elastic, distributed, open source software platforms, such as Hadoop, and NoSQL database technologies are changing the game for advanced analytics at the core:
  2. leverages the manifest and latent signal of multistructured data        we say "multistructured" not "unstructured" now        most of the data in the world has latent signal - it's hidden as a messy tangle of other crap.  bi tools are really designed to work with data where the relevant signal is overt (manifest variables) and this is true of the corresponding models* emphasizes exploratory analysis to uncover novel topologies in the data        so this is stuff like narrow strata and behavioral cohorts. just said all fancy and whatnot* boosts power with diverse multivariate models and wholistic data sets        the world is multivariate        integrating models designed for structured data with those designed for unstructured data gives new power        it not just more data, it's data from new sources, providing a new lenses, new behaviors, etc. all in concert                e.g. integrating online and offline, adding offline brand exposures to online ad efficacy assessments and attribution analysis* triangulates truth with multiple approaches when problems are intractable        stop trying to "prove" things, let validity and predictability testing guide you, focus on avoiding spurious relationships through theoryPlaybook as talking pointsRich data setsURLs, social graphs, text feedback…New, rich visualizationExplorationAutomated detectionHighlights, trends, anomaliesCollaboration with data scientists…
  3. DBA -> Big DBAPrior experience:Diverse system environmentsApplication performance mgtSystems appreciationMetrics-focusedNew skills:Management & monitoring toolsMetricsAutomation for scaleLower-level workload tuning DA --> DA BDMPrior experience:Data-focused: digging into detailsDiverse database environmentsDeep domain knowledgeFamiliarity with unstructured data (XML)Hybrid dbs and non-db systemsNew skills:Data modeling for unstructured dataAlternative tools and documentationLanguages and APIs (Hive, Pig, M/R)Process Models (M/R, Key/Value)Lower-level optimizations BA -> DS MMNew Skills:Introduction to HadoopNew tools for data manipulationVariety of new modelsChallenging top-down approachesWorking with unstructured dataBottoms-up pattern discoveryEfficient programming at scaleLarge scale Machine Learning Dev -> Big Data EngineerNew Skills:Processing models(MapReduce, Key/Value)Data modelingSchemas for unstructuredLanguages/APIs (Hive, Pig, M/R)Work process from small to full-scaleInvestigating approachesManual optimization ExplorationLearning1st Internal DataTest WorkloadsProcess LimitedProductionPilot AppsAgile DataFeedback LoopProcess LimitedPortfolioBroad App RangeIntense AnalyticsNew Feeds, Derived DataSpace LimitedData-CentricOrgImpacts Core BizNew ProductsAnalytic FocusSpace Limited
  4. Big Data solution FactoryBig Data Labs Asset partnering with Think Big Gather best practicesQtlry review of brainstormsVendor briefingsGartner and ind analysts and researchWhat is the criteria for techSelection on POC vs PilotRamp Adoption and share assetsCollaboration tools