SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
Organizing for Data Science 
Dan Mallinger 
Data Science Practice Manager 
September 2014
CONFIDENTIAL | Dan Mallinger 
• Data Science Practice Manager 
− Think Big Analytics 
• Working with clients across 
− Financial Services 
− Advertising 
− Manufacturing 
− Social 
− Network Providers 
CONFIDENTIAL 2
CONFIDENTIAL | Today 
• Define Data Science in the Organization 
• Look at Current Perspectives on Organization 
• Discuss Shortcomings 
• Review a Real World Solution 
CONFIDENTIAL 3
Ÿ Use Data to Improve Our 
Business 
Ÿ Better Understand Customers 
Ÿ Act Proactively, Not Reactively 
CONFIDENTIAL | What Do We Hope to Do? 
CONFIDENTIAL 4
CONFIDENTIAL | Ÿ Scale 
Ÿ Robustness 
Ÿ Repeatability 
Why Organize? 
CONFIDENTIAL 5
Ÿ Revolutionizing Ad Targeting 
Ÿ Automating Deals and 
Recommendations 
Ÿ Alerting Admins to New Network 
Attacks 
CONFIDENTIAL | Perception: What Does Data Science Do? 
CONFIDENTIAL 6
CONFIDENTIAL | Ÿ Specific Data Expertise 
Ÿ Exploratory Analysis 
Ÿ Modeling 
Ÿ Creativity 
Ÿ Programming 
Ÿ Big Data 
Ÿ Communication 
Ÿ Ability to Target Impact 
Ÿ Unstructured Analysis 
Ÿ Organizational Politics 
Ÿ Visualization 
Ÿ … 
What Does It Take? 
CONFIDENTIAL 7
CONFIDENTIAL | The New Toy: A Center of Excellence 
Ÿ Centralized 
- Brings data, analysis, and 
processing together 
- Data scientists support one 
another 
Ÿ Distributed 
- Data scientists close to 
business 
- Multiple models for rotating 
data scientists into lines of 
business 
CONFIDENTIAL 8 
Line of 
Business A 
CoE 
Line of 
Business B 
Line of 
Business C
CONFIDENTIAL | Ÿ Specific Data Expertise 
Ÿ Exploratory Analysis 
Ÿ Modeling 
Ÿ Creativity 
Ÿ Programming 
Ÿ Big Data 
Ÿ Communication 
Ÿ Ability to Target Impact 
Ÿ Unstructured Analysis 
Ÿ Organizational Politics 
Ÿ Visualization 
Ÿ … 
What Does It Still Take? 
CONFIDENTIAL 9
CONFIDENTIAL | Ÿ Designed a great home for unicorns 
Ÿ But they are still unicorns 
CONFIDENTIAL 10 
If You Build It, They Will Come?
Ÿ Unravel Capability 
Ÿ Map Activities to Functional Roles 
Ÿ Align Functions with Process, 
Not Individuals 
Ÿ Don’t Forget to Scale 
CONFIDENTIAL | Working with Horses, Not Unicorns 
CONFIDENTIAL 11
Ÿ Identify Fraudulent Sessions 
Ÿ Cross Channel Analysis 
Ÿ Next Best Action 
Ÿ Optimize Pathways 
Ÿ Determine Session Interest 
Ÿ Customizing Experience 
Ÿ Proactive Outreach 
Ÿ Search Analysis 
Ÿ Content Optimization 
CONFIDENTIAL | CLIENT EXAMPLE 
Clickstream Data in Action 
CONFIDENTIAL 12
Ÿ Billions of clicks 
Ÿ Unstructured data 
Ÿ How do we model it?! 
CONFIDENTIAL | Ÿ Model the SIGNAL 
Ÿ Not the data 
CLIENT EXAMPLE 
Scaling Data Science 
CONFIDENTIAL 13
MPP Web 
CONFIDENTIAL | CLIENT EXAMPLE 
Clickstream Data Science in Action 
CONFIDENTIAL 14 
Hadoop 1.0 
Feature Selection & 
Dimensionality Reduction
CONFIDENTIAL | Ÿ Feature Selection 
- Forests 
- Clustering 
Ÿ Dimensionality Reduction 
- SVM 
Ÿ Challenges 
- Job Latency 
- Limited Iterations 
CLIENT EXAMPLE 
Extracting Signal: Hadoop 1.0 
CONFIDENTIAL 15
CONFIDENTIAL | CLIENT EXAMPLE 
Extracting Signal: Hadoop 2.0 
• Spark 
− Faster response in exploration 
− Better Support for Iterative Models 
• Genetic Algorithms 
• Neural Networks 
• Challenges 
− In memory: costly and limiting 
− MapReduce does not go away 
CONFIDENTIAL 16
Ÿ Focus on Technical Skills 
- EDA 
- Modeling 
- Programming / Big Data 
Ÿ Communication Skills 
- Capturing signal needs 
- Iterating with stakeholders 
CONFIDENTIAL | CLIENT EXAMPLE 
Horses, Not Unicorns 
CONFIDENTIAL 17 
Hadoop 1.0
CONFIDENTIAL | CLIENT EXAMPLE 
CoE Next Steps 
• Continue to make signal available to analysts 
− Next up: Extracting signal from text 
• Act as a capability search party 
− Sprints of new insights and tools 
• Finalize operating model 
− Funding structure 
− Engagement model with lines of business 
CONFIDENTIAL 18
CONFIDENTIAL | Discussion Over Drinks 
CONFIDENTIAL 19

Weitere ähnliche Inhalte

Was ist angesagt?

The Five Data Questions
The Five Data QuestionsThe Five Data Questions
The Five Data Questions
crystalpullen
 
"Simplify Your Analytics Strategy" by Narendra Mulani
 "Simplify Your Analytics Strategy" by Narendra Mulani "Simplify Your Analytics Strategy" by Narendra Mulani
"Simplify Your Analytics Strategy" by Narendra Mulani
Sai Sandeep MN
 
Sisense Introduction PPT
Sisense Introduction PPTSisense Introduction PPT
Sisense Introduction PPT
Khirod Sahu
 

Was ist angesagt? (20)

"Making Data Actionable" by Budiman Rusly (KMK Online)
"Making Data Actionable" by Budiman Rusly (KMK Online)"Making Data Actionable" by Budiman Rusly (KMK Online)
"Making Data Actionable" by Budiman Rusly (KMK Online)
 
The Five Data Questions
The Five Data QuestionsThe Five Data Questions
The Five Data Questions
 
"Simplify Your Analytics Strategy" by Narendra Mulani
 "Simplify Your Analytics Strategy" by Narendra Mulani "Simplify Your Analytics Strategy" by Narendra Mulani
"Simplify Your Analytics Strategy" by Narendra Mulani
 
Big data sharing at fintech academy oct19 (1)
Big data sharing at fintech academy oct19 (1)Big data sharing at fintech academy oct19 (1)
Big data sharing at fintech academy oct19 (1)
 
"Planning Your Analytics Implementation" by Bachtiar Rifai (Kofera Technology)
"Planning Your Analytics Implementation" by Bachtiar Rifai (Kofera Technology)"Planning Your Analytics Implementation" by Bachtiar Rifai (Kofera Technology)
"Planning Your Analytics Implementation" by Bachtiar Rifai (Kofera Technology)
 
Data Science Salon: Building smart AI: How Deep Learning Can Get You Into Dee...
Data Science Salon: Building smart AI: How Deep Learning Can Get You Into Dee...Data Science Salon: Building smart AI: How Deep Learning Can Get You Into Dee...
Data Science Salon: Building smart AI: How Deep Learning Can Get You Into Dee...
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
 
1415 gold sanford
1415 gold sanford1415 gold sanford
1415 gold sanford
 
Data Science Salon: Digital Transformation: The Data Science Catalyst
Data Science Salon: Digital Transformation: The Data Science CatalystData Science Salon: Digital Transformation: The Data Science Catalyst
Data Science Salon: Digital Transformation: The Data Science Catalyst
 
1530 track2 reid
1530 track2 reid1530 track2 reid
1530 track2 reid
 
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
 
Using text analytics to manage mobile qual to manage mobile Qual Data - Civicom
Using text analytics to manage mobile qual to manage mobile Qual Data - CivicomUsing text analytics to manage mobile qual to manage mobile Qual Data - Civicom
Using text analytics to manage mobile qual to manage mobile Qual Data - Civicom
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
 
Dynamics Day 2016: digital transformation - getting personal
Dynamics Day 2016: digital transformation - getting personalDynamics Day 2016: digital transformation - getting personal
Dynamics Day 2016: digital transformation - getting personal
 
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)
 
H2O World - NCS Continuous Media Optimization w/H2O - Satya Satyamoorthy
H2O World - NCS Continuous Media Optimization w/H2O - Satya SatyamoorthyH2O World - NCS Continuous Media Optimization w/H2O - Satya Satyamoorthy
H2O World - NCS Continuous Media Optimization w/H2O - Satya Satyamoorthy
 
Data Science in Action for an Insurance Product - Shawn Jin
Data Science in Action for an Insurance Product - Shawn JinData Science in Action for an Insurance Product - Shawn Jin
Data Science in Action for an Insurance Product - Shawn Jin
 
Bde presentation dv
Bde presentation dvBde presentation dv
Bde presentation dv
 
Sisense Introduction PPT
Sisense Introduction PPTSisense Introduction PPT
Sisense Introduction PPT
 
Unlock Your Data
Unlock Your DataUnlock Your Data
Unlock Your Data
 

Ähnlich wie Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL

Competitive Advantage from the Data Lake
Competitive Advantage from the Data LakeCompetitive Advantage from the Data Lake
Competitive Advantage from the Data Lake
Argyle Executive Forum
 
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
Databricks
 
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Innovation Enterprise
 

Ähnlich wie Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL (20)

Big Analytics: Building Lasting Value
Big Analytics: Building Lasting ValueBig Analytics: Building Lasting Value
Big Analytics: Building Lasting Value
 
Customer segmentation for business success with knime
Customer segmentation for business success with knimeCustomer segmentation for business success with knime
Customer segmentation for business success with knime
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Competitive Advantage from the Data Lake
Competitive Advantage from the Data LakeCompetitive Advantage from the Data Lake
Competitive Advantage from the Data Lake
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
Denodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data LayerDenodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
 
BI and Big Data DeepDive - Pressmart
BI and Big Data DeepDive - PressmartBI and Big Data DeepDive - Pressmart
BI and Big Data DeepDive - Pressmart
 
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
 
Build it…will they come by Shawn Trainer
 Build it…will they come by Shawn Trainer Build it…will they come by Shawn Trainer
Build it…will they come by Shawn Trainer
 
The new patterns of innovation
The new patterns of innovationThe new patterns of innovation
The new patterns of innovation
 
Transforming Business with Smarter Analytics
Transforming Business with Smarter AnalyticsTransforming Business with Smarter Analytics
Transforming Business with Smarter Analytics
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial Intelligence
 
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data VirtualizationEnabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics Cloud
 
Making Money Out of Data
Making Money Out of DataMaking Money Out of Data
Making Money Out of Data
 
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
 
ISDI MDA Master Class November_2015
ISDI MDA Master Class November_2015ISDI MDA Master Class November_2015
ISDI MDA Master Class November_2015
 
Data Strategy - Executive MBA Class, IE Business School
Data Strategy - Executive MBA Class, IE Business SchoolData Strategy - Executive MBA Class, IE Business School
Data Strategy - Executive MBA Class, IE Business School
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying data
 

Mehr von MLconf

Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
MLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
MLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
MLconf
 

Mehr von MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 

Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL

  • 1. Organizing for Data Science Dan Mallinger Data Science Practice Manager September 2014
  • 2. CONFIDENTIAL | Dan Mallinger • Data Science Practice Manager − Think Big Analytics • Working with clients across − Financial Services − Advertising − Manufacturing − Social − Network Providers CONFIDENTIAL 2
  • 3. CONFIDENTIAL | Today • Define Data Science in the Organization • Look at Current Perspectives on Organization • Discuss Shortcomings • Review a Real World Solution CONFIDENTIAL 3
  • 4. Ÿ Use Data to Improve Our Business Ÿ Better Understand Customers Ÿ Act Proactively, Not Reactively CONFIDENTIAL | What Do We Hope to Do? CONFIDENTIAL 4
  • 5. CONFIDENTIAL | Ÿ Scale Ÿ Robustness Ÿ Repeatability Why Organize? CONFIDENTIAL 5
  • 6. Ÿ Revolutionizing Ad Targeting Ÿ Automating Deals and Recommendations Ÿ Alerting Admins to New Network Attacks CONFIDENTIAL | Perception: What Does Data Science Do? CONFIDENTIAL 6
  • 7. CONFIDENTIAL | Ÿ Specific Data Expertise Ÿ Exploratory Analysis Ÿ Modeling Ÿ Creativity Ÿ Programming Ÿ Big Data Ÿ Communication Ÿ Ability to Target Impact Ÿ Unstructured Analysis Ÿ Organizational Politics Ÿ Visualization Ÿ … What Does It Take? CONFIDENTIAL 7
  • 8. CONFIDENTIAL | The New Toy: A Center of Excellence Ÿ Centralized - Brings data, analysis, and processing together - Data scientists support one another Ÿ Distributed - Data scientists close to business - Multiple models for rotating data scientists into lines of business CONFIDENTIAL 8 Line of Business A CoE Line of Business B Line of Business C
  • 9. CONFIDENTIAL | Ÿ Specific Data Expertise Ÿ Exploratory Analysis Ÿ Modeling Ÿ Creativity Ÿ Programming Ÿ Big Data Ÿ Communication Ÿ Ability to Target Impact Ÿ Unstructured Analysis Ÿ Organizational Politics Ÿ Visualization Ÿ … What Does It Still Take? CONFIDENTIAL 9
  • 10. CONFIDENTIAL | Ÿ Designed a great home for unicorns Ÿ But they are still unicorns CONFIDENTIAL 10 If You Build It, They Will Come?
  • 11. Ÿ Unravel Capability Ÿ Map Activities to Functional Roles Ÿ Align Functions with Process, Not Individuals Ÿ Don’t Forget to Scale CONFIDENTIAL | Working with Horses, Not Unicorns CONFIDENTIAL 11
  • 12. Ÿ Identify Fraudulent Sessions Ÿ Cross Channel Analysis Ÿ Next Best Action Ÿ Optimize Pathways Ÿ Determine Session Interest Ÿ Customizing Experience Ÿ Proactive Outreach Ÿ Search Analysis Ÿ Content Optimization CONFIDENTIAL | CLIENT EXAMPLE Clickstream Data in Action CONFIDENTIAL 12
  • 13. Ÿ Billions of clicks Ÿ Unstructured data Ÿ How do we model it?! CONFIDENTIAL | Ÿ Model the SIGNAL Ÿ Not the data CLIENT EXAMPLE Scaling Data Science CONFIDENTIAL 13
  • 14. MPP Web CONFIDENTIAL | CLIENT EXAMPLE Clickstream Data Science in Action CONFIDENTIAL 14 Hadoop 1.0 Feature Selection & Dimensionality Reduction
  • 15. CONFIDENTIAL | Ÿ Feature Selection - Forests - Clustering Ÿ Dimensionality Reduction - SVM Ÿ Challenges - Job Latency - Limited Iterations CLIENT EXAMPLE Extracting Signal: Hadoop 1.0 CONFIDENTIAL 15
  • 16. CONFIDENTIAL | CLIENT EXAMPLE Extracting Signal: Hadoop 2.0 • Spark − Faster response in exploration − Better Support for Iterative Models • Genetic Algorithms • Neural Networks • Challenges − In memory: costly and limiting − MapReduce does not go away CONFIDENTIAL 16
  • 17. Ÿ Focus on Technical Skills - EDA - Modeling - Programming / Big Data Ÿ Communication Skills - Capturing signal needs - Iterating with stakeholders CONFIDENTIAL | CLIENT EXAMPLE Horses, Not Unicorns CONFIDENTIAL 17 Hadoop 1.0
  • 18. CONFIDENTIAL | CLIENT EXAMPLE CoE Next Steps • Continue to make signal available to analysts − Next up: Extracting signal from text • Act as a capability search party − Sprints of new insights and tools • Finalize operating model − Funding structure − Engagement model with lines of business CONFIDENTIAL 18
  • 19. CONFIDENTIAL | Discussion Over Drinks CONFIDENTIAL 19