SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
Introducing
Datameer 4.0!
Visual, End-to-End!
© 2014 Datameer, Inc. All rights reserved.
View Recording!!
!
You can view the recording of this webinar at:!
!
http://info.datameer.com/Online-Slideshare-
Datameer-4-0-Visual-End-to-End-
OnDemand.html!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
About Our Speakers!
Matt Schumpert @datameer!
Senior Director, Solutions Engineering!
!
Matt has been working in the enterprise infrastructure
software space for over 14 years in various capacities,
including sales engineering, strategic alliances and
consulting.!
!
Matt currently runs the pre-sales engineering team at
Datameer, supporting all technical aspects of customer
engagement from initial contact through roll-out of
customers into production.!
!
Matt holds a BS in Computer Science from the
University of Virginia. !
#datameer
@datameer!
© 2013 Datameer, Inc. All rights reserved.
About Our Speaker !
Matt McManus @datameer
Vice President, Engineering

Matt has been building enterprise software products for over 10
years with deep experience in architecture, software engineering and
team management roles.

Matt currently leads the engineering team at Datameer, managing all
aspects of product development, releases and quality assurance.

Matt attended Boston University where he earned a Bachelor’s
degree in Computer Science. 
#datameer @datameer!
© 2014 Datameer, Inc. All rights reserved.
The Lean Data Supply Chain!
Classical Data Pipeline!
Modern Data Pipeline!
© 2014 Datameer, Inc. All rights reserved.
The Lean Data Supply Chain!
© 2014 Datameer, Inc. All rights reserved.
Informatica!
Talend!
Flume!
Sqoop!
Trifacta!
Paxata!
PIG!
Hive!
Impala!
Tableau!
Platfora!
© 2013 Datameer, Inc. All rights reserved.
The Lean Data Supply Chain!
Integrate! Analyze! Visualize!Prepare!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
An end-to-end Solution!
Analytics! Visualization!Data Integration!
Any Distro!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
Smart Analytics!
Clustering
gg
 Column Dependencies
Recommendation
Decision Trees
© 2014 Datameer, Inc. All rights reserved.
Enterprise Integration!
Introducing
Datameer 4.0!
Visual Insights at Every Step!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
Introducing ‘Flip-Side’
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
Before !
Integrate! Analyze! Visualize!Prepare!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
Now!
Integrate! Analyze! Visualize!Prepare!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
Problems Solved
Before:! With Datameer 4.0:!
Multiple Tools!
Not for business!
Visualize at End!
Single Platform!
Self-Service!
Visual Insights at Every Step!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
Use Cases and Impact
Industry! Challenge! Impact!
Banking!
Identify credit scores that were out of range
based on zip code (credit scores in affluent
areas tend to be higher than in others)!
!
Identify loans that have highest risk and
better quantify risk exposure (>$13M)!
!
Retail!
Identify missing product id or inaccurate
product descriptions!
!
Inventory: Slower turnover of stock!
Fulfillment: Out of stock at customers!
Logistics: Distribution errors and rework,
extra shipping costs (>$1M)!
Telco!
Identify incorrect subscriber data (e.g.
invalid email addresses) that will skew
results on usage in particular area!
By correlating subscriber data with
network performance data, meet existing
and forecasted demand, but not excess
capacity resulting in inflated capital
expenditures. (>$140M)!
Telco!
Identify incorrect subscriber data (e.g.
negative ages) that will skew segments
used for churn analysis!
Discount and retention campaigns are
executed optimally and targeted to the
right clusters, avoiding lost revenue!
© 2014 Datameer, Inc. All rights reserved.
4.0 Technical Details!
Matt McManus!
VP, Engineering!
© 2014 Datameer, Inc. All rights reserved.
Column Metrics Collection!
Metric! Supported Column Types!
Cardinality*! All!
Histogram*! Numeric + Date!
Frequency* (Top K)! All!
Summary (min/max/mean)! Numeric + Date!
Null vs. Present! All!
* indicates estimated value!
© 2014 Datameer, Inc. All rights reserved.
Performance Implications!
!   Metrics are calculated using streaming
techniques designed to minimize performance
impacts!
!   Often an estimate is provided to achieve high
performance!
!   Collection can be disabled on a per job or cluster
wide basis!
© 2014 Datameer, Inc. All rights reserved.
Visual Profiling of Full Results!
!   Column statistics available on full results of every
worksheet (without leaving workbook)!
!   Column statistics fall back to “preview” in certain
circumstances!
! Visual cues guide users:!
© 2014 Datameer, Inc. All rights reserved.
Flip-side with Smart Analytics!
!   Visualize model on full results!
• Decision trees!
• Column dependencies!
!   Visually explore cluster composition!
•  Compare data shape across clusters !
!   Enhancements to recommendation visualizations!
© 2014 Datameer, Inc. All rights reserved.
Demo …!
Customer Churn!
@Datameer!
© 2014 Datameer, Inc. All rights reserved.
© 2013 Datameer, Inc. All rights reserved.
For More Information!
#datameer @datameer!
!   http://www.datameer.com!
!  @datameer!
mschumpert@datameer.com!
mmcmanus@datameer.com!

Weitere ähnliche Inhalte

Was ist angesagt?

Analyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop WebinarAnalyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop WebinarDatameer
 
Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Datameer
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera, Inc.
 
Best Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by DatameerBest Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by DatameerDatameer
 
Engaging with Cloudera & Morning Wrap Up
Engaging with Cloudera & Morning Wrap UpEngaging with Cloudera & Morning Wrap Up
Engaging with Cloudera & Morning Wrap UpCloudera, Inc.
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeCloudera, Inc.
 
Get Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber SolutionGet Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber SolutionCloudera, Inc.
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfullyAdir Sharabi
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataCloudera, Inc.
 
Transforming Business for the Digital Age (Presented by Microsoft)
Transforming Business for the Digital Age (Presented by Microsoft)Transforming Business for the Digital Age (Presented by Microsoft)
Transforming Business for the Digital Age (Presented by Microsoft)Cloudera, Inc.
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)Cloudera, Inc.
 
How to implement hadoop successfuly
How to implement hadoop successfulyHow to implement hadoop successfuly
How to implement hadoop successfulyAdir Sharabi
 
Optimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsOptimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsCloudera, Inc.
 
The Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersThe Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersCloudera, Inc.
 
How to Avoid Pitfalls in Big Data Analytics Webinar
How to Avoid Pitfalls in Big Data Analytics WebinarHow to Avoid Pitfalls in Big Data Analytics Webinar
How to Avoid Pitfalls in Big Data Analytics WebinarDatameer
 
Big Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial ServicesBig Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial ServicesCloudera, Inc.
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Cloudera, Inc.
 
Markerstudy Group Drives Growth and Innovation
Markerstudy Group Drives Growth and InnovationMarkerstudy Group Drives Growth and Innovation
Markerstudy Group Drives Growth and InnovationCloudera, Inc.
 
Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17Cloudera, Inc.
 
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Cloudera, Inc.
 

Was ist angesagt? (20)

Analyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop WebinarAnalyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop Webinar
 
Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
 
Best Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by DatameerBest Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by Datameer
 
Engaging with Cloudera & Morning Wrap Up
Engaging with Cloudera & Morning Wrap UpEngaging with Cloudera & Morning Wrap Up
Engaging with Cloudera & Morning Wrap Up
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural Change
 
Get Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber SolutionGet Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber Solution
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfully
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
 
Transforming Business for the Digital Age (Presented by Microsoft)
Transforming Business for the Digital Age (Presented by Microsoft)Transforming Business for the Digital Age (Presented by Microsoft)
Transforming Business for the Digital Age (Presented by Microsoft)
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)
 
How to implement hadoop successfuly
How to implement hadoop successfulyHow to implement hadoop successfuly
How to implement hadoop successfuly
 
Optimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsOptimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analytics
 
The Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersThe Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent Offers
 
How to Avoid Pitfalls in Big Data Analytics Webinar
How to Avoid Pitfalls in Big Data Analytics WebinarHow to Avoid Pitfalls in Big Data Analytics Webinar
How to Avoid Pitfalls in Big Data Analytics Webinar
 
Big Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial ServicesBig Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial Services
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
 
Markerstudy Group Drives Growth and Innovation
Markerstudy Group Drives Growth and InnovationMarkerstudy Group Drives Growth and Innovation
Markerstudy Group Drives Growth and Innovation
 
Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17Transform Banking with Big Data and Automated Machine Learning 9.12.17
Transform Banking with Big Data and Automated Machine Learning 9.12.17
 
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
 

Ähnlich wie Webinar - Introducing Datameer 4.0: Visual, End-to-End

Online Fraud Detection Using Big Data Analytics Webinar
Online Fraud Detection Using Big Data Analytics WebinarOnline Fraud Detection Using Big Data Analytics Webinar
Online Fraud Detection Using Big Data Analytics WebinarDatameer
 
Iasa Architect responsibilities in the cloud
Iasa Architect responsibilities in the cloudIasa Architect responsibilities in the cloud
Iasa Architect responsibilities in the cloudiasaglobal
 
Cognos Data Manager Support Changes: Entitlements Migrate to DataStage
Cognos Data Manager Support Changes: Entitlements Migrate to DataStageCognos Data Manager Support Changes: Entitlements Migrate to DataStage
Cognos Data Manager Support Changes: Entitlements Migrate to DataStageSenturus
 
Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...
Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...
Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...Senturus
 
Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...
Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...
Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...Senturus
 
I_Heart_DAM_DIGITAL
I_Heart_DAM_DIGITALI_Heart_DAM_DIGITAL
I_Heart_DAM_DIGITALSjors Bos
 
Understanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big DataUnderstanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big DataAnalyticsWeek
 
Nvent Enabling The Data Driven Enterprise
Nvent Enabling The Data Driven EnterpriseNvent Enabling The Data Driven Enterprise
Nvent Enabling The Data Driven EnterpriseGrafic.guru
 
Mike Siegler at INCOSE Minneapolis, 2014
Mike Siegler at INCOSE Minneapolis, 2014Mike Siegler at INCOSE Minneapolis, 2014
Mike Siegler at INCOSE Minneapolis, 2014Etherios
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useSwiss Big Data User Group
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsDatameer
 
FME:23 Bringing Life to Data
FME:23 Bringing Life to DataFME:23 Bringing Life to Data
FME:23 Bringing Life to DataSafe Software
 
Validus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updatesValidus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updatesRick Catalano
 
Case Study on Big Data Service for Manufacturing - Silver Touch Technologies
Case Study on Big Data Service for Manufacturing - Silver Touch TechnologiesCase Study on Big Data Service for Manufacturing - Silver Touch Technologies
Case Study on Big Data Service for Manufacturing - Silver Touch TechnologiesSilver Touch Technologies
 
Live Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & Tricks
Live Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & TricksLive Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & Tricks
Live Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & TricksNet at Work
 
Imaginea Overview
Imaginea OverviewImaginea Overview
Imaginea OverviewJimit Shah
 
Spacestem - Web Development Company overview
Spacestem - Web Development Company overviewSpacestem - Web Development Company overview
Spacestem - Web Development Company overviewJayesh Pau
 
Certero ITAM Review Tools Day
Certero ITAM Review Tools Day Certero ITAM Review Tools Day
Certero ITAM Review Tools Day Martin Thompson
 

Ähnlich wie Webinar - Introducing Datameer 4.0: Visual, End-to-End (20)

Online Fraud Detection Using Big Data Analytics Webinar
Online Fraud Detection Using Big Data Analytics WebinarOnline Fraud Detection Using Big Data Analytics Webinar
Online Fraud Detection Using Big Data Analytics Webinar
 
Iasa Architect responsibilities in the cloud
Iasa Architect responsibilities in the cloudIasa Architect responsibilities in the cloud
Iasa Architect responsibilities in the cloud
 
Cognos Data Manager Support Changes: Entitlements Migrate to DataStage
Cognos Data Manager Support Changes: Entitlements Migrate to DataStageCognos Data Manager Support Changes: Entitlements Migrate to DataStage
Cognos Data Manager Support Changes: Entitlements Migrate to DataStage
 
Azure unleashed
Azure unleashedAzure unleashed
Azure unleashed
 
Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...
Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...
Improving the Planning Cycle for Sophisticated Business Needs: TM1 Demo, Plus...
 
Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...
Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...
Self-Service Business Authoring in a Managed Reporting World: IBM Cognos Work...
 
I_Heart_DAM_DIGITAL
I_Heart_DAM_DIGITALI_Heart_DAM_DIGITAL
I_Heart_DAM_DIGITAL
 
Understanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big DataUnderstanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big Data
 
Interviewstreet goals
Interviewstreet goalsInterviewstreet goals
Interviewstreet goals
 
Nvent Enabling The Data Driven Enterprise
Nvent Enabling The Data Driven EnterpriseNvent Enabling The Data Driven Enterprise
Nvent Enabling The Data Driven Enterprise
 
Mike Siegler at INCOSE Minneapolis, 2014
Mike Siegler at INCOSE Minneapolis, 2014Mike Siegler at INCOSE Minneapolis, 2014
Mike Siegler at INCOSE Minneapolis, 2014
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to use
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data Analytics
 
FME:23 Bringing Life to Data
FME:23 Bringing Life to DataFME:23 Bringing Life to Data
FME:23 Bringing Life to Data
 
Validus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updatesValidus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updates
 
Case Study on Big Data Service for Manufacturing - Silver Touch Technologies
Case Study on Big Data Service for Manufacturing - Silver Touch TechnologiesCase Study on Big Data Service for Manufacturing - Silver Touch Technologies
Case Study on Big Data Service for Manufacturing - Silver Touch Technologies
 
Live Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & Tricks
Live Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & TricksLive Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & Tricks
Live Webinar: E2E Transition to Net@Work & Sage 300 ERP (Accpac) Tips & Tricks
 
Imaginea Overview
Imaginea OverviewImaginea Overview
Imaginea Overview
 
Spacestem - Web Development Company overview
Spacestem - Web Development Company overviewSpacestem - Web Development Company overview
Spacestem - Web Development Company overview
 
Certero ITAM Review Tools Day
Certero ITAM Review Tools Day Certero ITAM Review Tools Day
Certero ITAM Review Tools Day
 

Mehr von Datameer

Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data AnalyticsDatameer
 
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...Datameer
 
Why Use Hadoop for Big Data Analytics?
Why Use Hadoop for Big Data Analytics?Why Use Hadoop for Big Data Analytics?
Why Use Hadoop for Big Data Analytics?Datameer
 
Why Use Hadoop?
Why Use Hadoop?Why Use Hadoop?
Why Use Hadoop?Datameer
 
BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics? BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics? Datameer
 
Is Your Hadoop Environment Secure?
Is Your Hadoop Environment Secure?Is Your Hadoop Environment Secure?
Is Your Hadoop Environment Secure?Datameer
 
Complement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & HadoopComplement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & HadoopDatameer
 
Lean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use CaseLean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use CaseDatameer
 
The Economics of SQL on Hadoop
The Economics of SQL on HadoopThe Economics of SQL on Hadoop
The Economics of SQL on HadoopDatameer
 
Top 3 Considerations for Machine Learning on Big Data
Top 3 Considerations for Machine Learning on Big DataTop 3 Considerations for Machine Learning on Big Data
Top 3 Considerations for Machine Learning on Big DataDatameer
 
How to do Predictive Analytics with Limited Data
How to do Predictive Analytics with Limited DataHow to do Predictive Analytics with Limited Data
How to do Predictive Analytics with Limited DataDatameer
 

Mehr von Datameer (11)

Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data Analytics
 
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
 
Why Use Hadoop for Big Data Analytics?
Why Use Hadoop for Big Data Analytics?Why Use Hadoop for Big Data Analytics?
Why Use Hadoop for Big Data Analytics?
 
Why Use Hadoop?
Why Use Hadoop?Why Use Hadoop?
Why Use Hadoop?
 
BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics? BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics?
 
Is Your Hadoop Environment Secure?
Is Your Hadoop Environment Secure?Is Your Hadoop Environment Secure?
Is Your Hadoop Environment Secure?
 
Complement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & HadoopComplement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & Hadoop
 
Lean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use CaseLean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use Case
 
The Economics of SQL on Hadoop
The Economics of SQL on HadoopThe Economics of SQL on Hadoop
The Economics of SQL on Hadoop
 
Top 3 Considerations for Machine Learning on Big Data
Top 3 Considerations for Machine Learning on Big DataTop 3 Considerations for Machine Learning on Big Data
Top 3 Considerations for Machine Learning on Big Data
 
How to do Predictive Analytics with Limited Data
How to do Predictive Analytics with Limited DataHow to do Predictive Analytics with Limited Data
How to do Predictive Analytics with Limited Data
 

Kürzlich hochgeladen

Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 

Webinar - Introducing Datameer 4.0: Visual, End-to-End

  • 2. © 2014 Datameer, Inc. All rights reserved. View Recording!! ! You can view the recording of this webinar at:! ! http://info.datameer.com/Online-Slideshare- Datameer-4-0-Visual-End-to-End- OnDemand.html!
  • 3. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. About Our Speakers! Matt Schumpert @datameer! Senior Director, Solutions Engineering! ! Matt has been working in the enterprise infrastructure software space for over 14 years in various capacities, including sales engineering, strategic alliances and consulting.! ! Matt currently runs the pre-sales engineering team at Datameer, supporting all technical aspects of customer engagement from initial contact through roll-out of customers into production.! ! Matt holds a BS in Computer Science from the University of Virginia. ! #datameer @datameer!
  • 4. © 2013 Datameer, Inc. All rights reserved. About Our Speaker ! Matt McManus @datameer Vice President, Engineering Matt has been building enterprise software products for over 10 years with deep experience in architecture, software engineering and team management roles. Matt currently leads the engineering team at Datameer, managing all aspects of product development, releases and quality assurance. Matt attended Boston University where he earned a Bachelor’s degree in Computer Science. #datameer @datameer!
  • 5. © 2014 Datameer, Inc. All rights reserved. The Lean Data Supply Chain!
  • 8. © 2014 Datameer, Inc. All rights reserved. The Lean Data Supply Chain!
  • 9. © 2014 Datameer, Inc. All rights reserved. Informatica! Talend! Flume! Sqoop! Trifacta! Paxata! PIG! Hive! Impala! Tableau! Platfora! © 2013 Datameer, Inc. All rights reserved. The Lean Data Supply Chain! Integrate! Analyze! Visualize!Prepare!
  • 10. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. An end-to-end Solution! Analytics! Visualization!Data Integration! Any Distro!
  • 11. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Smart Analytics! Clustering gg Column Dependencies Recommendation Decision Trees
  • 12. © 2014 Datameer, Inc. All rights reserved. Enterprise Integration!
  • 14. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Introducing ‘Flip-Side’
  • 15. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Before ! Integrate! Analyze! Visualize!Prepare!
  • 16. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Now! Integrate! Analyze! Visualize!Prepare!
  • 17. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Problems Solved Before:! With Datameer 4.0:! Multiple Tools! Not for business! Visualize at End! Single Platform! Self-Service! Visual Insights at Every Step!
  • 18. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Use Cases and Impact Industry! Challenge! Impact! Banking! Identify credit scores that were out of range based on zip code (credit scores in affluent areas tend to be higher than in others)! ! Identify loans that have highest risk and better quantify risk exposure (>$13M)! ! Retail! Identify missing product id or inaccurate product descriptions! ! Inventory: Slower turnover of stock! Fulfillment: Out of stock at customers! Logistics: Distribution errors and rework, extra shipping costs (>$1M)! Telco! Identify incorrect subscriber data (e.g. invalid email addresses) that will skew results on usage in particular area! By correlating subscriber data with network performance data, meet existing and forecasted demand, but not excess capacity resulting in inflated capital expenditures. (>$140M)! Telco! Identify incorrect subscriber data (e.g. negative ages) that will skew segments used for churn analysis! Discount and retention campaigns are executed optimally and targeted to the right clusters, avoiding lost revenue!
  • 19. © 2014 Datameer, Inc. All rights reserved. 4.0 Technical Details! Matt McManus! VP, Engineering!
  • 20. © 2014 Datameer, Inc. All rights reserved. Column Metrics Collection! Metric! Supported Column Types! Cardinality*! All! Histogram*! Numeric + Date! Frequency* (Top K)! All! Summary (min/max/mean)! Numeric + Date! Null vs. Present! All! * indicates estimated value!
  • 21. © 2014 Datameer, Inc. All rights reserved. Performance Implications! !   Metrics are calculated using streaming techniques designed to minimize performance impacts! !   Often an estimate is provided to achieve high performance! !   Collection can be disabled on a per job or cluster wide basis!
  • 22. © 2014 Datameer, Inc. All rights reserved. Visual Profiling of Full Results! !   Column statistics available on full results of every worksheet (without leaving workbook)! !   Column statistics fall back to “preview” in certain circumstances! ! Visual cues guide users:!
  • 23. © 2014 Datameer, Inc. All rights reserved. Flip-side with Smart Analytics! !   Visualize model on full results! • Decision trees! • Column dependencies! !   Visually explore cluster composition! •  Compare data shape across clusters ! !   Enhancements to recommendation visualizations!
  • 24. © 2014 Datameer, Inc. All rights reserved. Demo …! Customer Churn!
  • 26. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. For More Information! #datameer @datameer! !   http://www.datameer.com! !  @datameer! mschumpert@datameer.com! mmcmanus@datameer.com!