Teradata Partners Conference ’14 
Applying Topological Data Analysis to Complex Data 
Abhishek Gupta, Senior Engineer
Ayasdi makes the world’s complex data useful by extracting powerful insights automatically.
Ayasdi named one of the Top 10 Most Innovative Companies in Big Data for 2013
These big data companies are ones to watch
The Structure Data Awards: Machine Learning / Artificial Intelligence
Top 100 Private Companies – Big Data/Analytics
Named by Mary Meeker as one of the most interesting companies in the data/analytics space
Company Confidential & Proprietary
The Promise of Big Data 
Information Understanding Business Impact
Why Do Current Approaches Fail? 
Today’s Approach to Analytics 
Hypothesis 
Challenges: 
• Incomplete and missing insights 
• Depends on humans to scale 
• Slow responses due to iteration
A New Approach Is Required 
Algorithms & Compute 
OR 
Benefits: 
• Automated understanding 
• Comprehensive 
• Fast
Comparison
Traditional Analytics | Ayasdi Approach
Hypothesis | Algorithms & Compute
Verifies | Explains
Labor Intensive | Automated
Analysts and Data Scientists | Domain Experts
Ayasdi’s topological framework incorporates, unifies, and enhances other disciplines. Because of these properties, it has extraordinary reach and effectiveness.
Statistics
Machine Learning
Geometry
Ayasdi & Teradata Partnership 
SQL Code 
DDL 
Data pushed through analysis
Key Benefit: Making your ETL process simpler.
Use Case: Anomaly Detection 
Leader in flash memory storage and software
ABOUT THE DATA 
• Data consists of die-level test information for one wafer
• 12,000+ dies, with 100+ tests run on each die
• Network was built using all the test columns 
• Test Result column with pass/fail flag used as metadata 
GOAL OF THE ANALYSIS 
• Identify different subgroups of dies based on similar test information
• Find tests that uniquely identify failed die subgroups 
Fortune 500 and S&P 500 company
$5B+ in revenue
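As a rough illustration of the setup described above (a network built from the test columns, with the pass/fail flag held back as metadata), the sketch below links any two dies whose test vectors are closer than a threshold, treats the connected components as subgroups, and only then reads each subgroup's failure rate from the flag. This is a plain neighborhood graph swapped in as a stand-in; the slides do not describe how Ayasdi constructs its network. The toy data, the `eps` threshold, and the function names are all hypothetical.

```python
from itertools import combinations
from math import dist

# Hypothetical toy data: each row is one die, each value one test measurement.
dies = [
    [1.0, 1.0], [1.1, 0.9], [0.9, 1.1],   # dies near (1, 1)
    [5.0, 5.0], [5.1, 4.9],               # dies near (5, 5)
    [9.0, 1.0], [9.1, 1.1],               # dies near (9, 1)
]
# Pass/fail flag kept aside as metadata; it is never used to build the graph.
passed = [True, True, True, False, False, False, True]


def neighborhood_components(rows, eps):
    """Link rows closer than eps; return connected components as index lists."""
    parent = list(range(len(rows)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(rows)), 2):
        if dist(rows[i], rows[j]) < eps:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(rows)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def failure_rate(group, flags):
    """Fraction of rows in the group whose pass flag is False."""
    return sum(not flags[i] for i in group) / len(group)


subgroups = neighborhood_components(dies, eps=0.5)
rates = [failure_rate(g, passed) for g in subgroups]
for group, rate in zip(subgroups, rates):
    print(group, rate)
```

Keeping the flag out of the distance computation mirrors the slide's use of Test Result as metadata: the grouping comes from the test measurements alone, and the flag is used afterwards to see where failures concentrate.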
Ayasdi Core™ Demo
Use Case: Anomaly Detection 
[Figure: network colored by Rows in Node; color scale High to Low]
Use Case: Anomaly Detection 
Key Takeaway: Tight concentration of wafers that pass their tests in the middle of the cluster
[Figure: network colored by Test Result=True; color scale High to Low]
Use Case: Anomaly Detection 
Key Takeaway: Two distinct regions of wafers failing their tests
→ Next action: investigate the “why”
[Figure: network colored by Test Result=False; color scale High to Low]
Use Case: Anomaly Detection 
Select first failure group to view underlying wafer properties
[Figure: network colored by Test Result=False; color scale High to Low]
Use Case: Anomaly Detection 
KS scores for test 13 show correlations for specific failures
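Assuming "KS" here refers to the two-sample Kolmogorov-Smirnov statistic, a score of this kind can be sketched as the largest gap between the empirical distributions of one test's values inside the selected failure group and in the remaining dies. The test names and values below are hypothetical, and the slides do not show how the product computes or ranks its scores.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    gap = 0.0
    # The largest gap between two step functions occurs at an observed value.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


# Hypothetical values: test name -> (values in the failure group, values in the rest).
tests = {
    "test_a": ([0.9, 1.0, 1.1, 1.2], [1.0, 1.1, 0.9, 1.2]),
    "test_b": ([5.0, 5.2, 5.1, 4.9], [1.0, 1.1, 0.9, 1.2]),
}

scores = {name: ks_statistic(grp, rest) for name, (grp, rest) in tests.items()}
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked, scores)
```

A score near 1 means the test's values in the failure group barely overlap with the rest, so that test is a candidate for explaining the group; a score near 0 means the test does not separate the group at all.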
Use Case: Anomaly Detection 
Select second failure group to view underlying wafer properties
[Figure: network; color scale High to Low]
Use Case: Anomaly Detection 
KS scores for tests 8, 11, and 3 show correlations for specific failures
Use Case: Anomaly Detection 
Leader in flash memory storage and software
CHALLENGE
• Pinpoint wafer anomalies that result in scrap and lost revenue
• Previously required at least two days of analysis to identify even the most systemic anomalies
SOLUTION
• Accelerated the analysis of wafer data and yield rates to identify and resolve issues
• Identified additional systemic anomalies previously dismissed as “random”
• Estimated to save a hundred million dollars in the first year through reduced scrap, by reducing yield loss by 10%
Fortune 500 and S&P 500 company
$5B+ in revenue
Corporate Headquarters 
4400 Bohannon Drive 
Suite #200 
Menlo Park, CA 94025 
ayasdi.com 
