SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Unlocking value from data with Data Integration Tools Phil Watt, Principal Integration Architect, HP Business Intelligence Solutions, EMEA 29/04/2010 1
Outline Introduction Business drivers – why use a DI tool? the challenge private sector public sector Background and history DI tools timeline Emerging features – and value Governance and Best Practice Selecting a tool for your situation Demonstration: Summary – followed by hands on session 29/04/2010 2
About me 29/04/2010 3 19 years big data 10 years Data Integration tools High volume Complex business rules Governance and metadata management Clients include BSkyB BT Barclays/Barclaycard Centrica  Experian John Lewis Partnership Microsoft A major UK political party Strong focus on pragmatic delivery Best practices Design patterns Tool evaluation, selection and implementation
Scope 29/04/2010 4
Glossary 29/04/2010 5
The challenge 29/04/2010 6
Data warehouse example sizes 29/04/2010 7
Public and academic examples 29/04/2010 8 Birmingham City Council http://www.experian.co.uk/www/pages/about_us/our_clients/ http://www.qas.co.uk/company/press/new-experian-software-helps-public-sector-to-enhance-single-citizen-view-projects-503.htm University of Toulouse – academic medical research http://www.talend.com/open-source-provider/casestudy/CaseStudy_Academic_Medical_Research_EN.php
Benefits of DI tools 29/04/2010 9
Extract, Transform and Load 29/04/2010 10 e.g. CRM or  ERP system Hub and spoke Shared DW and ETL server
Extract, Load and Transform 29/04/2010 11 e.g. CRM or  ERP system Shared DW and ETL server
ETL versus ELT 29/04/2010 12
Multiple sources and targets 29/04/2010 13
DI Tools Features Timeline1995 – 2005 29/04/2010 14
DI Tools Features Timeline from 2006 29/04/2010 15
Market features 29/04/2010 16
Gartner Magic Quadrant Taken from research document, ‘Magic Quadrant for Data Integration Tools’  Authors: Ted Friedman, Mark A. Beyer, Eric Thoo Full report available by registering at www.talend.com 29/04/2010 17 Image removed for web publication as agreed with Gartner
Magic Quadrant Disclaimer The Magic Quadrant is copyrighted November 25, 2009 by Gartner, Inc. and is reused with permission.  The Magic Quadrant is a graphical representation of a marketplace at and for a specific time period.  It depicts Gartner's analysis of how certain vendors measure against criteria for that marketplace, as defined by Gartner.  Gartner does not endorse any vendor, product or service depicted in the Magic Quadrant, and does not advise technology users to select only those vendors placed in the "Leaders" quadrant.  The Magic Quadrant is intended solely as a research tool, and is not meant to be a specific guide to action.  Gartner disclaims all warranties, express or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose. 29/04/2010 18
Best practices 29/04/2010 19
Worst Practices 29/04/2010 20
Gartner advice 29/04/2010 21 ,[object Object]
Allocate 20 - 30% to mapping and transformation rules
Avoid custom-coding or desktop tools
Increase business user involvement to improve successBest Practices Mitigate Data Migration Risks and Challenges – May 2009
Governance and the data integration lifecycle 29/04/2010 22
Best practices 29/04/2010 23 ,[object Object]
Spend 50% of project time doing discovery, analysis, design
Get business users involved early and often
Use tools to accelerate and compress timescales
Pay attention to governance and metadata
So you can:
De-risk the project
Reduce overall cost and timescales
Achieve best possible quality,[object Object]
Qualification matrix (PW ) 29/04/2010 25
Demonstration 29/04/2010 26
29/04/2010 27

Weitere ähnliche Inhalte

Ähnlich wie Unlocking value from data with data integration tools

Modern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and ImplementationsModern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and Implementations
David J Rosenthal
 
Technology business management_7.13
Technology business management_7.13Technology business management_7.13
Technology business management_7.13
James Sutter
 
Technology business management_7.13
Technology business management_7.13Technology business management_7.13
Technology business management_7.13
Jim Sutter
 
Business Intelligence (Av Arif Shafique)
Business Intelligence (Av Arif Shafique)Business Intelligence (Av Arif Shafique)
Business Intelligence (Av Arif Shafique)
Microsoft Norge AS
 
Successful Processes for Selecting a Content Management System: How to Become...
Successful Processes for Selecting a Content Management System: How to Become...Successful Processes for Selecting a Content Management System: How to Become...
Successful Processes for Selecting a Content Management System: How to Become...
Scott Abel
 
Cloud cpmputing and busness processes
Cloud cpmputing and busness processesCloud cpmputing and busness processes
Cloud cpmputing and busness processes
Minka Fudulova
 

Ähnlich wie Unlocking value from data with data integration tools (20)

Modern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and ImplementationsModern Business Intelligence - Design and Implementations
Modern Business Intelligence - Design and Implementations
 
Bp006 Duguid
Bp006 DuguidBp006 Duguid
Bp006 Duguid
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester Webinar
 
Big data analytics fas trak solution overview
Big data analytics fas trak solution overviewBig data analytics fas trak solution overview
Big data analytics fas trak solution overview
 
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
 
Technology business management_7.13
Technology business management_7.13Technology business management_7.13
Technology business management_7.13
 
Technology business management_7.13
Technology business management_7.13Technology business management_7.13
Technology business management_7.13
 
eBook-DataSciencePlatform
eBook-DataSciencePlatformeBook-DataSciencePlatform
eBook-DataSciencePlatform
 
Game plan wkshp1
Game plan wkshp1Game plan wkshp1
Game plan wkshp1
 
Forecast 2014: Business Strategy Enabled by Cloud
Forecast 2014:  Business Strategy Enabled by Cloud Forecast 2014:  Business Strategy Enabled by Cloud
Forecast 2014: Business Strategy Enabled by Cloud
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
 
Reporte forrester bpms
Reporte forrester bpmsReporte forrester bpms
Reporte forrester bpms
 
Business Intelligence (Av Arif Shafique)
Business Intelligence (Av Arif Shafique)Business Intelligence (Av Arif Shafique)
Business Intelligence (Av Arif Shafique)
 
DALICC (Data Licenses Clearance Centre)
DALICC (Data Licenses Clearance Centre)DALICC (Data Licenses Clearance Centre)
DALICC (Data Licenses Clearance Centre)
 
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
 
Successful Processes for Selecting a Content Management System: How to Become...
Successful Processes for Selecting a Content Management System: How to Become...Successful Processes for Selecting a Content Management System: How to Become...
Successful Processes for Selecting a Content Management System: How to Become...
 
Graphs in Telecommunications - Jesus Barrasa, Neo4j
Graphs in Telecommunications - Jesus Barrasa, Neo4jGraphs in Telecommunications - Jesus Barrasa, Neo4j
Graphs in Telecommunications - Jesus Barrasa, Neo4j
 
Cloud cpmputing and busness processes
Cloud cpmputing and busness processesCloud cpmputing and busness processes
Cloud cpmputing and busness processes
 
Clarity It Sourcing Diagnostic Presentation
Clarity It Sourcing Diagnostic PresentationClarity It Sourcing Diagnostic Presentation
Clarity It Sourcing Diagnostic Presentation
 
Beyond Bioprocessing 4.0: The Convergence of IT, OT and Processing Technolog...
Beyond Bioprocessing 4.0:  The Convergence of IT, OT and Processing Technolog...Beyond Bioprocessing 4.0:  The Convergence of IT, OT and Processing Technolog...
Beyond Bioprocessing 4.0: The Convergence of IT, OT and Processing Technolog...
 

Kürzlich hochgeladen

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Unlocking value from data with data integration tools

  • 1. Unlocking value from data with Data Integration Tools Phil Watt, Principal Integration Architect, HP Business Intelligence Solutions, EMEA 29/04/2010 1
  • 2. Outline Introduction Business drivers – why use a DI tool? the challenge private sector public sector Background and history DI tools timeline Emerging features – and value Governance and Best Practice Selecting a tool for your situation Demonstration: Summary – followed by hands on session 29/04/2010 2
  • 3. About me 29/04/2010 3 19 years big data 10 years Data Integration tools High volume Complex business rules Governance and metadata management Clients include BSkyB BT Barclays/Barclaycard Centrica Experian John Lewis Partnership Microsoft A major UK political party Strong focus on pragmatic delivery Best practices Design patterns Tool evaluation, selection and implementation
  • 7. Data warehouse example sizes 29/04/2010 7
  • 8. Public and academic examples 29/04/2010 8 Birmingham City Council http://www.experian.co.uk/www/pages/about_us/our_clients/ http://www.qas.co.uk/company/press/new-experian-software-helps-public-sector-to-enhance-single-citizen-view-projects-503.htm University of Toulouse – academic medical research http://www.talend.com/open-source-provider/casestudy/CaseStudy_Academic_Medical_Research_EN.php
  • 9. Benefits of DI tools 29/04/2010 9
  • 10. Extract, Transform and Load 29/04/2010 10 e.g. CRM or ERP system Hub and spoke Shared DW and ETL server
  • 11. Extract, Load and Transform 29/04/2010 11 e.g. CRM or ERP system Shared DW and ETL server
  • 12. ETL versus ELT 29/04/2010 12
  • 13. Multiple sources and targets 29/04/2010 13
  • 14. DI Tools Features Timeline1995 – 2005 29/04/2010 14
  • 15. DI Tools Features Timeline from 2006 29/04/2010 15
  • 17. Gartner Magic Quadrant Taken from research document, ‘Magic Quadrant for Data Integration Tools’ Authors: Ted Friedman, Mark A. Beyer, Eric Thoo Full report available by registering at www.talend.com 29/04/2010 17 Image removed for web publication as agreed with Gartner
  • 18. Magic Quadrant Disclaimer The Magic Quadrant is copyrighted November 25, 2009 by Gartner, Inc. and is reused with permission. The Magic Quadrant is a graphical representation of a marketplace at and for a specific time period. It depicts Gartner's analysis of how certain vendors measure against criteria for that marketplace, as defined by Gartner. Gartner does not endorse any vendor, product or service depicted in the Magic Quadrant, and does not advise technology users to select only those vendors placed in the "Leaders" quadrant. The Magic Quadrant is intended solely as a research tool, and is not meant to be a specific guide to action. Gartner disclaims all warranties, express or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose. 29/04/2010 18
  • 21.
  • 22. Allocate 20 - 30% to mapping and transformation rules
  • 23. Avoid custom-coding or desktop tools
  • 24. Increase business user involvement to improve successBest Practices Mitigate Data Migration Risks and Challenges – May 2009
  • 25. Governance and the data integration lifecycle 29/04/2010 22
  • 26.
  • 27. Spend 50% of project time doing discovery, analysis, design
  • 28. Get business users involved early and often
  • 29. Use tools to accelerate and compress timescales
  • 30. Pay attention to governance and metadata
  • 33. Reduce overall cost and timescales
  • 34.
  • 35. Qualification matrix (PW ) 29/04/2010 25
  • 43. Demo metrics 29/04/2010 33 Performance Hardware – dual core 2.0Ghz Intel Centrino, 2.5Gb Ram Environment – WinXP, Oracle Express (DB) +DI tool (Expressor 2.0) 3 data sources Customers 155 MB 1000K records Today’s orders 112 MB 100K records Yesterday's orders 0.3 MB 3K records Total data volume 267 MB 1.1M records Execution time 72 seconds Throughput 3.7 MB/sec 41k/sec
  • 44. Demo features 29/04/2010 34 Developer Productivity Graphical development Semantic Rationalisation and Re-usable Business Rules Demo represents a generic business scenario XML, message queues (MSMQ) , database inputs/outputs, joins, aggregations and referential integrity management Similar features to the ATG/Integrated Basket challenges?
  • 45. Summary 29/04/2010 35 Business drivers – why use a DI tool? the challenge private sector public sector Background and history DI tools timeline Emerging features – and value Governance and Best Practice Selecting a tool for your situation Demonstration:
  • 47. References 29/04/2010 37 Curt Monashhttp://www.dbms2.com/2009/04/30/ebays-two-enormous-data-warehouses/ Wired: http://www.wired.com/wired/archive/12.04/grid.html Zdnet: http://blogs.zdnet.com/storage/?p=213 Professor Chris Bishop: http://conferences.theiet.org/lectures/turing/ Gartner http://www.gartner.com LHC data (2007): http://www-conf.slac.stanford.edu/xldb07/xldb_lhc.pdf

Hinweis der Redaktion

  1. eBay – 2 Petabytes and 6.5 PetabytesFacebook2.5 PetabytesWal-mart2.5 PetabytesYahoo> 10 Petabytes plannedLHC (Large Hadron Collider, Year 1)10 Petabytes data/yearNational ID Cards (planned estimate)>2 Terabytes
  2. Many tools have claimed this in the past
  3. 2 typesengine based, (Informatica, Ab Initio, expressor, etc)code generators (ETI, Talend, etc.)
  4. DatabasesDifferent character sets (ASCII, EBCDIC, Unicode)International characters (unicode)Queues,Web Services (SOAP, WSDL, RPC)XMLODBC/JDBC
  5. Features listed up to 2004 represent minimum marketable features for new entrants to the marketplace
  6. Describe value of each Workflow optimisation is the key driver nowEarly tools focussed on selling developer features, strengths around complexity rather than value to delivery process.
  7. Almost weekly news of M&A
  8. Example of one analyst business’s view of the DI Tools marketplaceGartner’s Magic Quadrant provides a view of eligible vendors in the marketplace.Indicates this is a mature market, with considerable global interest and healthy competitionAlso notable that HP, for example, does not have a tool in this spaceThere may be vendors not in the Magic Quadrant that are worth considering – don’t rule out vendors based on inclusion/exclusion from this report
  9. Goes much further than illustrated in this slideGovernance must apply structures to manage quality of dataEnterprises must incentivise people to maintain and improve data qualityyou cannot manage what you can’t measureMetrics must align to personal objectives