SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Insight to Action – Big Data
– Challenge and Opportunity
Smarter Business 2012


      Mobility          Smarter         Social        Smarter        Smarter
  – bring your own      Analytics    Collaboration    Security        Cities
        device




Insight to Action –      Smarter       Smarter        Smarter          Smarter
Big Data - Challenge   Commerce         Product       Process       Infrastructure
  and Opportunity      & Marketing    Innovation     Optimization   Management
                       Automation
Agenda
10:30   IBM Big Data Platform
        Flemming Bagger, Big Data Analytics Leader, Nordic
11:15   Pause
11:30   Opnå konkrete resultater med Big Data Analytics
        Lauren Walker, Big Data Analytics Leader, Europe
12:15   Frokost
13:30   Succes eller fiasko? Sådan håndteres Big Data i den finansielle sektor
        Keith Prince, EMEA Industry Solutions Executive, Financial Services, IBM
14:15   Pause
14:30   Dataindsamling og overvågning på tværs af sociale medier
        Ulrik Bo Larsen, Founder & CEO, FALCON Social
15:10   Afrunding
Agenda
10:30   IBM Big Data Platform
        Flemming Bagger, Big Data Analytics Leader, Nordic
11:15   Pause
11:30   Opnå konkrete resultater med Big Data Analytics
        Lauren Walker, Big Data Analytics Leader, Europe
12:15   Frokost
13:30   Succes eller fiasko? Sådan håndteres Big Data i den finansielle sektor
        Keith Prince, EMEA Industry Solutions Executive, Financial Services, IBM
14:15   Pause
14:30   Dataindsamling og overvågning på tværs af sociale medier
        Ulrik Bo Larsen, Founder & CEO, FALCON Social
15:10   Afrunding
Information Management


Highlight from the IBM CEO Study 2012




                                        © 2012 IBM Corporation
Information Management




                                83x

 6,000,000 users on Twitter           500,000,000 users on Twitter
    pushing out 300,000                 pushing out 400,000,000
               tweets per day                   tweets per day
                                        1333x
                                                                 © 2012 IBM Corporation
Information Management




                         In 2005 there were 1.3 billion
                         RFID
                          tags in circulation…
© 2012 IBM Corporation
Information Management



Where is big data coming from?                                                  4.6
                                               30 billion RFID
                                                  tags today
                                                                            billion
                                                                           camera
                         12+ TBs                 (1.3B in 2005)
                                                                           phones
                    of tweet data                                       world wide
                      every day

                                                                          100s of
                                                                         millions
                                                                          of GPS
       data every
? TBs of




                                                                         enabled
          day




                                                                      devices sold
                                                                          annually

                             25+ TBs of                                          2+
                               log data                                     billion
                              every day                                   people on
                                          76 million smart                 the Web
                                                                             by end
                                           meters in 2009…
                                                                              2011
                                            200M by 2014
                                                                  © 2012 IBM Corporation
Information Management

In Order to Realize New Opportunities, You Need to Think Beyond Traditional
Sources of Data

 Transactional and          Machine Data        Social Data              Enterprise
  Application Data                                                        Content




   Volume                 Velocity           Variety                Variety
   Structured             Semi-structured    Highly unstructured    Highly unstructured
   Throughput             Ingestion          Veracity               Volume


                                                                              © 2012 IBM Corporation
Information Management


The Characteristics of Big Data

      Cost efficiently             Responding to the        Collectively analyzing
      processing the               increasing Velocity      the broadening Variety
      growing Volume
         50x         35 ZB                   30 Billion
                                             RFID sensors             80% of the
                                             and counting             worlds data is
                                                                      unstructured
         2010        2020



                Establishing the         1 in 3 business leaders don’t trust
                Veracity of big          the information they use to make
                data sources             decisions


                                                                            © 2012 IBM Corporation
Information Management


 The Big Data Conundrum
 The percentage of available data an enterprise can analyze is decreasing
  proportionately to the available to that enterprise
   – Quite simply, this means as enterprises, we are getting
     “more naive” about our business over time

 Just collecting and storing “Big Data” doesn’t drive a cent
  of value to an organization’s bottom line



                                                 Data AVAILABLE to
                                                   an organization



                                                                 Data an organization
                                                                    can PROCESS
                                                                             © 2012 IBM Corporation
Information Management

Big Data is a Hot topic
- Because Technology Makes it Possible to Analyze ALL Available Data
                                         Cost effectively manage and analyze
                                          all available data in its native form
                      unstructured, structured, streaming…….Internal and external




                          Website                                                      Social Media



                               Billing
                                           ERP                                    Network Switches
                                                    CRM            RFID
                                                                                                 © 2012 IBM Corporation
Information Management

Most Client Use Cases Combine Multiple Technologies

                              Pre-processing
                                  Ingest and analyze unstructured data
                                  types and convert to structured data

                              Combine structured and unstructured analysis
                                  Augment data warehouse with additional external
                                  sources, such as social media

                              Combine high velocity and historical analysis
                                  Analyze and react to data in motion; adjust models
                                  with deep historical analysis

                              Reuse structured data for exploratory analysis
                                  Experimentation and ad-hoc analysis with structured
                                  data
                                                                         © 2012 IBM Corporation
Information Management

Business-centric Big Data enables you to start with a critical business pain and
expand the foundation for future requirements

                                                      “Big data” isn’t just a technology—it’s a
                                                       business strategy for capitalizing on
                                                       information resources
                                                      Getting started is crucial
                                                      Success at each entry point is
                                                       accelerated by products within the Big
                                                       Data platform
                                                      Build the foundation for future
                                                       requirements by expanding further into
                                                       the big data platform




14                                                                                  © 2012 IBM Corporation
Information Management


1 – Unlock Big Data
 Customer Need
    – Understand existing data sources
    – Expose the data within existing content management
      and file systems for new uses, without copying the data
      to a central location
    – Search and navigate big data from
      federated sources

 Value Statement
    – Get up and running quickly and discover and retrieve
      relevant big data
    – Use big data sources in new information-centric
      applications

 Get started with: IBM Vivisimo Velocity




                                                                © 2012 IBM Corporation
Information Management

Most Common Big Data Use Case = 360-Views
Single view of the information




                         Customer-
                         Facing
                         Professional/Kn
                         owledge
                         Worker




                                            © 2012 IBM Corporation
Information Management


2 – Analyze Raw Data
 Customer Need
    – Ingest data as-is into Hadoop and derive insight from it
    – Process large volumes of diverse data within Hadoop
    – Combine insights with the data warehouse
    – Low-cost ad-hoc analysis with Hadoop to test new
      hypothesis

 Value Statement
    – Gain new insights from a variety and combination of data
      sources
    – Overcome the prohibitively high cost of converting
      unstructured data sources to a structured format
    – Extend the value of the data warehouse by bringing in new
      types of data and driving new types of analysis
    – Experiment with analysis of different data combinations to
      modify the analytic models in the data warehouse


 Get started with: InfoSphere BigInsights



                                                                   © 2012 IBM Corporation
Information Management


3 – Simplify your Warehouse
  Customer Need
    – Business users are hampered by the poor performance
       of analytics of a general-purpose enterprise warehouse
       – queries take hours to run
    – Enterprise data warehouse is encumbered by too much
       data for too many purposes
    – Need to ingest huge volumes of structured data and run
       multiple concurrent deep analytic queries against it
    – IT needs to reduce the cost of maintaining the data
       warehouse
  Value Statement
    – Speed and Simplicity for deep analytics (Netezza)
    – 100s to 1000s users/second for operation analytics (IBM
       Smart Analytics System)
  Get started with: IBM Netezza




18                                                              © 2012 IBM Corporation
Information Management


4 – Reduce costs with Hadoop
 Customer Need
    – Reduce the overall cost to maintain data in the warehouse –
      often its seldom used and kept ‘just in case’
    – Lower costs as data grows within the data warehouse
    – Reduce expensive infrastructure used for processing and
      transformations

 Value Statement
    – Support existing and new workloads on the most cost effective
      alternative, while preserving existing access and queries
    – Lower storage costs
    – Reduce processing costs by pushing processing onto
      commodity hardware and the parallel processing of Hadoop

 Get started with: IBM InfoSphere BigInsights




                                                                      © 2012 IBM Corporation
Information Management


IBM Significantly Enhances Hadoop


                                                   IBM Innovation
• Scalable                                     • Performance & reliability
     –   New nodes can be added on the fly.       – Adaptive MapReduce, Compression,
                                                    Indexing, Flexible Scheduler
• Affordable
     – Massively parallel computing on         • Analytic Accelerators
         commodity servers
                                               • Productivity Accelerators
• Flexible                                        – Web-based UIs
     – Hadoop is schema-less, and can absorb      – Tools to leverage existing skills
         any type of data.
                                                  – End-user visualization
• Fault Tolerant                               • Enterprise Integration
     – Through MapReduce software framework
                                                  – To extend & enrich your information
                                                    supply chain.
20                                                                                © 2012 IBM Corporation
Information Management


5 – Analyze Streaming Data
                                                             Streaming Data
                                                                 Sources      Streams Computing
 Customer Need
    – Harness and process streaming
      data sources
    – Select valuable data and insights to be stored for                                                   ACTION
      further processing
    – Quickly process and analyze perishable data, and
      take timely action

 Value Statement
    – Significantly reduced processing time
      and cost – process and then store
      what’s valuable
    – React in real-time to capture opportunities before
      they expire

 Customer examples
    – Ufone – Telco Call Detail Record (CDR) analytics for
      customer churn prevention

 Get started with: InfoSphere Streams


                                                                                                  © 2012 IBM Corporation
Information Management


Entry points are accelerated by products within the big data platform

1 – Unlock Big Data                    Analytic Applications
                           BI /    Exploration / Functional Industry Predictive Content
IBM Vivisimo             Reporting Visualization   App        App                 BI /
                                                                     Analytics Analytics
                                                                                Reporting


                                    IBM Big Data Platform
                                                                                            3 – Simplify your
                           Visualization         Application          Systems               warehouse
2 – Analyze Raw Rata       & Discovery          Development          Management             Netezza
InfoSphere
BigInsights                                        Accelerators

                               Hadoop             Stream                Data
                               System            Computing            Warehouse
                                                                                            5 – Analyze Streaming
4 – Reduce costs with                                                                       Data
Hadoop                                                                                      InfoSphere Streams
InfoSphere
BigInsights                         Information Integration & Governance


22                                                                                                     © 2012 IBM Corporation
Information Management


Is Big Data imperative?




                          © 2012 IBM Corporation
Information Management




                         THINK
24                               © 2012 IBM Corporation
Agenda
10:30   IBM Big Data Platform
        Flemming Bagger, Big Data Analytics Leader, Nordic
11:15   Pause
11:30   Opnå konkrete resultater med Big Data Analytics
        Lauren Walker, Big Data Analytics Leader, Europe
12:15   Frokost
13:30   Succes eller fiasko? Sådan håndteres Big Data i den finansielle sektor
        Keith Prince, EMEA Industry Solutions Executive, Financial Services, IBM
14:15   Pause
14:30   Dataindsamling og overvågning på tværs af sociale medier
        Ulrik Bo Larsen, Founder & CEO, FALCON Social
15:10   Afrunding
Pause

Weitere ähnliche Inhalte

Was ist angesagt?

Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntelAPAC
 
Understanding The Big Data Opportunity Final
Understanding The Big Data Opportunity FinalUnderstanding The Big Data Opportunity Final
Understanding The Big Data Opportunity FinalAndrew Gregoris
 
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big DataDr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big DataGlobal Business Events
 
EDF2013 - Richard Benjamins: Big Data – Big opportunities – Big risks? And ...
EDF2013 - Richard Benjamins: Big Data –  Big opportunities –  Big risks? And ...EDF2013 - Richard Benjamins: Big Data –  Big opportunities –  Big risks? And ...
EDF2013 - Richard Benjamins: Big Data – Big opportunities – Big risks? And ...European Data Forum
 
Smarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our worldSmarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our worldKim Escherich
 
Cutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellCutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellAMD
 
The Future of ERP by Bertrand Andries
The Future of ERP by Bertrand Andries  The Future of ERP by Bertrand Andries
The Future of ERP by Bertrand Andries CONFENIS 2012
 
Delivering next generation enterprise no sql database technology
Delivering next generation enterprise no sql database technologyDelivering next generation enterprise no sql database technology
Delivering next generation enterprise no sql database technologymarcmcneill
 
Mesa Big Data 2nd Screen Final
Mesa Big Data 2nd Screen FinalMesa Big Data 2nd Screen Final
Mesa Big Data 2nd Screen FinalTripp Payne
 
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reaisCloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reaissoudW
 
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Mark Heid
 
The Evolving Realities of Digital Marketing:
The Evolving Realities of Digital Marketing: The Evolving Realities of Digital Marketing:
The Evolving Realities of Digital Marketing: RENGAN SRINIVASAN
 
Smarter Analytics: Big Data and Predictive Governance
Smarter Analytics: Big Data and Predictive GovernanceSmarter Analytics: Big Data and Predictive Governance
Smarter Analytics: Big Data and Predictive GovernanceIBM Danmark
 
The new normal in business intelligence
The new normal in business intelligenceThe new normal in business intelligence
The new normal in business intelligenceJohan Blomme
 
Progress with confidence into next generation IT
Progress with confidence into next generation ITProgress with confidence into next generation IT
Progress with confidence into next generation ITPaul Muller
 
Trends in business intelligence 2012
Trends in business intelligence 2012Trends in business intelligence 2012
Trends in business intelligence 2012Johan Blomme
 
Big Data = Big Decisions
Big Data = Big DecisionsBig Data = Big Decisions
Big Data = Big DecisionsInnoTech
 
600 Minutes Public IT: Smarter Cities
600 Minutes Public IT: Smarter Cities600 Minutes Public IT: Smarter Cities
600 Minutes Public IT: Smarter CitiesKim Escherich
 

Was ist angesagt? (20)

Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
 
Understanding The Big Data Opportunity Final
Understanding The Big Data Opportunity FinalUnderstanding The Big Data Opportunity Final
Understanding The Big Data Opportunity Final
 
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big DataDr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
Dr. Shahbaz Ali, CEO at Tarmin - Business Transformation in the Age of Big Data
 
EDF2013 - Richard Benjamins: Big Data – Big opportunities – Big risks? And ...
EDF2013 - Richard Benjamins: Big Data –  Big opportunities –  Big risks? And ...EDF2013 - Richard Benjamins: Big Data –  Big opportunities –  Big risks? And ...
EDF2013 - Richard Benjamins: Big Data – Big opportunities – Big risks? And ...
 
Smarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our worldSmarter Planet: How Big Data changes our world
Smarter Planet: How Big Data changes our world
 
Cutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellCutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and Dell
 
The Future of ERP by Bertrand Andries
The Future of ERP by Bertrand Andries  The Future of ERP by Bertrand Andries
The Future of ERP by Bertrand Andries
 
Delivering next generation enterprise no sql database technology
Delivering next generation enterprise no sql database technologyDelivering next generation enterprise no sql database technology
Delivering next generation enterprise no sql database technology
 
Mesa Big Data 2nd Screen Final
Mesa Big Data 2nd Screen FinalMesa Big Data 2nd Screen Final
Mesa Big Data 2nd Screen Final
 
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reaisCloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
 
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
 
The Evolving Realities of Digital Marketing:
The Evolving Realities of Digital Marketing: The Evolving Realities of Digital Marketing:
The Evolving Realities of Digital Marketing:
 
Smarter Analytics: Big Data and Predictive Governance
Smarter Analytics: Big Data and Predictive GovernanceSmarter Analytics: Big Data and Predictive Governance
Smarter Analytics: Big Data and Predictive Governance
 
The new normal in business intelligence
The new normal in business intelligenceThe new normal in business intelligence
The new normal in business intelligence
 
Progress with confidence into next generation IT
Progress with confidence into next generation ITProgress with confidence into next generation IT
Progress with confidence into next generation IT
 
IBM Stream au Hadoop User Group
IBM Stream au Hadoop User GroupIBM Stream au Hadoop User Group
IBM Stream au Hadoop User Group
 
Trends in business intelligence 2012
Trends in business intelligence 2012Trends in business intelligence 2012
Trends in business intelligence 2012
 
Cloud conf2012
Cloud conf2012Cloud conf2012
Cloud conf2012
 
Big Data = Big Decisions
Big Data = Big DecisionsBig Data = Big Decisions
Big Data = Big Decisions
 
600 Minutes Public IT: Smarter Cities
600 Minutes Public IT: Smarter Cities600 Minutes Public IT: Smarter Cities
600 Minutes Public IT: Smarter Cities
 

Ähnlich wie Konceptuelt overblik over Big Data, Flemming Bagger, IBM

Ähnlich wie Konceptuelt overblik over Big Data, Flemming Bagger, IBM (20)

IBM-Why Big Data?
IBM-Why Big Data?IBM-Why Big Data?
IBM-Why Big Data?
 
Big data 20120327
Big data 20120327Big data 20120327
Big data 20120327
 
Bigdata final(이지은)
Bigdata final(이지은)Bigdata final(이지은)
Bigdata final(이지은)
 
Big Data: Industry trends and key players
Big Data: Industry trends and key playersBig Data: Industry trends and key players
Big Data: Industry trends and key players
 
IBM Big Data Platform Nov 2012
IBM Big Data Platform Nov 2012IBM Big Data Platform Nov 2012
IBM Big Data Platform Nov 2012
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 
Accelerate Return on Data
Accelerate Return on DataAccelerate Return on Data
Accelerate Return on Data
 
big-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptxbig-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptx
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Do More With Less with DB2 for z/OS
Do More With Less with DB2 for z/OSDo More With Less with DB2 for z/OS
Do More With Less with DB2 for z/OS
 
Big Data and Cloud Analytics
Big Data and Cloud AnalyticsBig Data and Cloud Analytics
Big Data and Cloud Analytics
 
Big Data in Asia
Big Data in AsiaBig Data in Asia
Big Data in Asia
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
 
bigdata.pptx
bigdata.pptxbigdata.pptx
bigdata.pptx
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forum
 
Big data
Big dataBig data
Big data
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
 
IBM Spain BP Storage Day Inigo Osoro
IBM Spain BP Storage Day    Inigo OsoroIBM Spain BP Storage Day    Inigo Osoro
IBM Spain BP Storage Day Inigo Osoro
 
The New Enterprise Data Platform
The New Enterprise Data PlatformThe New Enterprise Data Platform
The New Enterprise Data Platform
 
The future of bi isn't a bi tool
The future of bi isn't a bi toolThe future of bi isn't a bi tool
The future of bi isn't a bi tool
 

Mehr von IBM Danmark

DevOps, Development and Operations, Tina McGinley
DevOps, Development and Operations, Tina McGinleyDevOps, Development and Operations, Tina McGinley
DevOps, Development and Operations, Tina McGinleyIBM Danmark
 
Velkomst, Universitetssporet 2013, Pia Rønhøj
Velkomst, Universitetssporet 2013, Pia RønhøjVelkomst, Universitetssporet 2013, Pia Rønhøj
Velkomst, Universitetssporet 2013, Pia RønhøjIBM Danmark
 
Smarter Commerce, Salg og Marketing, Thomas Steglich-Andersen
Smarter Commerce, Salg og Marketing, Thomas Steglich-AndersenSmarter Commerce, Salg og Marketing, Thomas Steglich-Andersen
Smarter Commerce, Salg og Marketing, Thomas Steglich-AndersenIBM Danmark
 
Mobile, Philip Nyborg
Mobile, Philip NyborgMobile, Philip Nyborg
Mobile, Philip NyborgIBM Danmark
 
IT innovation, Kim Escherich
IT innovation, Kim EscherichIT innovation, Kim Escherich
IT innovation, Kim EscherichIBM Danmark
 
Echo.IT, Stefan K. Madsen
Echo.IT, Stefan K. MadsenEcho.IT, Stefan K. Madsen
Echo.IT, Stefan K. MadsenIBM Danmark
 
Big Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter JönssonBig Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter JönssonIBM Danmark
 
Social Business, Alice Bayer
Social Business, Alice BayerSocial Business, Alice Bayer
Social Business, Alice BayerIBM Danmark
 
Numascale Product IBM
Numascale Product IBMNumascale Product IBM
Numascale Product IBMIBM Danmark
 
Intel HPC Update
Intel HPC UpdateIntel HPC Update
Intel HPC UpdateIBM Danmark
 
IBM general parallel file system - introduction
IBM general parallel file system - introductionIBM general parallel file system - introduction
IBM general parallel file system - introductionIBM Danmark
 
NeXtScale HPC seminar
NeXtScale HPC seminarNeXtScale HPC seminar
NeXtScale HPC seminarIBM Danmark
 
Future of Power: PowerLinux - Jan Kristian Nielsen
Future of Power: PowerLinux - Jan Kristian NielsenFuture of Power: PowerLinux - Jan Kristian Nielsen
Future of Power: PowerLinux - Jan Kristian NielsenIBM Danmark
 
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve SibleyFuture of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve SibleyIBM Danmark
 
Future of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnFuture of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnIBM Danmark
 
Future of Power: IBM PureFlex - Kim Mortensen
Future of Power: IBM PureFlex - Kim MortensenFuture of Power: IBM PureFlex - Kim Mortensen
Future of Power: IBM PureFlex - Kim MortensenIBM Danmark
 
Future of Power: IBM Trends & Directions - Erik Rex
Future of Power: IBM Trends & Directions - Erik RexFuture of Power: IBM Trends & Directions - Erik Rex
Future of Power: IBM Trends & Directions - Erik RexIBM Danmark
 
Future of Power: Håndtering af nye teknologier - Kim Escherich
Future of Power: Håndtering af nye teknologier - Kim EscherichFuture of Power: Håndtering af nye teknologier - Kim Escherich
Future of Power: Håndtering af nye teknologier - Kim EscherichIBM Danmark
 
Future of Power - Lars Mikkelgaard-Jensen
Future of Power - Lars Mikkelgaard-JensenFuture of Power - Lars Mikkelgaard-Jensen
Future of Power - Lars Mikkelgaard-JensenIBM Danmark
 

Mehr von IBM Danmark (20)

DevOps, Development and Operations, Tina McGinley
DevOps, Development and Operations, Tina McGinleyDevOps, Development and Operations, Tina McGinley
DevOps, Development and Operations, Tina McGinley
 
Velkomst, Universitetssporet 2013, Pia Rønhøj
Velkomst, Universitetssporet 2013, Pia RønhøjVelkomst, Universitetssporet 2013, Pia Rønhøj
Velkomst, Universitetssporet 2013, Pia Rønhøj
 
Smarter Commerce, Salg og Marketing, Thomas Steglich-Andersen
Smarter Commerce, Salg og Marketing, Thomas Steglich-AndersenSmarter Commerce, Salg og Marketing, Thomas Steglich-Andersen
Smarter Commerce, Salg og Marketing, Thomas Steglich-Andersen
 
Mobile, Philip Nyborg
Mobile, Philip NyborgMobile, Philip Nyborg
Mobile, Philip Nyborg
 
IT innovation, Kim Escherich
IT innovation, Kim EscherichIT innovation, Kim Escherich
IT innovation, Kim Escherich
 
Echo.IT, Stefan K. Madsen
Echo.IT, Stefan K. MadsenEcho.IT, Stefan K. Madsen
Echo.IT, Stefan K. Madsen
 
Big Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter JönssonBig Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter Jönsson
 
Social Business, Alice Bayer
Social Business, Alice BayerSocial Business, Alice Bayer
Social Business, Alice Bayer
 
Numascale Product IBM
Numascale Product IBMNumascale Product IBM
Numascale Product IBM
 
Mellanox IBM
Mellanox IBMMellanox IBM
Mellanox IBM
 
Intel HPC Update
Intel HPC UpdateIntel HPC Update
Intel HPC Update
 
IBM general parallel file system - introduction
IBM general parallel file system - introductionIBM general parallel file system - introduction
IBM general parallel file system - introduction
 
NeXtScale HPC seminar
NeXtScale HPC seminarNeXtScale HPC seminar
NeXtScale HPC seminar
 
Future of Power: PowerLinux - Jan Kristian Nielsen
Future of Power: PowerLinux - Jan Kristian NielsenFuture of Power: PowerLinux - Jan Kristian Nielsen
Future of Power: PowerLinux - Jan Kristian Nielsen
 
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve SibleyFuture of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
 
Future of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnFuture of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren Ravn
 
Future of Power: IBM PureFlex - Kim Mortensen
Future of Power: IBM PureFlex - Kim MortensenFuture of Power: IBM PureFlex - Kim Mortensen
Future of Power: IBM PureFlex - Kim Mortensen
 
Future of Power: IBM Trends & Directions - Erik Rex
Future of Power: IBM Trends & Directions - Erik RexFuture of Power: IBM Trends & Directions - Erik Rex
Future of Power: IBM Trends & Directions - Erik Rex
 
Future of Power: Håndtering af nye teknologier - Kim Escherich
Future of Power: Håndtering af nye teknologier - Kim EscherichFuture of Power: Håndtering af nye teknologier - Kim Escherich
Future of Power: Håndtering af nye teknologier - Kim Escherich
 
Future of Power - Lars Mikkelgaard-Jensen
Future of Power - Lars Mikkelgaard-JensenFuture of Power - Lars Mikkelgaard-Jensen
Future of Power - Lars Mikkelgaard-Jensen
 

Kürzlich hochgeladen

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Kürzlich hochgeladen (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Konceptuelt overblik over Big Data, Flemming Bagger, IBM

  • 1. Insight to Action – Big Data – Challenge and Opportunity
  • 2. Smarter Business 2012 Mobility Smarter Social Smarter Smarter – bring your own Analytics Collaboration Security Cities device Insight to Action – Smarter Smarter Smarter Smarter Big Data - Challenge Commerce Product Process Infrastructure and Opportunity & Marketing Innovation Optimization Management Automation
  • 3. Agenda 10:30 IBM Big Data Platform Flemming Bagger, Big Data Analytics Leader, Nordic 11:15 Pause 11:30 Opnå konkrete resultater med Big Data Analytics Lauren Walker, Big Data Analytics Leader, Europe 12:15 Frokost 13:30 Succes eller fiasko? Sådan håndteres Big Data i den finansielle sektor Keith Prince, EMEA Industry Solutions Executive, Financial Services, IBM 14:15 Pause 14:30 Dataindsamling og overvågning på tværs af sociale medier Ulrik Bo Larsen, Founder & CEO, FALCON Social 15:10 Afrunding
  • 4. Agenda 10:30 IBM Big Data Platform Flemming Bagger, Big Data Analytics Leader, Nordic 11:15 Pause 11:30 Opnå konkrete resultater med Big Data Analytics Lauren Walker, Big Data Analytics Leader, Europe 12:15 Frokost 13:30 Succes eller fiasko? Sådan håndteres Big Data i den finansielle sektor Keith Prince, EMEA Industry Solutions Executive, Financial Services, IBM 14:15 Pause 14:30 Dataindsamling og overvågning på tværs af sociale medier Ulrik Bo Larsen, Founder & CEO, FALCON Social 15:10 Afrunding
  • 5. Information Management Highlight from the IBM CEO Study 2012 © 2012 IBM Corporation
  • 6. Information Management 83x 6,000,000 users on Twitter 500,000,000 users on Twitter pushing out 300,000 pushing out 400,000,000 tweets per day tweets per day 1333x © 2012 IBM Corporation
  • 7. Information Management In 2005 there were 1.3 billion RFID tags in circulation… © 2012 IBM Corporation
  • 8. Information Management Where is big data coming from? 4.6 30 billion RFID tags today billion camera 12+ TBs (1.3B in 2005) phones of tweet data world wide every day 100s of millions of GPS data every ? TBs of enabled day devices sold annually 25+ TBs of 2+ log data billion every day people on 76 million smart the Web by end meters in 2009… 2011 200M by 2014 © 2012 IBM Corporation
  • 9. Information Management In Order to Realize New Opportunities, You Need to Think Beyond Traditional Sources of Data Transactional and Machine Data Social Data Enterprise Application Data Content  Volume  Velocity  Variety  Variety  Structured  Semi-structured  Highly unstructured  Highly unstructured  Throughput  Ingestion  Veracity  Volume © 2012 IBM Corporation
  • 10. Information Management The Characteristics of Big Data Cost efficiently Responding to the Collectively analyzing processing the increasing Velocity the broadening Variety growing Volume 50x 35 ZB 30 Billion RFID sensors 80% of the and counting worlds data is unstructured 2010 2020 Establishing the 1 in 3 business leaders don’t trust Veracity of big the information they use to make data sources decisions © 2012 IBM Corporation
  • 11. Information Management The Big Data Conundrum  The percentage of available data an enterprise can analyze is decreasing proportionately to the available to that enterprise – Quite simply, this means as enterprises, we are getting “more naive” about our business over time  Just collecting and storing “Big Data” doesn’t drive a cent of value to an organization’s bottom line Data AVAILABLE to an organization Data an organization can PROCESS © 2012 IBM Corporation
  • 12. Information Management Big Data is a Hot topic - Because Technology Makes it Possible to Analyze ALL Available Data Cost effectively manage and analyze all available data in its native form unstructured, structured, streaming…….Internal and external Website Social Media Billing ERP Network Switches CRM RFID © 2012 IBM Corporation
  • 13. Information Management Most Client Use Cases Combine Multiple Technologies Pre-processing Ingest and analyze unstructured data types and convert to structured data Combine structured and unstructured analysis Augment data warehouse with additional external sources, such as social media Combine high velocity and historical analysis Analyze and react to data in motion; adjust models with deep historical analysis Reuse structured data for exploratory analysis Experimentation and ad-hoc analysis with structured data © 2012 IBM Corporation
  • 14. Information Management Business-centric Big Data enables you to start with a critical business pain and expand the foundation for future requirements  “Big data” isn’t just a technology—it’s a business strategy for capitalizing on information resources  Getting started is crucial  Success at each entry point is accelerated by products within the Big Data platform  Build the foundation for future requirements by expanding further into the big data platform 14 © 2012 IBM Corporation
  • 15. Information Management 1 – Unlock Big Data  Customer Need – Understand existing data sources – Expose the data within existing content management and file systems for new uses, without copying the data to a central location – Search and navigate big data from federated sources  Value Statement – Get up and running quickly and discover and retrieve relevant big data – Use big data sources in new information-centric applications  Get started with: IBM Vivisimo Velocity © 2012 IBM Corporation
  • 16. Information Management Most Common Big Data Use Case = 360-Views Single view of the information Customer- Facing Professional/Kn owledge Worker © 2012 IBM Corporation
  • 17. Information Management 2 – Analyze Raw Data  Customer Need – Ingest data as-is into Hadoop and derive insight from it – Process large volumes of diverse data within Hadoop – Combine insights with the data warehouse – Low-cost ad-hoc analysis with Hadoop to test new hypothesis  Value Statement – Gain new insights from a variety and combination of data sources – Overcome the prohibitively high cost of converting unstructured data sources to a structured format – Extend the value of the data warehouse by bringing in new types of data and driving new types of analysis – Experiment with analysis of different data combinations to modify the analytic models in the data warehouse  Get started with: InfoSphere BigInsights © 2012 IBM Corporation
  • 18. Information Management 3 – Simplify your Warehouse  Customer Need – Business users are hampered by the poor performance of analytics of a general-purpose enterprise warehouse – queries take hours to run – Enterprise data warehouse is encumbered by too much data for too many purposes – Need to ingest huge volumes of structured data and run multiple concurrent deep analytic queries against it – IT needs to reduce the cost of maintaining the data warehouse  Value Statement – Speed and Simplicity for deep analytics (Netezza) – 100s to 1000s users/second for operation analytics (IBM Smart Analytics System)  Get started with: IBM Netezza 18 © 2012 IBM Corporation
  • 19. Information Management 4 – Reduce costs with Hadoop  Customer Need – Reduce the overall cost to maintain data in the warehouse – often its seldom used and kept ‘just in case’ – Lower costs as data grows within the data warehouse – Reduce expensive infrastructure used for processing and transformations  Value Statement – Support existing and new workloads on the most cost effective alternative, while preserving existing access and queries – Lower storage costs – Reduce processing costs by pushing processing onto commodity hardware and the parallel processing of Hadoop  Get started with: IBM InfoSphere BigInsights © 2012 IBM Corporation
  • 20. Information Management IBM Significantly Enhances Hadoop IBM Innovation • Scalable • Performance & reliability – New nodes can be added on the fly. – Adaptive MapReduce, Compression, Indexing, Flexible Scheduler • Affordable – Massively parallel computing on • Analytic Accelerators commodity servers • Productivity Accelerators • Flexible – Web-based UIs – Hadoop is schema-less, and can absorb – Tools to leverage existing skills any type of data. – End-user visualization • Fault Tolerant • Enterprise Integration – Through MapReduce software framework – To extend & enrich your information supply chain. 20 © 2012 IBM Corporation
  • 21. Information Management 5 – Analyze Streaming Data Streaming Data Sources Streams Computing  Customer Need – Harness and process streaming data sources – Select valuable data and insights to be stored for ACTION further processing – Quickly process and analyze perishable data, and take timely action  Value Statement – Significantly reduced processing time and cost – process and then store what’s valuable – React in real-time to capture opportunities before they expire  Customer examples – Ufone – Telco Call Detail Record (CDR) analytics for customer churn prevention  Get started with: InfoSphere Streams © 2012 IBM Corporation
  • 22. Information Management Entry points are accelerated by products within the big data platform 1 – Unlock Big Data Analytic Applications BI / Exploration / Functional Industry Predictive Content IBM Vivisimo Reporting Visualization App App BI / Analytics Analytics Reporting IBM Big Data Platform 3 – Simplify your Visualization Application Systems warehouse 2 – Analyze Raw Rata & Discovery Development Management Netezza InfoSphere BigInsights Accelerators Hadoop Stream Data System Computing Warehouse 5 – Analyze Streaming 4 – Reduce costs with Data Hadoop InfoSphere Streams InfoSphere BigInsights Information Integration & Governance 22 © 2012 IBM Corporation
  • 23. Information Management Is Big Data imperative? © 2012 IBM Corporation
  • 24. Information Management THINK 24 © 2012 IBM Corporation
  • 25. Agenda 10:30 IBM Big Data Platform Flemming Bagger, Big Data Analytics Leader, Nordic 11:15 Pause 11:30 Opnå konkrete resultater med Big Data Analytics Lauren Walker, Big Data Analytics Leader, Europe 12:15 Frokost 13:30 Succes eller fiasko? Sådan håndteres Big Data i den finansielle sektor Keith Prince, EMEA Industry Solutions Executive, Financial Services, IBM 14:15 Pause 14:30 Dataindsamling og overvågning på tværs af sociale medier Ulrik Bo Larsen, Founder & CEO, FALCON Social 15:10 Afrunding
  • 26. Pause

Hinweis der Redaktion

  1. Nothing illustrates the breakthrough of Twitter than a simple comparison between the Olympic Games. To put it in perspective, if you look at the time between the Beijing Olympics in 2008 and the 2012 London games +CLICK+ as we start the London Olympics, there are over 500-million active users on Twitter, pushing out over 400 million tweets a day. This is a massive increase from the six million Twitter users during the Beijing Olympics in 2008 pushing out about 300,000 tweets per day. How big? +CLICK+ The number of Twitter users between these two periods in time have increase by 83X and the number of tweets by a whopping 1333X!
  2. C&A is a Brazilian retailer that has ‘Smart Hangers’ where shoppers can Like a piece of clothing and see the amount of Likes an object has on Facebook. On the C&A Web site, each piece of clothing has it’s own post and the Likes just keep piling up. While folks can get push happy, so it’s tough to tell just how popular something is, the point here is where we are headed.
  3. Obviously, there are many other forms of data. Let ’ s start with the hottest topic associated with Big Data today: social networks. Twitter generates about 12 terabytes a day of tweet data – which is every single day. Now, keep in mind, these numbers are hard to keep accurate, so the point is that they ’ re big , right? So don ’ t fixate on the actual number because they change all the time and realize that even if these numbers are out of date by 2 years, it ’ s at a point where it ’ s too staggering to handle exclusively using traditional approaches. +CLICK+ Facebook over a year ago was generating 25 terabytes of log data every day ( Facebook log data reference: http://www.datacenterknowledge.com/archives/2009/04/17/a-look-inside-facebooks-data-center/ ) and probably about 7 to 8 terabytes of data that goes up on the Internet. +CLICK+ Google, who knows? Look at Google Plus, YouTube, Google Maps, and all that kind of stuff. So that ’ s the left hand of this chart – the social network layer. +CLICK+ Now let ’ s get back to instrumentation: there are massive amounts of proliferated technologies that allow us to be more interconnected than in the history of the world – and it just isn ’ t P2P (people to people) interconnections, it ’ s M2M (machine to machine) as well. Again, with these numbers, who cares what the current number is, I try to keep them updated, but it ’ s the point that even if they are out of date, it ’ s almost unimaginable how large these numbers are. Over 4.6 billion camera phones that leverage built-in GPD to tag your location or your photos, purpose built GPS devices, smart metres. If you recall the bridge that collapsed in Minneapolis a number of years ago in the USA, it was rebuilt with smart sensors inside it that measure the contraction of the concrete based on weather conditions, ice build up, and so much more. So I didn ’ t realise how true it was when Sam P launched Smart Planet: I thought it was a marketing play. But truly the world is more instrumented, interconnected, and intelligent than it ’ s ever been before and this capability allows us to address new problems and gain new insight never before thought possible and that ’ s what the Big Data opportunity is going to be all about!
  4. Big data comes from many sources. Its much more than traditional data sources. And it order to capitalize on the breakthrough opportunities we’ve discussed, you definitely need to look beyond traditional data sources. But at the same time, don’t forget that big data comes from those traditional sources too. Transactional data and application data is growing an a significant rate. Although it’s structured, that data is large and it is contained in many different structures. Big data includes machine data – logs, web logs, instrumentation data, network data. Data generated by machines is multiplying quickly, and it contains valuable insights that need to be discovered. Social data also needs to be incorporated. Most social data is really textual data. And the valuable insights remain locked within that text and its many possible meanings. And most of that data isn’t valuable, or has a very short expiry date during which it is valuable. That makes social data very challenging – extracting insight from largely textual content in very little time. And enterprise content must be amalgamated as well. And that data comes in many forms, and also in significant volume.
  5. Big data has 4 key characteristics. The first is volume. Of course this may seem obvious, but it is complex that you may think. Yes the volume of data is growing. Experts predict that the volume of data in the world will grow to 25 Zettabytes in 2020. That same phenomenon affects every business – their data is growing at the same exponential rate too. But it isn’t jus the volume of data that is growing. It’s the number of sources of that data. And that leads to the third characteristic of big data, variety, which we will cover later. Data is increasingly accelerating the velocity at which it is created and at which it is integrated. We’ve moved from batch to a real-time business. Data comes at you at a record or a byte level, not always in bulk. And the demands of the business have increased as well – from an answer next week to an answer in a minute. And the world is also becoming more instrumented and interconnected. The volume of data streaming off those instruments is exponentially larger than it was even 2 years ago. Variety presents an equally difficult challenge. The growth in data sources has fuelled the growth in data types. In fact, 80% of the worlds data is unstructured. Yet most traditional methods apply analytics only to structured information. And finally we have veracity. How can you act upon information if you don’t trust it. Establishing trust in big data presents a huge challenge as the sources and the variety grows.
  6. In this slide you can see a graph – it ’ s not to scale, but you get the point – and this graph shows that the percentage of data available to an enterprise is growing enormously; you can see that at the top bar. And as the amount of data available to an organization grows, the percentage of data that the organization can actually process is decreasing. It ’ s kind of like we ’ re getting “ dumber ” as organizations – in terms of proportion of measurement to the data we are collecting - are understanding less and less of it.   +CLICK+ I call the shaded area between these opposite trending lines “ The Blind Spot ” : it contains signals and noise. This area has got all this data in there, and perhaps it would make sense for us to ingest this into our traditional analytic systems, but we don ’ t know if that data will yield value or not – it ’ s a blind spot. We have a hunch that there is value in there, but truly we have no idea what ’ s in the shaded area. Furthermore, while we know there is value in here, we know it ’ s not all going to be useful, so how do we sift through the noise to find the signals? We can start ingesting 10 TB of data a day , ask the CIO for her or his approval for triple OPEX and CAPEX costs on a hunch? So we have to find a way to find the signals within all the noise in a cost effective manner.   Now if we can leverage some new approach to find the value in the blind spot, at a relatively low cost, if we could tie together things like Big Data social media around our core trusted information that we know about our customers, and drop the stuff that isn ’ t related to what the business is trying to accomplish, you could really start to monetize that relationship and intents - not just transactions. And that ’ s the difference, right? Do we monetize intent and relationships? - And that ’ s a problem domain that includes Big Data.   In the previous paragraph I just gave a ubiquitous example, since social media is so obviously tied to Big Data. But you can imagine this dichotomy in any industry. For example, think Oil and Gas (O&G) drill well readings streaming in – and wanting to apply analytics to that with geological data that is unstructured and comes from other sources in various formats and is likely often changing (from an attribute perspective). Harvesting wind energy, traffic patterns, and more.
  7. Another reason that big data is a hot topic in the market today is the new technology that enables an organization to take advantage of the natural resource of big data. Big data itself isn’t new – its been here for a while and growing exponentially. What is new is the technology to process and analyze it. The purpose of big data technology is to cost effectively manage and analyze all of the available data. Any data, as is. If you want to analyze structured data, then structure it. If you want to analyze an acoustic file, then analyze the acoustic file with appropriate analytics. You’ll see the wide variety of sources of big data. It comes from our traditional systems – Billing systems, ERP systems, CRM systems. It also comes from machine data – from RFID tags, sensors, network switches. And it comes from humans – website data, social media, etc.
  8. Key Points Many use cases require multiple technologies to address big data challenges Pre-processing – to ingest multiple data types, structuring data, identify insights, then store those insights in a structured DW Combined structured and unstructured – having a structured DW and unstructured Hadoop system analyzing data and sharing insights back and forth High velocity and historical – stream computing to analyze in motion data and store insights in structured DW for deeper insights and/or reporting Reuse structured – unload structured data into Hadoop and experiment – some companies have found entirely new uses for data that could become new service offerings (e.g,. A large bank discovered that they can profile their client base by financial profile and potentially offer a service to tell customers how they rate vs. their profile – e.g., you have 20% higher mortgage than clients in your fin profile)
  9. Let’s first look at unlocking big data. The customer need is to understand existing data sources without moving any of the data – to discover, navigate, view, and search big data in a federated manner. One customer was able to get up and running in a few months to search and navigate big data across many existing sources of big data. This type of implementation can yield significant business value - from cutting manual efforts to search and retrieve big data, to gaining a better understanding of existing sources of big data before further analysis. The payback period is often short. Customer example – Proctor and Gamble …. The entry point in the big data platform is Vivisimo Velocity – it enables federated search and navigation.
  10. Next we have a pain point around analyzing raw data. The primary need is to analyze unstructured, or semi-structured, data from one or multiple sources. Often the content is textual – and the meaning is hidden within the text. Another common need is to combine different data types – structured and unstructured – for combined analysis. Customers often gain significant value in this approach – they unlock insights that were previously unknown. Those insights can be the key to retaining a valuable customer, to identifying a previously undetected fraud, or discovering a game-changing efficiency in operational processes. One client, a financial services regulatory organization, analyzed a variety of new data sources and integrated the insights with their existing data warehouse to further enhance their risk modeling processes. The big data platform entry point is InfoSphere BigInsights, a Hadoop-based analytics system.
  11. Often data warehouse environments are anything but simple. Warehouses can become glutted with data and not be well-suited to any one particular task. Often, organizations will be hampered by poor performance of analytics – queries will take hours or even days to run. And the cost of the data warehouse and improving performance can be prohibitively high. The value is striking. Many organizations realize a 10 to 100 times performance boost on deep analytics. Queries that took hours now take minutes. So the cost and performance is significant – and the efficiency of employees is boosted. Its also extremely simple to install and administer, yielding significantly lower administration costs. One customer example is Catalina marketing – who executes 10x the amount of predictive workloads with the same staffing level. The entry point for this pain point is IBM Netezza.
  12. Hadoop is a cost-efficient platform and it has the ability to significantly lower the cost of certain workloads. Organizations may have particular pain around reducing the overall cost of their data warehouse. Certain groups of data may be seldom used and possible candidates to offload to a lower-cost platform. Certain operations such as transformations may be able to be offloaded to a more cost efficient platform. The primary area of value creation is cost savings. By pushing workloads and data sets onto a Hadoop platform, organizations are able to preserve their queries and take advantage of Hadoop’s cost-effective processing capabilities. One customer example, a financial services firm, moved processing of applications and reports from an operational data warehouse to Hadoop Hbase; they were able to preserve their existing queries and reduce the operating costs of their data management platform. The entry point for this pain is InfoSphere BigInsights – IBM’s Hadoop-based product.
  13. Key Points Hadoop is not a product but an open source framework for more cost effectively and efficiently analyzing large amounts of structured and unstructured data. However, to use open source Hadoop requires the download, installation, configuration, and maintenance of a myriad of different software pieces (Hadoop, MapReduce, Hive, Pig, Hbase, etc). On the left, you see the characteristics that make Hadoop different and so valuable for analyzing big data. Some vendors try to simplify the installation and configuration of the Hadoop framework and projects by prepackaging all the components into a single “distribution” without providing any real added value. IBM’s approach to Hadoop is different. On the right, you see what the innovations and enhancements we added to our BigInsights hadoop-analytics product making it significantly better for enterprises than open source Hadoop: In the area of performance and reliability, we’ve added ground-breaking innovations: Our “Adaptive MapReduce” that speeds up MapReduce workloads by enabling dynamic changes to resource utilization (CPU, disk space, memory) without human intervention. Without this innovation, Hadoop users would need to monitor their MapReduce workloads and manually turn the configuration “knobs” to adjust the resource utilization. Compression enhancements that reduce storage needs/costs as well as query time Indexing reduces the latency on text searches And, our Workload Scheduler capability that makes it easy to schedule and optimize Hadoop analytics runs Other enhancements: Accelerators of prepackaged content and knowledge (best practice patterns) to solve discrete big data problems UIs and tooling needed by data scientists, developers, and administrators to minimize the Hadoop learning curve Our of the box integration connectors to access any type of data type and source Security to control data access – critical to maintain data privacy and protect confidential data
  14. Customers often have many sources of streaming data, yet they are unable to take full advantage of them. Sometimes its because there is simply too much data to collect and store before analyzing it. Or it may be because of timing – by the time they store data on disk, analyze it, and respond – it’s too late. They need a way to harness the natural resource of streaming data and turn it into actionable insight. The benefits of streaming analytics are immediately obvious. Dramatic cost savings by analyzing data and only storing what is necessary. The ability to detect and make real-time decisions, resulting in customer retention to detecting fraud to cross-selling a product. One client, Ufone, analyzed Call Detail Records as data streamed off their network. By analyzing CDRs in real-time, they were able to detect potential customer service issues and proactively respond, thereby reducing customer churn. The entry point to the big data platform is InfoSphere Streams, which is often accompanied by a system to persist insights and perform deeper analysis to adjust the streaming analytic models – either Netezza or InfoSphere BigInsights.
  15. There are many entry points to the big data platform. It isn’t a one-time, one-size-fits-all proposition. There are many entry points to the big data platform – illustrated on this slide and in the previous slides. <Read pains and entry points to re-iterate>. They key point is that clients will start with one pain and entry point, and adopt others over time. And there is a benefit to doing so – they may leverage reusable aspects of the platform as they adopt new capabilities – sharing analytics, accelerators, etc. from one implementation to the next. And that is the power of the platform – the ability to leverage from one project to the next and to go faster .