SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Big Data Analytics
– Challenge and Opportunity
83x


6,000,000 users on Twitter         500,000,000 users on Twitter
   pushing out 300,000               pushing out 400,000,000
      tweets per day                         tweets per day
                                     1333x
Where is big data coming from?
                                                                           4.6
                                                   30 billion RFID
                                                                       billion
                         12+ TBs                       tags today
                                                                        camera
                        of tweet data                (1.3B in 2005)
                                                                        phones
                         every day                                       world
                                                                          wide

                                                                       100s of
                                                                      millions
           data every
? TBs of




                                                                       of GPS
              day




                                                                      enabled
                                                                        devices
                                                                           sold
                               25+ TBs                                 annually
                                    of                                      2+
                                 log data                              billion
                                every day                                people
                                            76 million smart meters      on the
                                             in 2009… 200M by 2014      Web by
                                                                       end 2011
The Characteristics of Big Data
Cost efficiently           Responding to the                     Collectively analyzing
processing the
                           increasing Velocity                   the broadening Variety
growing Volume
   50x           35 ZB                     30 Billion
                                           RFID                                 80%   of the
                                           sensors and                          worlds data is
                                           counting                             unstructured
 2010     2020



        Establishing the   By 2015, 80% of all available data will be uncertain
                           -   The number of networked devices will be double the entire
        Veracity of big        global population
        data sources       - The total number of social media accounts exceeds the entire
                             global population
Big Data is a Hot topic
- Because it is possible to Analyze ALL Available Data
• The percentage of available data an enterprise can analyze is decreasing proportionately to
  the available to that enterprise
– Quite simply, this means as enterprises, we are getting ―more naive‖ about our business over time
• Just collecting and storing “Big Data” doesn’t drive a cent of value to an organization’s
  bottom line
• Cost effectively manage and analyze ALL available data in its native form
  unstructured, structured, realtime streaming…….Internal and external

                                                    Data AVAILABLE to
                                                     an organization



                                                                                     Data an organization
                                                                                        can PROCESS
Business-centric Big Data Platform

                                • ―Big data‖ isn’t just a technology
                                  —it’s a business strategy for
                                  capitalizing on information resources

                                • Getting started is crucial

                                • Success at each entry point is
                                  accelerated by products within the
                                  Big Data platform

                                • Build the foundation for future
                                  requirements by expanding further
                                  into the big data platform



6
Different data workloads have different characteristics


                                   Database services that handle
                                   large volumes of transactions with
         System for Transactions   high availability, scalability and integrity

                                   Data Warehouse services for
         System for Analytics      complex analytics and reporting
         powered by                on data up to petabyte scale -
         Netezza technology        with minimal administration

                                   Operational Warehouse services for continuous
                                   ingest of operational data, complex analytics, and
         System for                a large volume
         Operational Analytics     of concurrent operational queries
Big Data Analytics – A national research
initiative
Big Data Analytics – A national research initiative

Daniel Gillblad
Research Group Leader, Senior Research Scientist
SICS, Swedish Institute of Computer Science
Background

• There is a very large potential, both societal and
  commercial, in the analysis, refinement, modeling,
  and visualization these data sets
• Capacity to store, transfer, and search is not enough -
  analytics is critical
Additional business value of Analytics

• Predict and optimize business outcomes
• New services and applications, both for end-users
  and industry
• New value chains, were different actors can create and
  exchange new analysis services
A national Big Data Analytics initiative

① A strategic nation-wide research and innovation agenda
   – Input from several sectors and application areas
   – Both new businesses built on analytics applications
     and traditional industry
   – Input from academia, both as developers and as users
② A national Big Data Analytics network
   – Open to all interested parties
   – Industry and academia with an active interest in Big Data Analytics
Focus areas

                     Control and planning


                     Visualization


   Focus areas
                 {   Analytics

                     Computation

                     Storage

                     Collection
Current constellation
Research and development challenges

• Huge businesses are built on Big Data Analytics today,
  but a large number of issues must be resolved to fully
  realize the potential

• Three examples
Example 1: Large-scale physics experimentation




• Challenges: Scale (storage, computation), scalable analytics
Example 2: Social network mining




• Challenges: Unstructured data, biased data, data access
Example 3: Access network pattern mining




• Challenges: Integrity issues, distributed
  mining, service frameworks
Long term trends

• Currently dominating approach will continue to be successful, but
  will be complemented due to
    – Too much data, unstructured data, noisy data
    – Limited access – security, integrity, legal, and business
    – Fast data generation, situation awareness
• The consequences are
    –   Analysis closer to data generation / collection
    –   No storage - Catching information on the fly
    –   Distributed analysis with incomplete data
    –   Real time collection, real time analytics
Research challenges

• Research challenges on different levels:
   –   The sensor/collection level
   –   The algorithmic/analytical level
   –   The system level
   –   The organisational level
Technical challenges, examples

•   Computational and storage framework development
•   Analysis of unstructured data
•   Distributed analysis
•   Efficient analysis algorithms
•   Stream mining
•   Managing sample bias
•   Managing uncertain and missing data
Platform and organisational challenges, examples

• Service and analytics frameworks, exchanging models and data
• API:s and standards

• Privacy, integrity, security, and legal
• Business models
Contacts

• If you are interested in the Swedish Big Data Analytics Network,
  feel free to contact


       Daniel Gillblad                Anders Holst
       dgi@sics.se                    aho@sics.se
       +46 8 633 15 68                +46 8 633 15 93
IBM Smarter Business 2012 - Big Data Analytics

Weitere ähnliche Inhalte

Mehr von IBM Sverige

Mehr von IBM Sverige (20)

#ibmbpsse18 - Koppla säkert & redundant till IBM Cloud - Magnus Huss, Interexion
#ibmbpsse18 - Koppla säkert & redundant till IBM Cloud - Magnus Huss, Interexion#ibmbpsse18 - Koppla säkert & redundant till IBM Cloud - Magnus Huss, Interexion
#ibmbpsse18 - Koppla säkert & redundant till IBM Cloud - Magnus Huss, Interexion
 
#ibmbpsse18 - Den svenska marknaden, Andreas Lundgren, CMO, IBM
#ibmbpsse18 - Den svenska marknaden, Andreas Lundgren, CMO, IBM#ibmbpsse18 - Den svenska marknaden, Andreas Lundgren, CMO, IBM
#ibmbpsse18 - Den svenska marknaden, Andreas Lundgren, CMO, IBM
 
Multiresursplanering - Karolinska Universitetssjukhuset
Multiresursplanering - Karolinska UniversitetssjukhusetMultiresursplanering - Karolinska Universitetssjukhuset
Multiresursplanering - Karolinska Universitetssjukhuset
 
Solving Challenges With 'Huge Data'
Solving Challenges With 'Huge Data'Solving Challenges With 'Huge Data'
Solving Challenges With 'Huge Data'
 
Blockchain explored
Blockchain explored Blockchain explored
Blockchain explored
 
Blockchain architected
Blockchain architectedBlockchain architected
Blockchain architected
 
Blockchain explained
Blockchain explainedBlockchain explained
Blockchain explained
 
Grow smarter project kista watson summit 2018_tommy auoja-1
Grow smarter project  kista watson summit 2018_tommy auoja-1Grow smarter project  kista watson summit 2018_tommy auoja-1
Grow smarter project kista watson summit 2018_tommy auoja-1
 
Bemanningsplanering axfood och houston final
Bemanningsplanering axfood och houston finalBemanningsplanering axfood och houston final
Bemanningsplanering axfood och houston final
 
Power ai nordics dcm
Power ai nordics dcmPower ai nordics dcm
Power ai nordics dcm
 
Nvidia and ibm presentation feb18
Nvidia and ibm presentation feb18Nvidia and ibm presentation feb18
Nvidia and ibm presentation feb18
 
Hwx introduction to_ibm_ai
Hwx introduction to_ibm_aiHwx introduction to_ibm_ai
Hwx introduction to_ibm_ai
 
Ac922 watson 180208 v1
Ac922 watson 180208 v1Ac922 watson 180208 v1
Ac922 watson 180208 v1
 
Watson kista summit 2018 box
Watson kista summit 2018 box Watson kista summit 2018 box
Watson kista summit 2018 box
 
Watson kista summit 2018 en bättre arbetsdag för de många människorna
Watson kista summit 2018   en bättre arbetsdag för de många människornaWatson kista summit 2018   en bättre arbetsdag för de många människorna
Watson kista summit 2018 en bättre arbetsdag för de många människorna
 
Iwcs and cisco watson kista summit 2018 v2
Iwcs and cisco   watson kista summit 2018 v2Iwcs and cisco   watson kista summit 2018 v2
Iwcs and cisco watson kista summit 2018 v2
 
Ibm intro (watson summit) bkacke
Ibm intro (watson summit) bkackeIbm intro (watson summit) bkacke
Ibm intro (watson summit) bkacke
 
Acoustic io t rail monitoring.pptx
Acoustic io t rail monitoring.pptxAcoustic io t rail monitoring.pptx
Acoustic io t rail monitoring.pptx
 
Watson christofer j_180208
Watson christofer j_180208Watson christofer j_180208
Watson christofer j_180208
 
Watson kista summit 2018 icp
Watson kista summit 2018 icpWatson kista summit 2018 icp
Watson kista summit 2018 icp
 

Kürzlich hochgeladen

The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai KuwaitThe Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
daisycvs
 

Kürzlich hochgeladen (20)

Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
 
Phases of Negotiation .pptx
 Phases of Negotiation .pptx Phases of Negotiation .pptx
Phases of Negotiation .pptx
 
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All TimeCall 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
 
Chennai Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Av...
Chennai Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Av...Chennai Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Av...
Chennai Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Av...
 
PHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation FinalPHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation Final
 
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
 
Berhampur CALL GIRL❤7091819311❤CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur CALL GIRL❤7091819311❤CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDINGBerhampur CALL GIRL❤7091819311❤CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur CALL GIRL❤7091819311❤CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
 
Cannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 UpdatedCannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 Updated
 
Uneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration PresentationUneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration Presentation
 
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai KuwaitThe Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
 
Falcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business Potential
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 
Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1
 
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book nowGUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book now
 
Arti Languages Pre Seed Teaser Deck 2024.pdf
Arti Languages Pre Seed Teaser Deck 2024.pdfArti Languages Pre Seed Teaser Deck 2024.pdf
Arti Languages Pre Seed Teaser Deck 2024.pdf
 
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptxQSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
 
JAJPUR CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN JAJPUR ESCORTS
JAJPUR CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN JAJPUR  ESCORTSJAJPUR CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN JAJPUR  ESCORTS
JAJPUR CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN JAJPUR ESCORTS
 
Lucknow Housewife Escorts by Sexy Bhabhi Service 8250092165
Lucknow Housewife Escorts  by Sexy Bhabhi Service 8250092165Lucknow Housewife Escorts  by Sexy Bhabhi Service 8250092165
Lucknow Housewife Escorts by Sexy Bhabhi Service 8250092165
 
Escorts in Nungambakkam Phone 8250092165 Enjoy 24/7 Escort Service Enjoy Your...
Escorts in Nungambakkam Phone 8250092165 Enjoy 24/7 Escort Service Enjoy Your...Escorts in Nungambakkam Phone 8250092165 Enjoy 24/7 Escort Service Enjoy Your...
Escorts in Nungambakkam Phone 8250092165 Enjoy 24/7 Escort Service Enjoy Your...
 

IBM Smarter Business 2012 - Big Data Analytics

  • 1. Big Data Analytics – Challenge and Opportunity
  • 2. 83x 6,000,000 users on Twitter 500,000,000 users on Twitter pushing out 300,000 pushing out 400,000,000 tweets per day tweets per day 1333x
  • 3. Where is big data coming from? 4.6 30 billion RFID billion 12+ TBs tags today camera of tweet data (1.3B in 2005) phones every day world wide 100s of millions data every ? TBs of of GPS day enabled devices sold 25+ TBs annually of 2+ log data billion every day people 76 million smart meters on the in 2009… 200M by 2014 Web by end 2011
  • 4. The Characteristics of Big Data Cost efficiently Responding to the Collectively analyzing processing the increasing Velocity the broadening Variety growing Volume 50x 35 ZB 30 Billion RFID 80% of the sensors and worlds data is counting unstructured 2010 2020 Establishing the By 2015, 80% of all available data will be uncertain - The number of networked devices will be double the entire Veracity of big global population data sources - The total number of social media accounts exceeds the entire global population
  • 5. Big Data is a Hot topic - Because it is possible to Analyze ALL Available Data • The percentage of available data an enterprise can analyze is decreasing proportionately to the available to that enterprise – Quite simply, this means as enterprises, we are getting ―more naive‖ about our business over time • Just collecting and storing “Big Data” doesn’t drive a cent of value to an organization’s bottom line • Cost effectively manage and analyze ALL available data in its native form unstructured, structured, realtime streaming…….Internal and external Data AVAILABLE to an organization Data an organization can PROCESS
  • 6. Business-centric Big Data Platform • ―Big data‖ isn’t just a technology —it’s a business strategy for capitalizing on information resources • Getting started is crucial • Success at each entry point is accelerated by products within the Big Data platform • Build the foundation for future requirements by expanding further into the big data platform 6
  • 7. Different data workloads have different characteristics Database services that handle large volumes of transactions with System for Transactions high availability, scalability and integrity Data Warehouse services for System for Analytics complex analytics and reporting powered by on data up to petabyte scale - Netezza technology with minimal administration Operational Warehouse services for continuous ingest of operational data, complex analytics, and System for a large volume Operational Analytics of concurrent operational queries
  • 8. Big Data Analytics – A national research initiative
  • 9. Big Data Analytics – A national research initiative Daniel Gillblad Research Group Leader, Senior Research Scientist SICS, Swedish Institute of Computer Science
  • 10. Background • There is a very large potential, both societal and commercial, in the analysis, refinement, modeling, and visualization these data sets • Capacity to store, transfer, and search is not enough - analytics is critical
  • 11. Additional business value of Analytics • Predict and optimize business outcomes • New services and applications, both for end-users and industry • New value chains, were different actors can create and exchange new analysis services
  • 12. A national Big Data Analytics initiative ① A strategic nation-wide research and innovation agenda – Input from several sectors and application areas – Both new businesses built on analytics applications and traditional industry – Input from academia, both as developers and as users ② A national Big Data Analytics network – Open to all interested parties – Industry and academia with an active interest in Big Data Analytics
  • 13. Focus areas Control and planning Visualization Focus areas { Analytics Computation Storage Collection
  • 15. Research and development challenges • Huge businesses are built on Big Data Analytics today, but a large number of issues must be resolved to fully realize the potential • Three examples
  • 16. Example 1: Large-scale physics experimentation • Challenges: Scale (storage, computation), scalable analytics
  • 17. Example 2: Social network mining • Challenges: Unstructured data, biased data, data access
  • 18. Example 3: Access network pattern mining • Challenges: Integrity issues, distributed mining, service frameworks
  • 19. Long term trends • Currently dominating approach will continue to be successful, but will be complemented due to – Too much data, unstructured data, noisy data – Limited access – security, integrity, legal, and business – Fast data generation, situation awareness • The consequences are – Analysis closer to data generation / collection – No storage - Catching information on the fly – Distributed analysis with incomplete data – Real time collection, real time analytics
  • 20. Research challenges • Research challenges on different levels: – The sensor/collection level – The algorithmic/analytical level – The system level – The organisational level
  • 21. Technical challenges, examples • Computational and storage framework development • Analysis of unstructured data • Distributed analysis • Efficient analysis algorithms • Stream mining • Managing sample bias • Managing uncertain and missing data
  • 22. Platform and organisational challenges, examples • Service and analytics frameworks, exchanging models and data • API:s and standards • Privacy, integrity, security, and legal • Business models
  • 23. Contacts • If you are interested in the Swedish Big Data Analytics Network, feel free to contact Daniel Gillblad Anders Holst dgi@sics.se aho@sics.se +46 8 633 15 68 +46 8 633 15 93

Hinweis der Redaktion

  1. An enormous amounts of data permeate societyBoth the data itself and how it is usedDeeper analysis of audio and video
  2. * A move from instance based to model based approaches