SlideShare ist ein Scribd-Unternehmen logo
1 von 59
Downloaden Sie, um offline zu lesen
Enterprise Intelligence



       Jeff Jonas, IBM Distinguished Engineer
        Chief Scientist, IBM Entity Analytics
                  Email: jeffjonas@us.ibm.com
                Blog: www.jeffjonas.typepad.com
           Twitter: http://www.twitter.com/jeffjonas

1
                                                       © 2012 IBM Corporation
My Background

     Early 80‟s: Founded Systems Research & Development (SRD), a
      custom software consultancy

     Personally designed and deployed +/- 100 systems, a number of
      which contained multi-billions of transactions describing 100‟s
      of millions of entities

     1989 – 2003: Built numerous systems for Las Vegas casinos
      including a technology known as Non-Obvious Relationship
      Awareness (NORA)

     2001: Funded by In-Q-Tel, the venture capital arm of the CIA

     2005: IBM acquires SRD

     Today: Primarily focused on „sensemaking on streams‟ with
      special attention towards privacy and civil liberties protections

2
                                                                © 2012 IBM Corporation
Trend: Organizations Are Getting Dumber

                                                  Every two days now we create as
                              Available           much information as we did from
                             Observation          the dawn of civilization up until
    Computing Power Growth


                               Space              2003.”

                                                      ~ EricContext CEO Google
                                                             Schmidt,
                                                   Enterprise
                                                    Amnesia




                                                                Sensemaking
                                                                 Algorithms


                                           Time
3
                                                                           © 2012 IBM Corporation
Amnesia, definition

    A defect in memory, especially resulting
     from brain damage.




4
                                        © 2012 IBM Corporation
Enterprise Amnesia, definition

    A defect in memory, resulting in wasted
     resources, lower revenues, unnecessary
     fraud losses, etc.




5
                                      © 2012 IBM Corporation
Trend: Organizations Are Getting Dumber

                              Available
                             Observation
    Computing Power Growth


                               Space

                                                  WHY?
                                                  Context




                                                    Sensemaking
                                                     Algorithms


                                           Time
6
                                                            © 2012 IBM Corporation
Algorithms at Dead End.

          You Can‟t
      Squeeze Knowledge
        Out of a Pixel.

7
                          © 2012 IBM Corporation
No Context


             scrila34@msn.com




8
                                © 2012 IBM Corporation
Context, definition

    Better understanding
     something by taking into
     account the things around it.


9
                              © 2012 IBM Corporation
Information in Context … and Accumulating



                 scrila34@msn.com




       Job
     Applicant                              Top 200
                                            Customer




                                         Criminal
                                       Investigation
     Identity
      Thief

10
                                            © 2012 IBM Corporation
The Puzzle Metaphor

      Imagine an ever-growing pile of puzzle pieces of varying sizes,
       shapes and colors

      What it represents is unknown – there is no picture on hand

      Is it one puzzle, 15 puzzles, or 1,500 different puzzles?

      Some pieces are duplicates, missing, incomplete, low quality, or
       have been misinterpreted

      Some pieces may even be professionally fabricated lies

      Until you take the pieces to the table and attempt assembly,
       you don‟t know what you are dealing with


11
                                                                   © 2012 IBM Corporation
Puzzling




     270 pieces
     Vegas                       200 pieces
                                Neuschwanstein Beauty
                                © 2009 Photo Copyright
                                                           150 pieces
                                                           Down Home Music
                                                           © Kay Lamb Shannon,
                                                                                          6 pieces
                                                                                      Cottage Garden
                                                                                      © 2010 Royce B. McClure,
                                                                                             2%
     Artwork provided by
        90%
     Hadley House Licensing,
     Minneapolis
                                    66%
                                Robert Cushman Hayes
                                © 2009 Ravensburger USA,
                                                              50%
                                                           Artist
                                                           Licensed by Cypress Fine
                                                                                      Artist All Rights Reserved
                                                                                      © 2010 Ravensburger USA,
     © 2011 Giesla Hoelscher    Inc.                       Art Licensing              Inc.
     All Rights Reserved                                   © 2011 Ravensburger USA
     © 2011 Ravensburger USA,                              Inc.
     Inc.
                                30 pieces
                                10%
                                (duplicates)




12
                                                                                                     © 2012 IBM Corporation
13
     © 2012 IBM Corporation
14
     © 2012 IBM Corporation
First Discovery




15
                  © 2012 IBM Corporation
More Data Finds Data




16
                       © 2012 IBM Corporation
Duplicates in Front Of Your Eyes




17
                                   © 2012 IBM Corporation
First Duplicate Found Here




18
                             © 2012 IBM Corporation
19
     © 2012 IBM Corporation
Incremental Context – Incremental Discovery

     6:40pm   START

     22min    “Hey, this one is a duplicate!”

     35min    “I think some pieces are missing.”

     37min    “Looks like a bunch of hillbillies on
              a porch.”

     44min    “Hillbillies, playing guitars, sitting
              on a porch, near a barber sign …
              and a banjo!”

20
                                                       © 2012 IBM Corporation
150 pieces
     50%




21
                  © 2012 IBM Corporation
Incremental Context – Incremental Discovery

     47min    “We should take the sky and grass
              off the table.”

     2hr      “Let‟s switch sides, and see if we
              can make sense of this from
              different perspectives.”

     2hr10m   “Wait, there are three … no, four
              puzzles.”

     2hr17m   “We need a bigger table.”

     2hr18m   “I think you threw in a few random
              pieces.”

22
                                                   © 2012 IBM Corporation
23
     © 2012 IBM Corporation
How Context Accumulates

      With each new observation … one of three assertions are made:
       1) Un-associated; 2) placed near like neighbors; or 3) connected

      Must favor the false negative

      New observations sometimes reverse earlier assertions

      Some observations produce novel discovery

      As the working space expands, computational effort increases

      Given sufficient observations, there can come a tipping point

      Thereafter, confidence improves while computational effort
       decreases!


24
                                                                  © 2012 IBM Corporation
Big Data [in context]. New Physics.


     More data: better the predictions
       – Lower false positives
       – Lower false negatives


     More data: bad data good
       – Suddenly glad your data is not perfect


     More data: less compute

25
                                                  © 2012 IBM Corporation
Big Data




           Pile of ____   In Context
26
                                   © 2012 IBM Corporation
One Form of Context: “Expert Counting”


      Is it 5 people each with 1 account … or is it 1
       person with 5 accounts?

      Is it 20 cases of H1N1 in 20 cities … or one
       case reported 20 times?

      If one cannot count … one cannot estimate
       vector or velocity (direction and speed).

      Without vector and velocity … prediction is
       nearly impossible.
27
                                                 © 2012 IBM Corporation
Entity Resolution
        Demonstration




28
                         © 2012 IBM Corporation
Entity Resolution Demonstration

       VOTER                                 DECEASED PERSON
       George F Balston                      George Balston
       YOB: 1951 D/L: 4801                   YOB: 1951 SSN: 5598
       13070 SW Karen Blvd Apt 7             DOD: 1995
       Beaverton, OR 97005
       Last voted: 2008

     When it comes to best practices in voter matching, if only a name and
      year of birth match, this is insufficient proof of a match. Many
        different people in the U.S. share a name and year of birth.

                          Human review is required.

     Unfortunately, there are thousands and thousands of cases just like
      this and state election offices don‟t have the staff (or budget) to
                         manually review such volumes.


29
                                                                    © 2012 IBM Corporation
Now Consider This Tertiary DMV Record

       VOTER                                 DECEASED PERSON
       George F Balston                      George Balston
       YOB: 1951 D/L: 4801                   YOB: 1951 SSN: 5598
       13070 SW Karen Blvd Apt 7             DOD: 1995
       Beaverton, OR 97005
       Last voted: 2008


                       DMV
                       George F Balston
                       YOB: 1951 SSN: 5598 D/L: 4801
                       3043 SW Clementine Blvd Apt 210
                       Beaverton, OR 97005

       The DMV record contains enough features to match both the voter
     (name, year of birth and driver‟s license) and/or the deceased persons
     record (name, year of birth and SSN). For the sake of argument, let‟s
30
                         say it matches the voter best.
                                                                    © 2012 IBM Corporation
Features Accumulate

       VOTER                                 DECEASED PERSON
       George F Balston                      George Balston
       YOB: 1951 D/L: 4801                   YOB: 1951 SSN: 5598
       13070 SW Karen Blvd Apt 7             DOD: 1995
       Beaverton, OR 97005
       Last voted: 2008
       DMV
       George F Balston
       YOB: 1951 SSN: 5598 D/L: 4801
       3043 SW Clementine Blvd Apt 210
       Beaverton, OR 97005



     The voter/DMV record now shares a name, year of birth and SSN with
      the deceased person record. In voter matching best practices, this
      evidence would be sufficient to make a determination that this voter
          is in fact deceased. This case no longer needs human review.
31
                                                                   © 2012 IBM Corporation
Useful Insight Revealed!

     VOTER
     George F Balston                  As features accumulate it
     YOB: 1951 D/L: 4801                  becomes possible to
     13070 SW Karen Blvd Apt 7            resolve previous un-
     Beaverton, OR 97005                   resolvable identity
     Last voted: 2008                           records.
     DMV
     George F Balston                        As events and
     YOB: 1951 SSN: 5598 D/L: 4801     transactions accumulate –
     3043 SW Clementine Blvd Apt 210     detection of relevance
     Beaverton, OR 97005                       improves.
     DECEASED PERSON                   Here we can see George
     George Balston                    who died in 1995 voted in
     YOB: 1951 SSN: 5598                         2008.
     DOD: 1995


32
                                                           © 2012 IBM Corporation
IBM InfoSphere
     Identity Insight V8



33
                           © 2012 IBM Corporation
MoneyGram International




34
                          © 2012 IBM Corporation
Enterprise Intelligence
         One Plausible Journey
         One Plausible Journey




35
                                 © 2012 IBM Corporation
Sense and Respond


       Observation
         Space




        New
     Observations




                         What you know




36
                                         © 2012 IBM Corporation
Sense and Respond


       Observation
         Space




                                  Data Finds
                                     Data




                  Relevance
               Finds the Sensor
                     (<200ms)
                                                ?
                                               Decide



37
                                                        © 2012 IBM Corporation
Sense and Respond                                  Explore and Reflect


       Observation
         Space                                                          Deep
                                                                      Reflection

                                                        Curated
                                                         Data

                                  Data Finds                                 Pattern
                                     Data                                   Discovery




                                                                       Directed
                                                                       Attention
                  Relevance
               Finds the Sensor
                     (<200ms)
                                                ?
                                                          Relevance
                                               Decide      Find You




38
                                                                            © 2012 IBM Corporation
Sense and Respond                                    Explore and Reflect


       Observation
         Space                                                           Deep
                                                                       Reflection

                                                             Curated
                                                              Data

                                  Data Finds                                  Pattern
                                     Data                                    Discovery




                                                                        Directed
                                                                        Attention
                  Relevance                                NEW
               Finds the Sensor
                     (<200ms)
                                                ?       INTERESTS



                                               Decide



39
                                                                             © 2012 IBM Corporation
Sense and Respond                                     Explore and Reflect


       Observation              InfoSphere Streams                      Netezza
         Space                                                            Deep
                                                                         SPSS
                                                                        Reflection
                                                                        Watson
                                                              Curated
                                                               Data

                                       Data Finds                              Pattern
                                          Data SPSS                           Discovery

                                         Sensemaking


                                                                        Cognos
                                                                          Directed
                                                                         Attention
                  Relevance                                 NEW
                            InfoSphere
               Finds the Sensor
                     (<200ms)
                                           Streams   ?   INTERESTS
                                       ILog

                                                Decide



40
                                                                              © 2012 IBM Corporation
Sense and Respond                                     Explore and Reflect


       Observation
         Space                                                            Deep
                                                                        Reflection

                                                              Curated
                                                               Data

                                   Data Finds                                  Pattern
                                      Data                                    Discovery




                                                                         Directed
                                                                         Attention
                  Relevance                                 NEW
               Finds the Sensor
                     (<200ms)
                                                 ?       INTERESTS



                                                Decide



41
                                  Report and Manage
                                                                              © 2012 IBM Corporation
Data Finds                              Pattern
                            Data                                Discovery




                                                           Directed
                                                           Attention
        Relevance                                 NEW
     Finds the Sensor
         (<200ms)
                                       ?       INTERESTS



                                      Decide




                             Content Management
                              Info Management
                              Case Management
                              Data Systems
                                   Warehousing




42
                        Report and Manage
                                                                © 2012 IBM Corporation
Big Data Trends




43
                       © 2012 IBM Corporation
The Greater the Context, the Greater the Value

                                           Data
                                        in Context
     Value of Data




                                                        Pile of Data



                     (Big)   Records Managed         (Ludicrous Big)
44
                                                                © 2012 IBM Corporation
Time Is Of The Essence
                                                        The better the
                                                       predictions … the
                                  Batch               faster they will be
                                                           wanted.
                           Day
                                                      “Why did we have
     Willingness to Wait




                                                        to wait until the
                           Hour                       end of the day for
                                                      the smart answer?”




                           200ms                       Real-Time



                                 (Iffy)   Relevance      (Totally)
45
                                                                     © 2012 IBM Corporation
Closing Thoughts




46
                        © 2012 IBM Corporation
The most competitive organizations

     are going to make sense of what they are observing

            fast enough to do something about it

                while they are observing it.




47
                                                   © 2012 IBM Corporation
Wish This On The Competitor

                               Available
                              Observation
     Computing Power Growth


                                Space

                                                           Context
                                                   Enterprise
                                                    Amnesia




                                                            Sensemaking
                                                             Algorithms


                                            Time
48
                                                                     © 2012 IBM Corporation
The Way Forward: Enterprise Intelligence

                               Available
                              Observation
     Computing Power Growth


                                Space
                                                   Context




                                                             Sensemaking
                                                              Algorithms


                                            Time
49
                                                                     © 2012 IBM Corporation
Related Blog Posts

     Algorithms At Dead-End: Cannot Squeeze Knowledge Out Of A
       Pixel

     Puzzling: How Observations Are Accumulated Into Context

     On A Smarter Planet … Some Organizations Will Be Smarter-er
      Than Others




     G2 | Sensemaking – One Year Birthday Today. Cognitive Basics
      Emerging.




50
                                                            © 2012 IBM Corporation
Questions?

          Email: jeffjonas@us.ibm.com
        Blog: www.jeffjonas.typepad.com
     Twitter: http://www.twitter.com/jeffjonas



51
                                                 © 2012 IBM Corporation
Enterprise Intelligence



        Jeff Jonas, IBM Distinguished Engineer
         Chief Scientist, IBM Entity Analytics
                   Email: jeffjonas@us.ibm.com
                 Blog: www.jeffjonas.typepad.com
            Twitter: http://www.twitter.com/jeffjonas

52
                                                        © 2012 IBM Corporation
Sensemaking on Streams
        My G2 Secret Little IBM Project




              3+ years in the making




53
                                          © 2012 IBM Corporation
G2 Mission Statement

     1) Evaluate each new observation against
       previous observations.

     2) Determine if what is being observed is
      relevant.

     3) Delivering this actionable insight to its
      consumer … fast enough to do something
      about it while it is still happening.

     4) Doing this with sufficient accuracy and
      scale to really matter.
54
                                                 © 2012 IBM Corporation
From Pixels to Pictures to Action

                                     Relevance Finds You
           Data Finds Data



                 This is G2




Observations            Persistent                     Consumer
                         Context                    (An analyst, a system,
                                                    the sensor itself, etc.)




55
                                                             © 2012 IBM Corporation
Uniquely G2

      More scalable, faster and extensible
        – Designed for grid compute and sub-200ms sense and respond


      Smarter
        – Tolerance for disagreement (no such thing as a single version of truth)
        – Support for more abstract entities (e.g., locations, products, asteroids)
        – Support for more exotic features (e.g., biometrics, social circles)


      Crazy stuff
        – Detects on its own when it is confused and makes “note to self”
        – Geospatial reasoning including a sense of here and now


      Privacy by Design (PbD)
        – More privacy and civil liberties enhancing features baked-in than any other
          commercial technology


56
                                                                            © 2012 IBM Corporation
PbD: Self-Correcting False Positives


     1                        A plausible claim these
     John T Smith Jr          two people are the same
     123 Main Street
      703 111-2000
     DOB: 03/12/1984
                                              3
     2 John T Smith       Until this record    John T Smith Sr
      123 Main Street      comes into view     123 Main Street
        703 111-2000                             703 111-2000
       DL: 009900991                            DL: 009900991




           Which reveals this is a
            FALSE POSITIVE

57
                                                                 © 2012 IBM Corporation
PbD: Self-Correcting False Positives


     1
     John T Smith Jr
     123 Main Street
      703 111-2000
     DOB: 03/12/1984
                                        3
     2 John T Smith                      John T Smith Sr
      123 Main Street                    123 Main Street
        703 111-2000                       703 111-2000
       DL: 009900991                      DL: 009900991

                                        2 John T Smith
                                         123 Main Street
                                           703 111-2000
                                          DL: 009900991
            New Best Practice:
          FIXED IN REAL-TIME
                   (not end of month)


58
                                                           © 2012 IBM Corporation
Customer Facing Systems




       Fraud                                          Data Mining

                             Sensemaking

     This System                                      That System




                   Back-of-House Accounting Systems
59
                                                            © 2012 IBM Corporation

Weitere ähnliche Inhalte

Ähnlich wie Enterprise Intelligence: How Context Accumulates

IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!Tony Pearson
 
HBase and Hadoop at Adobe
HBase and Hadoop at AdobeHBase and Hadoop at Adobe
HBase and Hadoop at AdobeCosmin Lehene
 
Ramsey in the eye of the beholder wcfay final
Ramsey in the eye of the beholder wcfay finalRamsey in the eye of the beholder wcfay final
Ramsey in the eye of the beholder wcfay finalwcfay
 
The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)Mark Heid
 
Social Business and Digital Marketing
Social Business and Digital MarketingSocial Business and Digital Marketing
Social Business and Digital MarketingIBM Danmark
 
The Drastic Changing Landscape of Digital Marketing
The Drastic Changing Landscape of Digital MarketingThe Drastic Changing Landscape of Digital Marketing
The Drastic Changing Landscape of Digital MarketingIBM Danmark
 

Ähnlich wie Enterprise Intelligence: How Context Accumulates (7)

IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
 
HBase and Hadoop at Adobe
HBase and Hadoop at AdobeHBase and Hadoop at Adobe
HBase and Hadoop at Adobe
 
Ramsey in the eye of the beholder wcfay final
Ramsey in the eye of the beholder wcfay finalRamsey in the eye of the beholder wcfay final
Ramsey in the eye of the beholder wcfay final
 
Inside the Mind of Watson: Cognitive Computing
Inside the Mind of Watson: Cognitive ComputingInside the Mind of Watson: Cognitive Computing
Inside the Mind of Watson: Cognitive Computing
 
The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)
 
Social Business and Digital Marketing
Social Business and Digital MarketingSocial Business and Digital Marketing
Social Business and Digital Marketing
 
The Drastic Changing Landscape of Digital Marketing
The Drastic Changing Landscape of Digital MarketingThe Drastic Changing Landscape of Digital Marketing
The Drastic Changing Landscape of Digital Marketing
 

Kürzlich hochgeladen

Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Kürzlich hochgeladen (20)

Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

Enterprise Intelligence: How Context Accumulates

  • 1. Enterprise Intelligence Jeff Jonas, IBM Distinguished Engineer Chief Scientist, IBM Entity Analytics Email: jeffjonas@us.ibm.com Blog: www.jeffjonas.typepad.com Twitter: http://www.twitter.com/jeffjonas 1 © 2012 IBM Corporation
  • 2. My Background  Early 80‟s: Founded Systems Research & Development (SRD), a custom software consultancy  Personally designed and deployed +/- 100 systems, a number of which contained multi-billions of transactions describing 100‟s of millions of entities  1989 – 2003: Built numerous systems for Las Vegas casinos including a technology known as Non-Obvious Relationship Awareness (NORA)  2001: Funded by In-Q-Tel, the venture capital arm of the CIA  2005: IBM acquires SRD  Today: Primarily focused on „sensemaking on streams‟ with special attention towards privacy and civil liberties protections 2 © 2012 IBM Corporation
  • 3. Trend: Organizations Are Getting Dumber Every two days now we create as Available much information as we did from Observation the dawn of civilization up until Computing Power Growth Space 2003.” ~ EricContext CEO Google Schmidt, Enterprise Amnesia Sensemaking Algorithms Time 3 © 2012 IBM Corporation
  • 4. Amnesia, definition A defect in memory, especially resulting from brain damage. 4 © 2012 IBM Corporation
  • 5. Enterprise Amnesia, definition A defect in memory, resulting in wasted resources, lower revenues, unnecessary fraud losses, etc. 5 © 2012 IBM Corporation
  • 6. Trend: Organizations Are Getting Dumber Available Observation Computing Power Growth Space WHY? Context Sensemaking Algorithms Time 6 © 2012 IBM Corporation
  • 7. Algorithms at Dead End. You Can‟t Squeeze Knowledge Out of a Pixel. 7 © 2012 IBM Corporation
  • 8. No Context scrila34@msn.com 8 © 2012 IBM Corporation
  • 9. Context, definition Better understanding something by taking into account the things around it. 9 © 2012 IBM Corporation
  • 10. Information in Context … and Accumulating scrila34@msn.com Job Applicant Top 200 Customer Criminal Investigation Identity Thief 10 © 2012 IBM Corporation
  • 11. The Puzzle Metaphor  Imagine an ever-growing pile of puzzle pieces of varying sizes, shapes and colors  What it represents is unknown – there is no picture on hand  Is it one puzzle, 15 puzzles, or 1,500 different puzzles?  Some pieces are duplicates, missing, incomplete, low quality, or have been misinterpreted  Some pieces may even be professionally fabricated lies  Until you take the pieces to the table and attempt assembly, you don‟t know what you are dealing with 11 © 2012 IBM Corporation
  • 12. Puzzling 270 pieces Vegas 200 pieces Neuschwanstein Beauty © 2009 Photo Copyright 150 pieces Down Home Music © Kay Lamb Shannon, 6 pieces Cottage Garden © 2010 Royce B. McClure, 2% Artwork provided by 90% Hadley House Licensing, Minneapolis 66% Robert Cushman Hayes © 2009 Ravensburger USA, 50% Artist Licensed by Cypress Fine Artist All Rights Reserved © 2010 Ravensburger USA, © 2011 Giesla Hoelscher Inc. Art Licensing Inc. All Rights Reserved © 2011 Ravensburger USA © 2011 Ravensburger USA, Inc. Inc. 30 pieces 10% (duplicates) 12 © 2012 IBM Corporation
  • 13. 13 © 2012 IBM Corporation
  • 14. 14 © 2012 IBM Corporation
  • 15. First Discovery 15 © 2012 IBM Corporation
  • 16. More Data Finds Data 16 © 2012 IBM Corporation
  • 17. Duplicates in Front Of Your Eyes 17 © 2012 IBM Corporation
  • 18. First Duplicate Found Here 18 © 2012 IBM Corporation
  • 19. 19 © 2012 IBM Corporation
  • 20. Incremental Context – Incremental Discovery 6:40pm START 22min “Hey, this one is a duplicate!” 35min “I think some pieces are missing.” 37min “Looks like a bunch of hillbillies on a porch.” 44min “Hillbillies, playing guitars, sitting on a porch, near a barber sign … and a banjo!” 20 © 2012 IBM Corporation
  • 21. 150 pieces 50% 21 © 2012 IBM Corporation
  • 22. Incremental Context – Incremental Discovery 47min “We should take the sky and grass off the table.” 2hr “Let‟s switch sides, and see if we can make sense of this from different perspectives.” 2hr10m “Wait, there are three … no, four puzzles.” 2hr17m “We need a bigger table.” 2hr18m “I think you threw in a few random pieces.” 22 © 2012 IBM Corporation
  • 23. 23 © 2012 IBM Corporation
  • 24. How Context Accumulates  With each new observation … one of three assertions are made: 1) Un-associated; 2) placed near like neighbors; or 3) connected  Must favor the false negative  New observations sometimes reverse earlier assertions  Some observations produce novel discovery  As the working space expands, computational effort increases  Given sufficient observations, there can come a tipping point  Thereafter, confidence improves while computational effort decreases! 24 © 2012 IBM Corporation
  • 25. Big Data [in context]. New Physics. More data: better the predictions – Lower false positives – Lower false negatives More data: bad data good – Suddenly glad your data is not perfect More data: less compute 25 © 2012 IBM Corporation
  • 26. Big Data Pile of ____ In Context 26 © 2012 IBM Corporation
  • 27. One Form of Context: “Expert Counting”  Is it 5 people each with 1 account … or is it 1 person with 5 accounts?  Is it 20 cases of H1N1 in 20 cities … or one case reported 20 times?  If one cannot count … one cannot estimate vector or velocity (direction and speed).  Without vector and velocity … prediction is nearly impossible. 27 © 2012 IBM Corporation
  • 28. Entity Resolution Demonstration 28 © 2012 IBM Corporation
  • 29. Entity Resolution Demonstration VOTER DECEASED PERSON George F Balston George Balston YOB: 1951 D/L: 4801 YOB: 1951 SSN: 5598 13070 SW Karen Blvd Apt 7 DOD: 1995 Beaverton, OR 97005 Last voted: 2008 When it comes to best practices in voter matching, if only a name and year of birth match, this is insufficient proof of a match. Many different people in the U.S. share a name and year of birth. Human review is required. Unfortunately, there are thousands and thousands of cases just like this and state election offices don‟t have the staff (or budget) to manually review such volumes. 29 © 2012 IBM Corporation
  • 30. Now Consider This Tertiary DMV Record VOTER DECEASED PERSON George F Balston George Balston YOB: 1951 D/L: 4801 YOB: 1951 SSN: 5598 13070 SW Karen Blvd Apt 7 DOD: 1995 Beaverton, OR 97005 Last voted: 2008 DMV George F Balston YOB: 1951 SSN: 5598 D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 The DMV record contains enough features to match both the voter (name, year of birth and driver‟s license) and/or the deceased persons record (name, year of birth and SSN). For the sake of argument, let‟s 30 say it matches the voter best. © 2012 IBM Corporation
  • 31. Features Accumulate VOTER DECEASED PERSON George F Balston George Balston YOB: 1951 D/L: 4801 YOB: 1951 SSN: 5598 13070 SW Karen Blvd Apt 7 DOD: 1995 Beaverton, OR 97005 Last voted: 2008 DMV George F Balston YOB: 1951 SSN: 5598 D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 The voter/DMV record now shares a name, year of birth and SSN with the deceased person record. In voter matching best practices, this evidence would be sufficient to make a determination that this voter is in fact deceased. This case no longer needs human review. 31 © 2012 IBM Corporation
  • 32. Useful Insight Revealed! VOTER George F Balston As features accumulate it YOB: 1951 D/L: 4801 becomes possible to 13070 SW Karen Blvd Apt 7 resolve previous un- Beaverton, OR 97005 resolvable identity Last voted: 2008 records. DMV George F Balston As events and YOB: 1951 SSN: 5598 D/L: 4801 transactions accumulate – 3043 SW Clementine Blvd Apt 210 detection of relevance Beaverton, OR 97005 improves. DECEASED PERSON Here we can see George George Balston who died in 1995 voted in YOB: 1951 SSN: 5598 2008. DOD: 1995 32 © 2012 IBM Corporation
  • 33. IBM InfoSphere Identity Insight V8 33 © 2012 IBM Corporation
  • 34. MoneyGram International 34 © 2012 IBM Corporation
  • 35. Enterprise Intelligence One Plausible Journey One Plausible Journey 35 © 2012 IBM Corporation
  • 36. Sense and Respond Observation Space New Observations What you know 36 © 2012 IBM Corporation
  • 37. Sense and Respond Observation Space Data Finds Data Relevance Finds the Sensor (<200ms) ? Decide 37 © 2012 IBM Corporation
  • 38. Sense and Respond Explore and Reflect Observation Space Deep Reflection Curated Data Data Finds Pattern Data Discovery Directed Attention Relevance Finds the Sensor (<200ms) ? Relevance Decide Find You 38 © 2012 IBM Corporation
  • 39. Sense and Respond Explore and Reflect Observation Space Deep Reflection Curated Data Data Finds Pattern Data Discovery Directed Attention Relevance NEW Finds the Sensor (<200ms) ? INTERESTS Decide 39 © 2012 IBM Corporation
  • 40. Sense and Respond Explore and Reflect Observation InfoSphere Streams Netezza Space Deep SPSS Reflection Watson Curated Data Data Finds Pattern Data SPSS Discovery Sensemaking Cognos Directed Attention Relevance NEW InfoSphere Finds the Sensor (<200ms) Streams ? INTERESTS ILog Decide 40 © 2012 IBM Corporation
  • 41. Sense and Respond Explore and Reflect Observation Space Deep Reflection Curated Data Data Finds Pattern Data Discovery Directed Attention Relevance NEW Finds the Sensor (<200ms) ? INTERESTS Decide 41 Report and Manage © 2012 IBM Corporation
  • 42. Data Finds Pattern Data Discovery Directed Attention Relevance NEW Finds the Sensor (<200ms) ? INTERESTS Decide Content Management Info Management Case Management Data Systems Warehousing 42 Report and Manage © 2012 IBM Corporation
  • 43. Big Data Trends 43 © 2012 IBM Corporation
  • 44. The Greater the Context, the Greater the Value Data in Context Value of Data Pile of Data (Big) Records Managed (Ludicrous Big) 44 © 2012 IBM Corporation
  • 45. Time Is Of The Essence The better the predictions … the Batch faster they will be wanted. Day “Why did we have Willingness to Wait to wait until the Hour end of the day for the smart answer?” 200ms Real-Time (Iffy) Relevance (Totally) 45 © 2012 IBM Corporation
  • 46. Closing Thoughts 46 © 2012 IBM Corporation
  • 47. The most competitive organizations are going to make sense of what they are observing fast enough to do something about it while they are observing it. 47 © 2012 IBM Corporation
  • 48. Wish This On The Competitor Available Observation Computing Power Growth Space Context Enterprise Amnesia Sensemaking Algorithms Time 48 © 2012 IBM Corporation
  • 49. The Way Forward: Enterprise Intelligence Available Observation Computing Power Growth Space Context Sensemaking Algorithms Time 49 © 2012 IBM Corporation
  • 50. Related Blog Posts Algorithms At Dead-End: Cannot Squeeze Knowledge Out Of A Pixel Puzzling: How Observations Are Accumulated Into Context On A Smarter Planet … Some Organizations Will Be Smarter-er Than Others G2 | Sensemaking – One Year Birthday Today. Cognitive Basics Emerging. 50 © 2012 IBM Corporation
  • 51. Questions? Email: jeffjonas@us.ibm.com Blog: www.jeffjonas.typepad.com Twitter: http://www.twitter.com/jeffjonas 51 © 2012 IBM Corporation
  • 52. Enterprise Intelligence Jeff Jonas, IBM Distinguished Engineer Chief Scientist, IBM Entity Analytics Email: jeffjonas@us.ibm.com Blog: www.jeffjonas.typepad.com Twitter: http://www.twitter.com/jeffjonas 52 © 2012 IBM Corporation
  • 53. Sensemaking on Streams My G2 Secret Little IBM Project 3+ years in the making 53 © 2012 IBM Corporation
  • 54. G2 Mission Statement 1) Evaluate each new observation against previous observations. 2) Determine if what is being observed is relevant. 3) Delivering this actionable insight to its consumer … fast enough to do something about it while it is still happening. 4) Doing this with sufficient accuracy and scale to really matter. 54 © 2012 IBM Corporation
  • 55. From Pixels to Pictures to Action Relevance Finds You Data Finds Data This is G2 Observations Persistent Consumer Context (An analyst, a system, the sensor itself, etc.) 55 © 2012 IBM Corporation
  • 56. Uniquely G2  More scalable, faster and extensible – Designed for grid compute and sub-200ms sense and respond  Smarter – Tolerance for disagreement (no such thing as a single version of truth) – Support for more abstract entities (e.g., locations, products, asteroids) – Support for more exotic features (e.g., biometrics, social circles)  Crazy stuff – Detects on its own when it is confused and makes “note to self” – Geospatial reasoning including a sense of here and now  Privacy by Design (PbD) – More privacy and civil liberties enhancing features baked-in than any other commercial technology 56 © 2012 IBM Corporation
  • 57. PbD: Self-Correcting False Positives 1 A plausible claim these John T Smith Jr two people are the same 123 Main Street 703 111-2000 DOB: 03/12/1984 3 2 John T Smith Until this record John T Smith Sr 123 Main Street comes into view 123 Main Street 703 111-2000 703 111-2000 DL: 009900991 DL: 009900991 Which reveals this is a FALSE POSITIVE 57 © 2012 IBM Corporation
  • 58. PbD: Self-Correcting False Positives 1 John T Smith Jr 123 Main Street 703 111-2000 DOB: 03/12/1984 3 2 John T Smith John T Smith Sr 123 Main Street 123 Main Street 703 111-2000 703 111-2000 DL: 009900991 DL: 009900991 2 John T Smith 123 Main Street 703 111-2000 DL: 009900991 New Best Practice: FIXED IN REAL-TIME (not end of month) 58 © 2012 IBM Corporation
  • 59. Customer Facing Systems Fraud Data Mining Sensemaking This System That System Back-of-House Accounting Systems 59 © 2012 IBM Corporation