SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Downloaden Sie, um offline zu lesen
LIGHTNING TALKS
Powered by Lucene:
IBM Content Analytics with Enterprise Search




Wolfgang Jung



Barcelona, 19th October 2011               © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search



Our agenda in the next 10 minutes
LIGHTNING TALKS
    IBM is commited to Open Source
     – Decade of contribution to the community.

    Adoption of Apache Lucene to IBM Content Analytics
    – The Why, What & examples.

    Demonstration of IBM Content Analytics
    – see the development results live.
               Be enlightened !

2                                                  © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search



IBM is commited to Open Source

    Decade of lineage and contributions to the open source community
      – Apache Hadoop.
          IBM‘s use of BigIndex for Search is mention in Chuck Lams‘s “Hadopp in Action”
      – Apache Derby
      – Apache Geronimo and Jetty
      – Eclipse: Founded by IBM, PMC Board of Directors
      – Apache UIMA: Unstructured Information Management Architecture.
          Developed by IBM, Contributed to Apache
      – Apache Jakarta: Lucene. PMC members
          Significant contributions via IBM Lucene Extension Library (ILEL)
      – Linux ... and more!


3                                                                                  © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search



Adoption of Apache Lucene
to IBM Content Analytics with Enterprise Search
    The use of UIMA is existing since first release in 2005 of IBM OmniFind and later
    IBM Content Analytics, continued into today‘s IBM Content Analytics with
    Enterprise Search
         http://www-01.ibm.com/software/data/content-management/analytics/uima.html


    IBM‘s decision for the use of Lucene
      –Index is a common technology and better to improve
      –lower cost of maintenance
      –advantage in incremental indexing
      –extensibility



4                                                                                     © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search



Adoption of Apache Lucene
to IBM Content Analytics with Enterprise Search
    IBM is a very active contributor. Look for PMC members:
      –Michael McCandless; Shai Erera; Doron Cohen
         http://lucene.apache.org/who.html

    IBM extended Lucene based on our needs. Two examples already
    contributed to community :
      –Query Parser
      –Facets




5                                                             © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search



Adoption of Apache Lucene
to IBM Content Analytics with Enterprise Search
    On 13th December 2006, IBM and Yahoo! announced IBM OmniFind Yahoo! Edition, as
    “no-cost, entry level enterprise search product developed to help eliminate financial and
    technology barriers to intranet and Web search.”
         http://www-03.ibm.com/press/us/en/pressrelease/20767.wss

    This technology included Lucene as index technology and had full support by IBM
      – 45,000+ downloads from the website http://omnifind.ibm.yahoo.net
      – IBM support contracts for clients with “IBM Elite Support for OmniFind Yahoo Edition“
      – Below 15 incidents regarding index technology


    Technology is seen as success for IBM




6                                                                                               © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search


Content Analytics generates new insights and aggregates key
findings gathered from large data volumes in a visualized form

                                                          Extracted Concept
                                                        Claimant: Soft Tissue Injury
                                                                                                     Automatic
                                                                                                     Visualizing
                                               Person    Injury    Body Part      Location     Results of concept evaluation
                                                                                                are displayed to the users
                                               Noun      Verb     Noun Phrase    Prep Phrase

                                               Claus sprained his ankle on the step




                                               Analysed documents
                                                 with identified concepts


       Sources of Information
       Internal (ECM, Files, DBMS, etc.)
        and External (Social, News, etc.)




7                                                                                                           © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search




Rapid Insights from Automotive Complaints

    We will be using publically available data from the National Highway Traffic Safety Agency (NHTSA)
    to demonstrate how IBM Content Analytics can be used to identify problems with automobiles.
    NHTSA receives various reports about malfunctions, accidents, and other issues with automobiles
    from dealerships, repair facilities, and from the general public. NHTSA publishes the data at
    http://www.nhtsa.gov. For this demo we have created a collection from the NHTSA “complaints”
    data spanning several years ending in early 2010. We will show how this and similar data can be
    analyzed to arrive at rapid insights not possible by manually reading through the complaint records.




8                                                                                             © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search



See Content Analytics live!




9                                              © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search



See Content Analytics live!




10                                             © 2011 IBM Corporation
IBM Content Analytics with Enterprise Search




                                               Be enlightened !



11                                                                © 2011 IBM Corporation
LIGHTNING TALKS
Powered by Lucene:
IBM Content Analytics with Enterprise Search




Wolfgang Jung



Barcelona, 19th October 2011                   © 2011 IBM Corporation

Weitere Àhnliche Inhalte

Was ist angesagt?

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
Adrian Turcu
 
Watson and Analytics
Watson and AnalyticsWatson and Analytics
Watson and Analytics
Jorge W. Hago
 
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Data Con LA
 

Was ist angesagt? (20)

Ml, AI and IBM Watson - 101 for Business
Ml, AI  and IBM Watson - 101 for BusinessMl, AI  and IBM Watson - 101 for Business
Ml, AI and IBM Watson - 101 for Business
 
IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
 
IBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM WatsonIBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM Watson
 
Watson and Analytics
Watson and AnalyticsWatson and Analytics
Watson and Analytics
 
Ibm big data-platform
Ibm big data-platformIbm big data-platform
Ibm big data-platform
 
What Watson Explorer is and How it works
What Watson Explorer is and How it worksWhat Watson Explorer is and How it works
What Watson Explorer is and How it works
 
Oltre l’intelligenza Artificiale: agire alla velocità del pensiero
Oltre l’intelligenza Artificiale: agire alla velocità del pensieroOltre l’intelligenza Artificiale: agire alla velocità del pensiero
Oltre l’intelligenza Artificiale: agire alla velocità del pensiero
 
Watson AI platform for business - IBM Cloud
Watson AI platform for business - IBM CloudWatson AI platform for business - IBM Cloud
Watson AI platform for business - IBM Cloud
 
IBM Watson
IBM Watson IBM Watson
IBM Watson
 
Building Bots Using IBM Watson
Building Bots Using IBM WatsonBuilding Bots Using IBM Watson
Building Bots Using IBM Watson
 
Using Watson to build Cognitive IoT Apps on Bluemix
Using Watson to build Cognitive IoT Apps on BluemixUsing Watson to build Cognitive IoT Apps on Bluemix
Using Watson to build Cognitive IoT Apps on Bluemix
 
IBM Watson Explorer: Explore, analyze and interpret information for better bu...
IBM Watson Explorer: Explore, analyze and interpret information for better bu...IBM Watson Explorer: Explore, analyze and interpret information for better bu...
IBM Watson Explorer: Explore, analyze and interpret information for better bu...
 
AI future 2025 - IBM Watson Re
AI future 2025  - IBM Watson ReAI future 2025  - IBM Watson Re
AI future 2025 - IBM Watson Re
 
An AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven OrganizationAn AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven Organization
 
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
 
Big Data and Analytics: The IBM Perspective
Big Data and Analytics: The IBM PerspectiveBig Data and Analytics: The IBM Perspective
Big Data and Analytics: The IBM Perspective
 
Webinar - Comparative Analysis of Cloud based Machine Learning Platforms
Webinar - Comparative Analysis of Cloud based Machine Learning PlatformsWebinar - Comparative Analysis of Cloud based Machine Learning Platforms
Webinar - Comparative Analysis of Cloud based Machine Learning Platforms
 
Master the art of Data Science
Master the art of Data ScienceMaster the art of Data Science
Master the art of Data Science
 
Libera la potenza del Machine Learning
Libera la potenza del Machine LearningLibera la potenza del Machine Learning
Libera la potenza del Machine Learning
 
InTTrust -IBM Artificial Intelligence Event
InTTrust -IBM Artificial Intelligence  EventInTTrust -IBM Artificial Intelligence  Event
InTTrust -IBM Artificial Intelligence Event
 

Ähnlich wie Lightning talk :IBM Content Analytics with Enterprise Search - Wolfgang Jung

Flex 4.5 and mobile development
Flex 4.5 and mobile developmentFlex 4.5 and mobile development
Flex 4.5 and mobile development
Michael Chaize
 
Employ the Cloud for Efficient Content Analytics - 10 november 2011
Employ the Cloud for Efficient Content Analytics - 10 november 2011Employ the Cloud for Efficient Content Analytics - 10 november 2011
Employ the Cloud for Efficient Content Analytics - 10 november 2011
Samir Batla
 
Mariana Alupului Inventions
Mariana Alupului InventionsMariana Alupului Inventions
Mariana Alupului Inventions
malupului
 
Adobe flash platform java
Adobe flash platform javaAdobe flash platform java
Adobe flash platform java
Ch'ti JUG
 
Native extensions webinar
Native extensions webinarNative extensions webinar
Native extensions webinar
immanuelnoel
 

Ähnlich wie Lightning talk :IBM Content Analytics with Enterprise Search - Wolfgang Jung (20)

Smw+ semantic enterprise wiki en_153
Smw+ semantic enterprise wiki en_153Smw+ semantic enterprise wiki en_153
Smw+ semantic enterprise wiki en_153
 
"IBMs Open Source Strategy" by Adam Jollans @ eLiberatica 2009
"IBMs Open Source Strategy" by Adam Jollans @ eLiberatica 2009"IBMs Open Source Strategy" by Adam Jollans @ eLiberatica 2009
"IBMs Open Source Strategy" by Adam Jollans @ eLiberatica 2009
 
Smw+tutorial berlin-fall-2011
Smw+tutorial berlin-fall-2011Smw+tutorial berlin-fall-2011
Smw+tutorial berlin-fall-2011
 
Flex 4.5 and mobile development
Flex 4.5 and mobile developmentFlex 4.5 and mobile development
Flex 4.5 and mobile development
 
Deploying Enterprise Search in PLM Context with Aras
Deploying Enterprise Search in PLM Context with ArasDeploying Enterprise Search in PLM Context with Aras
Deploying Enterprise Search in PLM Context with Aras
 
Employ the Cloud for Efficient Content Analytics - 10 november 2011
Employ the Cloud for Efficient Content Analytics - 10 november 2011Employ the Cloud for Efficient Content Analytics - 10 november 2011
Employ the Cloud for Efficient Content Analytics - 10 november 2011
 
Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...
 
Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...
Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...
Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...
 
Breizh camp adobe flex et les mobiles
Breizh camp   adobe flex et les mobilesBreizh camp   adobe flex et les mobiles
Breizh camp adobe flex et les mobiles
 
Splunk in 60 Minutes | Splunk Tutorial For Beginners | Splunk Training | Splu...
Splunk in 60 Minutes | Splunk Tutorial For Beginners | Splunk Training | Splu...Splunk in 60 Minutes | Splunk Tutorial For Beginners | Splunk Training | Splu...
Splunk in 60 Minutes | Splunk Tutorial For Beginners | Splunk Training | Splu...
 
Open source, commercial or a co-existance strategy
Open source, commercial or a co-existance strategyOpen source, commercial or a co-existance strategy
Open source, commercial or a co-existance strategy
 
Starting mobile development
Starting mobile developmentStarting mobile development
Starting mobile development
 
Mariana Alupului Inventions
Mariana Alupului InventionsMariana Alupului Inventions
Mariana Alupului Inventions
 
Adobe flash platform java
Adobe flash platform javaAdobe flash platform java
Adobe flash platform java
 
Adobe flash platform java
Adobe flash platform javaAdobe flash platform java
Adobe flash platform java
 
Native extensions webinar
Native extensions webinarNative extensions webinar
Native extensions webinar
 
Jax2001 adobe keynote
Jax2001 adobe keynoteJax2001 adobe keynote
Jax2001 adobe keynote
 
The IBM Rational Insight Reporting Solution
The IBM Rational Insight Reporting SolutionThe IBM Rational Insight Reporting Solution
The IBM Rational Insight Reporting Solution
 
Convergence of mobility, analytics, social and cloud to drive innovation
Convergence of mobility, analytics, social and cloud to drive innovationConvergence of mobility, analytics, social and cloud to drive innovation
Convergence of mobility, analytics, social and cloud to drive innovation
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest Graph
 

Mehr von lucenerevolution

Enhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic searchEnhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic search
lucenerevolution
 
Shrinking the haystack wes caldwell - final
Shrinking the haystack   wes caldwell - finalShrinking the haystack   wes caldwell - final
Shrinking the haystack wes caldwell - final
lucenerevolution
 

Mehr von lucenerevolution (20)

Text Classification Powered by Apache Mahout and Lucene
Text Classification Powered by Apache Mahout and LuceneText Classification Powered by Apache Mahout and Lucene
Text Classification Powered by Apache Mahout and Lucene
 
State of the Art Logging. Kibana4Solr is Here!
State of the Art Logging. Kibana4Solr is Here! State of the Art Logging. Kibana4Solr is Here!
State of the Art Logging. Kibana4Solr is Here!
 
Search at Twitter
Search at TwitterSearch at Twitter
Search at Twitter
 
Building Client-side Search Applications with Solr
Building Client-side Search Applications with SolrBuilding Client-side Search Applications with Solr
Building Client-side Search Applications with Solr
 
Integrate Solr with real-time stream processing applications
Integrate Solr with real-time stream processing applicationsIntegrate Solr with real-time stream processing applications
Integrate Solr with real-time stream processing applications
 
Scaling Solr with SolrCloud
Scaling Solr with SolrCloudScaling Solr with SolrCloud
Scaling Solr with SolrCloud
 
Administering and Monitoring SolrCloud Clusters
Administering and Monitoring SolrCloud ClustersAdministering and Monitoring SolrCloud Clusters
Administering and Monitoring SolrCloud Clusters
 
Implementing a Custom Search Syntax using Solr, Lucene, and Parboiled
Implementing a Custom Search Syntax using Solr, Lucene, and ParboiledImplementing a Custom Search Syntax using Solr, Lucene, and Parboiled
Implementing a Custom Search Syntax using Solr, Lucene, and Parboiled
 
Using Solr to Search and Analyze Logs
Using Solr to Search and Analyze Logs Using Solr to Search and Analyze Logs
Using Solr to Search and Analyze Logs
 
Enhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic searchEnhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic search
 
Real-time Inverted Search in the Cloud Using Lucene and Storm
Real-time Inverted Search in the Cloud Using Lucene and StormReal-time Inverted Search in the Cloud Using Lucene and Storm
Real-time Inverted Search in the Cloud Using Lucene and Storm
 
Solr's Admin UI - Where does the data come from?
Solr's Admin UI - Where does the data come from?Solr's Admin UI - Where does the data come from?
Solr's Admin UI - Where does the data come from?
 
Schemaless Solr and the Solr Schema REST API
Schemaless Solr and the Solr Schema REST APISchemaless Solr and the Solr Schema REST API
Schemaless Solr and the Solr Schema REST API
 
High Performance JSON Search and Relational Faceted Browsing with Lucene
High Performance JSON Search and Relational Faceted Browsing with LuceneHigh Performance JSON Search and Relational Faceted Browsing with Lucene
High Performance JSON Search and Relational Faceted Browsing with Lucene
 
Text Classification with Lucene/Solr, Apache Hadoop and LibSVM
Text Classification with Lucene/Solr, Apache Hadoop and LibSVMText Classification with Lucene/Solr, Apache Hadoop and LibSVM
Text Classification with Lucene/Solr, Apache Hadoop and LibSVM
 
Faceted Search with Lucene
Faceted Search with LuceneFaceted Search with Lucene
Faceted Search with Lucene
 
Recent Additions to Lucene Arsenal
Recent Additions to Lucene ArsenalRecent Additions to Lucene Arsenal
Recent Additions to Lucene Arsenal
 
Turning search upside down
Turning search upside downTurning search upside down
Turning search upside down
 
Spellchecking in Trovit: Implementing a Contextual Multi-language Spellchecke...
Spellchecking in Trovit: Implementing a Contextual Multi-language Spellchecke...Spellchecking in Trovit: Implementing a Contextual Multi-language Spellchecke...
Spellchecking in Trovit: Implementing a Contextual Multi-language Spellchecke...
 
Shrinking the haystack wes caldwell - final
Shrinking the haystack   wes caldwell - finalShrinking the haystack   wes caldwell - final
Shrinking the haystack wes caldwell - final
 

KĂŒrzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
Christopher Logan Kennedy
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

KĂŒrzlich hochgeladen (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 

Lightning talk :IBM Content Analytics with Enterprise Search - Wolfgang Jung

  • 1. LIGHTNING TALKS Powered by Lucene: IBM Content Analytics with Enterprise Search Wolfgang Jung Barcelona, 19th October 2011 © 2011 IBM Corporation
  • 2. IBM Content Analytics with Enterprise Search Our agenda in the next 10 minutes LIGHTNING TALKS IBM is commited to Open Source – Decade of contribution to the community. Adoption of Apache Lucene to IBM Content Analytics – The Why, What & examples. Demonstration of IBM Content Analytics – see the development results live. Be enlightened ! 2 © 2011 IBM Corporation
  • 3. IBM Content Analytics with Enterprise Search IBM is commited to Open Source Decade of lineage and contributions to the open source community – Apache Hadoop. IBM‘s use of BigIndex for Search is mention in Chuck Lams‘s “Hadopp in Action” – Apache Derby – Apache Geronimo and Jetty – Eclipse: Founded by IBM, PMC Board of Directors – Apache UIMA: Unstructured Information Management Architecture. Developed by IBM, Contributed to Apache – Apache Jakarta: Lucene. PMC members Significant contributions via IBM Lucene Extension Library (ILEL) – Linux ... and more! 3 © 2011 IBM Corporation
  • 4. IBM Content Analytics with Enterprise Search Adoption of Apache Lucene to IBM Content Analytics with Enterprise Search The use of UIMA is existing since first release in 2005 of IBM OmniFind and later IBM Content Analytics, continued into today‘s IBM Content Analytics with Enterprise Search http://www-01.ibm.com/software/data/content-management/analytics/uima.html IBM‘s decision for the use of Lucene –Index is a common technology and better to improve –lower cost of maintenance –advantage in incremental indexing –extensibility 4 © 2011 IBM Corporation
  • 5. IBM Content Analytics with Enterprise Search Adoption of Apache Lucene to IBM Content Analytics with Enterprise Search IBM is a very active contributor. Look for PMC members: –Michael McCandless; Shai Erera; Doron Cohen http://lucene.apache.org/who.html IBM extended Lucene based on our needs. Two examples already contributed to community : –Query Parser –Facets 5 © 2011 IBM Corporation
  • 6. IBM Content Analytics with Enterprise Search Adoption of Apache Lucene to IBM Content Analytics with Enterprise Search On 13th December 2006, IBM and Yahoo! announced IBM OmniFind Yahoo! Edition, as “no-cost, entry level enterprise search product developed to help eliminate financial and technology barriers to intranet and Web search.” http://www-03.ibm.com/press/us/en/pressrelease/20767.wss This technology included Lucene as index technology and had full support by IBM – 45,000+ downloads from the website http://omnifind.ibm.yahoo.net – IBM support contracts for clients with “IBM Elite Support for OmniFind Yahoo Edition“ – Below 15 incidents regarding index technology Technology is seen as success for IBM 6 © 2011 IBM Corporation
  • 7. IBM Content Analytics with Enterprise Search Content Analytics generates new insights and aggregates key findings gathered from large data volumes in a visualized form Extracted Concept Claimant: Soft Tissue Injury Automatic Visualizing Person Injury Body Part Location Results of concept evaluation are displayed to the users Noun Verb Noun Phrase Prep Phrase Claus sprained his ankle on the step Analysed documents with identified concepts Sources of Information Internal (ECM, Files, DBMS, etc.) and External (Social, News, etc.) 7 © 2011 IBM Corporation
  • 8. IBM Content Analytics with Enterprise Search Rapid Insights from Automotive Complaints We will be using publically available data from the National Highway Traffic Safety Agency (NHTSA) to demonstrate how IBM Content Analytics can be used to identify problems with automobiles. NHTSA receives various reports about malfunctions, accidents, and other issues with automobiles from dealerships, repair facilities, and from the general public. NHTSA publishes the data at http://www.nhtsa.gov. For this demo we have created a collection from the NHTSA “complaints” data spanning several years ending in early 2010. We will show how this and similar data can be analyzed to arrive at rapid insights not possible by manually reading through the complaint records. 8 © 2011 IBM Corporation
  • 9. IBM Content Analytics with Enterprise Search See Content Analytics live! 9 © 2011 IBM Corporation
  • 10. IBM Content Analytics with Enterprise Search See Content Analytics live! 10 © 2011 IBM Corporation
  • 11. IBM Content Analytics with Enterprise Search Be enlightened ! 11 © 2011 IBM Corporation
  • 12. LIGHTNING TALKS Powered by Lucene: IBM Content Analytics with Enterprise Search Wolfgang Jung Barcelona, 19th October 2011 © 2011 IBM Corporation