SlideShare ist ein Scribd-Unternehmen logo
1 von 46
Washington DC, November 2011
George Roth, Adonis Damian
www.recognos.com
 A document management system (DMS) is a computer system (or
  set of computer programs) used to track and store electronic
  documents and/or images of paper documents. It is usually also
  capable of keeping track of the different versions created by different
  users (history tracking). The term has some overlap with the concepts
  of content management systems. It is often viewed as a component
  of enterprise content management (ECM) systems and related to
  digital asset management, document imaging, workflow systems and
  records management systems.
 Make the formatted equivalent with non-formatted !




November 2011
CLASSICAL           NEW
   Metadata           Compliance
   Integration        Accessibility
   Capture            Interactivity
   Indexing           Augmentation
   Storage            Translation
   Retrieval          Linking – Relationships
   Distribution       Sentiment Analysis
   Security           New Search (Semantic Tagging, Deep
   Workflow            Search, NL Questions)
   Collaboration
   Versioning
   Search
   Publishing
   …




November 2011
   Volume
   Labor extensive
   The “research project” – 40% – 60% data
    gathering
   Metadata independent of content
   Shallow Search
   Hard to understand by non-experts


November 2011
   NLP Natural Language Processing –
    understand the meaning of documents
    (statistic, machine learning, hybrid, graph
    based)
   Semantic Search – tagging
   Data Integration
   Sentiment Analysis
   Linked Open Data – Linked Data
   Inference - Reasoning

November 2011
   Inside – Controlled Environment - TRUST
   Inside – Security issues
   Same techniques as outside the enterprise
   Integrates non-formatted with formatted
    data
   Easy to measure the effects - ROI
   Add on to the existing KM models
   Emerging area – Semantic technologies
    started on the www
November 2011
New features will become commodity in 2-3 years

   Compliance
   Data Extraction, Comparison, Change
    Analysis
   Interactivity
   Augmentation
   Translation
   Linking – Relationships
   Sentiment Analysis
   New Search (Semantic Tagging, Deep Search,
    NL Questions)
November 2011
   Microsoft: Powerset (Bing), Fast Search, Jinni
   Google: Freebase, Needlebase
   Apple: SIRI
   Etc…




November 2011
 Embedded Compliance Rules




November 2011
 Example there is a rule: – email –
Rule 0134C: “Not allowed to mention a percentage as a
  profit promise investing with the firm”
 In an email:
“ Dear John, Our company has an amazing method to
  invest, so that you will make at least 10% profit in 3
  months !!!! “
 The email was stopped – sent to Compliance with the
  message: “Violation of the Rule 0134C”



November 2011
   MFIP data extraction
   Link to the original document




November 2011
 Data Extraction, Comparison,
    Change Analysis



November 2011
November 2011
November 2011
   Create Alarm when Trading Policy Changes
   Create Alarm when Commissions Change
    (fields)
   Create Alarms when member of the Board
    Changes




November 2011
 Interactivity




November 2011
November 2011
 Augmentation




November 2011
November 2011
 Automated Translation




November 2011
   Google Translate
     Great for simple translation – emails, non
        technical documents

   Language Weaver
     Specialized translation through machine learning
     Train the system per domains



November 2011
 Sentiment Analysis




November 2011
   Media Sentry
   Open Amplify, Expert Systems, Lymbix
   NLP and machine learning




November 2011
November 2011
 Search




November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
 Complex App Samples




November 2011
November 2011
WWW

                 Google        Meltwaters                                            Forums /
                                               Twitter           Facebook                                Websites
                 Alerts          Alerts                                               Blogs




                           Exchange
                              Server



                                                         External Data Pull


                          Exchange                 Twitter          Facebook              80legs                 Diffbot
                            Adapter               Adapter             Adapter            Adapter                Adapter




                Internal Message Storage

                                        File
                                      Server


                                                                      Natural Language Processing


                                                                                                     Uploaded
                                                                                ESSEX               Taxonomy




                Web User Interface
                                                                                Data Storage


                                                                                   MS SQL Server




November 2011
   Amdocs AIDA (AMDOCS Intelligent Decision Automation)




November 2011
November 2011
Display Linked Data   Ask a question –   Entity Lookup
                       semantic search

November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
   Interactive - Exists
   Search – Semantic Search, Q&A
   Semantic Tagging – Summarization
   LOD with domains
   Linked : People, Companies, Locations,
    Specific Terms
   Example a travel book


November 2011
The following technologies were used:
- iQser – GIN
- Clark & Parsia – Spanner, StarDog
- Expert System – NLP
- GATE
- Smart Logic – Enterprise Query Platform – Fast Search – Microsoft
  Sharepoint 11
- Revelytix
- Cognition
- Franz Systems
- DiffBot
- Ontotext




November 2011
George Roth
President and CEO Recognos Inc.
San Francisco
www.recognos.com
groth@recognos.com
Drew Warren
CEO Recognos Financial
New York
dwarren@recognosfinancial.com
www.recognosfinancial.com



November 2011

Weitere ähnliche Inhalte

Ähnlich wie Semantic Technology in Document Management

SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011
Shakir Majeed Khan
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar Presentation
Kim Negaard
 

Ähnlich wie Semantic Technology in Document Management (20)

Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
 
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
 
Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management
 
SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011
 
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagSPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverContent is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
 
SharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David StrockSharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David Strock
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101
 
SharePoint & ERM
SharePoint & ERMSharePoint & ERM
SharePoint & ERM
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
 
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel OlesonSharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Sp tech con-admin101
Sp tech con-admin101Sp tech con-admin101
Sp tech con-admin101
 
SharePoint 2010- Changing business needs
SharePoint 2010- Changing business needsSharePoint 2010- Changing business needs
SharePoint 2010- Changing business needs
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar Presentation
 
Asap session 1
Asap session 1Asap session 1
Asap session 1
 
Productie Sharepoint Presentatie
Productie Sharepoint PresentatieProductie Sharepoint Presentatie
Productie Sharepoint Presentatie
 
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupDriving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
 

Kürzlich hochgeladen

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Semantic Technology in Document Management

  • 1. Washington DC, November 2011 George Roth, Adonis Damian www.recognos.com
  • 2.  A document management system (DMS) is a computer system (or set of computer programs) used to track and store electronic documents and/or images of paper documents. It is usually also capable of keeping track of the different versions created by different users (history tracking). The term has some overlap with the concepts of content management systems. It is often viewed as a component of enterprise content management (ECM) systems and related to digital asset management, document imaging, workflow systems and records management systems.  Make the formatted equivalent with non-formatted ! November 2011
  • 3. CLASSICAL NEW  Metadata  Compliance  Integration  Accessibility  Capture  Interactivity  Indexing  Augmentation  Storage  Translation  Retrieval  Linking – Relationships  Distribution  Sentiment Analysis  Security  New Search (Semantic Tagging, Deep  Workflow Search, NL Questions)  Collaboration  Versioning  Search  Publishing  … November 2011
  • 4. Volume  Labor extensive  The “research project” – 40% – 60% data gathering  Metadata independent of content  Shallow Search  Hard to understand by non-experts November 2011
  • 5. NLP Natural Language Processing – understand the meaning of documents (statistic, machine learning, hybrid, graph based)  Semantic Search – tagging  Data Integration  Sentiment Analysis  Linked Open Data – Linked Data  Inference - Reasoning November 2011
  • 6. Inside – Controlled Environment - TRUST  Inside – Security issues  Same techniques as outside the enterprise  Integrates non-formatted with formatted data  Easy to measure the effects - ROI  Add on to the existing KM models  Emerging area – Semantic technologies started on the www November 2011
  • 7. New features will become commodity in 2-3 years  Compliance  Data Extraction, Comparison, Change Analysis  Interactivity  Augmentation  Translation  Linking – Relationships  Sentiment Analysis  New Search (Semantic Tagging, Deep Search, NL Questions) November 2011
  • 8. Microsoft: Powerset (Bing), Fast Search, Jinni  Google: Freebase, Needlebase  Apple: SIRI  Etc… November 2011
  • 9.  Embedded Compliance Rules November 2011
  • 10.  Example there is a rule: – email – Rule 0134C: “Not allowed to mention a percentage as a profit promise investing with the firm”  In an email: “ Dear John, Our company has an amazing method to invest, so that you will make at least 10% profit in 3 months !!!! “ The email was stopped – sent to Compliance with the message: “Violation of the Rule 0134C” November 2011
  • 11. MFIP data extraction  Link to the original document November 2011
  • 12.  Data Extraction, Comparison, Change Analysis November 2011
  • 15. Create Alarm when Trading Policy Changes  Create Alarm when Commissions Change (fields)  Create Alarms when member of the Board Changes November 2011
  • 21. Google Translate  Great for simple translation – emails, non technical documents  Language Weaver  Specialized translation through machine learning  Train the system per domains November 2011
  • 23. Media Sentry  Open Amplify, Expert Systems, Lymbix  NLP and machine learning November 2011
  • 32.  Complex App Samples November 2011
  • 34. WWW Google Meltwaters Forums / Twitter Facebook Websites Alerts Alerts Blogs Exchange Server External Data Pull Exchange Twitter Facebook 80legs Diffbot Adapter Adapter Adapter Adapter Adapter Internal Message Storage File Server Natural Language Processing Uploaded ESSEX Taxonomy Web User Interface Data Storage MS SQL Server November 2011
  • 35. Amdocs AIDA (AMDOCS Intelligent Decision Automation) November 2011
  • 37. Display Linked Data Ask a question – Entity Lookup semantic search November 2011
  • 44. Interactive - Exists  Search – Semantic Search, Q&A  Semantic Tagging – Summarization  LOD with domains  Linked : People, Companies, Locations, Specific Terms  Example a travel book November 2011
  • 45. The following technologies were used: - iQser – GIN - Clark & Parsia – Spanner, StarDog - Expert System – NLP - GATE - Smart Logic – Enterprise Query Platform – Fast Search – Microsoft Sharepoint 11 - Revelytix - Cognition - Franz Systems - DiffBot - Ontotext November 2011
  • 46. George Roth President and CEO Recognos Inc. San Francisco www.recognos.com groth@recognos.com Drew Warren CEO Recognos Financial New York dwarren@recognosfinancial.com www.recognosfinancial.com November 2011