SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Data Tactics




             Unified DataSpace
WWW.DATA–TACTICS.COM   © 2012 Data Tactics   ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Cloud




WWW.DATA–TACTICS.COM   © 2012 Data Tactics   ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Systems Engineering & Integration

    SYSTEMS ENGINEERING
    •   Data Ingestion Frameworks (structured, unstructured, semi-structured)
    •   Semantic DataSpace Enrichment                          SYSTEM INTEGRATION
    •   Cloud Management Systems (CMS)                         •   Ingestion
    •   Cloudbase/Accumulo                                          – Generalized Ingest /
         – Pig (Big Data) Plug-in                                     NiagraFiles
    •   Dissemination and Reporting Tools                      •   Geospatial Capabilities
    •   Data Mining, Exploitation, and Correlation Tools       •   Biometric Capabilities




WWW.DATA–TACTICS.COM        © 2012 Data Tactics   ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Cloud Experience
 17 Enclaves at SECRET//NOFORN                  4 Enclaves for NATO ISAF
      • 3 in Tyson’s                                • 2 in Afghanistan
      • 1 at GISA, Ft. Bragg                       • 1 at GISA, Fort Bragg
      • 2 in Hawaii                                • 1 in Germany
      • 2 in Germany                            US BICES Cloud in Germany
      • 7 at Aberdeen                           Over a dozen at UNCLASS//FOUO
      • 2 in Afghanistan                           • Supporting real-world missions on
 6 Enclaves at TS//SCI                             contract
      • AF TENCAP                                  • At various levels of complexity
      • NRL
      • DARPA
      • INSCOM
      • DCGS-A
      • DHS OI&A

                                                Cloud Domains is where we live
                                                   Data, is the Hard Problem

WWW.DATA–TACTICS.COM      © 2012 Data Tactics     ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Data – The Hard Part




WWW.DATA–TACTICS.COM   © 2012 Data Tactics   ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
BigData Architecture
    Data Tactics has delivered solutions that manage PETABYTES of data
    and provide mission relevant analytics, metrics and user interfaces
    •   DESIGN, DEVELOPMENT AND INTEGRATION OF REFERENCE ARCHITECTURES
         –   Ghost Machine
         –   Stratus
    •   SECURE DATABASE ARCHITECTURES
         –   Secure Entity Database (SED)
         –   Defense Cross-Domain Analytic Capability (DCAC)
    •   DATA MIGRATION, EXTRACTION, TRANSFORM AND PARSING
    •   FEDERATED DATA MANAGEMENT
         –   Federated Search, Multi-Source / Multi-Vendor Integration
         –   Storage Cluster Management
    •   DATA MINING AND FORENSIC ANALYSIS
    •   SPATIAL, MULTI-DOMAIN, AND CLOUD DATA SERVICES



WWW.DATA–TACTICS.COM          © 2012 Data Tactics     ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Unified DataSpace
  The Wild
  • Data sources
  with rich data &                      Segment 3 - Model Description
  semantic context
  locked in domain                        Data
                                                                           Rich semantic
  silos                                   Models
                                                                           context

  • Data tightly
    coupled to
    data-models
  • Data-models
                                         Segment 2 - Data Description
    tightly coupled to                   Structured
                                                                                Integration
                                                                                Enrichment
    storage models                       Data                                   Exploitation
                                                                                Exploration
  Silos isolated by                                                             Across all sources

  • Implementation                     Segment 1 - Artifact Description
   technology
  • Storage structure                   Unstructured                      Rich data
  • Data                                Data                              context
   representation
  • Data modality

WWW.DATA–TACTICS.COM     © 2012 Data Tactics       ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Unified DataSpace
                                    High-Level Conceptual Model of the DataSpace
                                             and Ingest/Extraction Flows


                                                         Segment 3 - Model Semantics
                                                                               .
                                         2                                     .
                                                                                                            2
                  CONCEPT                    CONCEPT_ASSOCIATION                       PREDICATE                        PREDICATE_ASSOCIATION
                                                                       .
                                                                           .
                                                                               .

                                                 Uses                                      Uses




                      Segment 1 - Artifact Semantics                                            Segment 2 - Data Semantics                                   Semantics
             SOURCE                                                                                            .                                                +
                                         2                                                          .        2       .                                       Metadata

                      ARTIFACT               ARTIFACT_ASSOCIATION                        TERM                   .            STATEMENT
                                     .                                                                              .
                                             .                                                                          .
                                                                                                                                                 Data
                                                                    Uses                                                                           +
 Metadata                                                                                                                                       Metadata




                                                            Segment 0 - Artifacts

    Ingest                                                                                                                                             Extraction




WWW.DATA–TACTICS.COM                     © 2012 Data Tactics                        ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Unified DataSpace
  •Segment 0 is an artifact store (i.e., binary
  representation of artifacts).                                                          High-Level Conceptual Model of the DataSpace
                                                                                                  and Ingest/Extraction Flows


                                                                                                              Segment 3 - Model Semantics




  •Segment 1 represents artifact semantics
                                                                                                                                    .
                                                                                              2                                     .
                                                                                                                                                                 2
                                                                       CONCEPT                    CONCEPT_ASSOCIATION                       PREDICATE                        PREDICATE_ASSOCIATION
                                                                                                                            .
                                                                                                                                .
                                                                                                                                    .



  and includes artifact metadata and                                                                  Uses                                      Uses




  associations between the artifacts. Indexing                             Segment 1 - Artifact Semantics                                            Segment 2 - Data Semantics                                   Semantics




  of Segment 1 supports search on text
                                                                  SOURCE                                                                                            .                                                +
                                                                                              2                                                          .        2       .                                       Metadata

                                                                           ARTIFACT               ARTIFACT_ASSOCIATION                        TERM                   .            STATEMENT
                                                                                          .                                                                              .
                                                                                                  .



  content, geospatial, and artifact meta data.
                                                                                                                                                                             .
                                                                                                                                                                                                      Data
                                                                                                                         Uses                                                                           +
                                                      Metadata                                                                                                                                       Metadata




                                                                                                                 Segment 0 - Artifacts




  •Segment 2 represents data and semantics
                                                         Ingest                                                                                                                                             Extraction




  of structured data elements extracted from
  artifacts. Indexing of Segment 2 supports
  search on properties of entities (e.g., Person,
  Location) based on their properties and
  relationships.

  •Segment      3   represents   data-models
  extracted from artifacts and models used for
  aligning, disambiguating, and enriching the
  elements of Segments 1 and 2.

WWW.DATA–TACTICS.COM        © 2012 Data Tactics     ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Data Description Framework
  •   DDF – looks at data in the following ways
       – Mention: A chunk of data, either physically located within a tangible artifact,
         or contained within an analyst’s mind
           • “Washington” at offset x in file Y
       – Sign: A representation of all disambiguated mentions that are identical except
         for their indexicality
           • E.g., “Washington”
       – Concept: An abstract idea, defined explicitly or implicitly by a source data-
         model
           • E.g., City, Person, Name, Address, Photo
       – Predicate: An abstract idea used to express a relationship between “things”
           • E.g., isCity, isPerson, hasName, hasAddress, hasPhoto
       – Term: A disambiguated sign abstracted from the source artifact or asserting
         analyst
           • E.g., Washington Person; Washington Location
       – Statement: Encodes a binary relationship between a subject (term) and an
         object mediated by a predicate
           • E.g.,[Washington, Person] hasPhoto [GeorgeWashingtonImage.jpg]


WWW.DATA–TACTICS.COM          © 2012 Data Tactics       ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Unified DataSpace




WWW.DATA–TACTICS.COM   © 2012 Data Tactics   ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Data Model Example




WWW.DATA–TACTICS.COM   © 2012 Data Tactics   ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
DataSpace Workbench




WWW.DATA–TACTICS.COM   © 2012 Data Tactics   ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
Elastic Data Ingest
                                        Java Messaging Service
       Artifact Processor          Persistence             Index Manager         Error Manager
             Queue                Manager Queue                Queue                 Queue




       Queue                 Artifact            Persistence           Index             Error
       Loader               Processor             Manager             Manager           Manager



                                                       UDS Components
                                                                                                      Lucene
        File
      System
                                                    Custom Components

                                                                                                      Hadoop DFS
                            Artifact Processor        Persistence
                                 Modules            Manager Modules
                                                                                                     BigTable




WWW.DATA–TACTICS.COM                    © 2012 Data Tactics             ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS

Weitere ähnliche Inhalte

Ähnlich wie Data Tactics Unified Dataspace Architecture and Description

Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2
David Linthicum
 
Data Mining
Data MiningData Mining
Data Mining
swami920
 
Mark logic Corporate Overview
Mark logic Corporate OverviewMark logic Corporate Overview
Mark logic Corporate Overview
Tony Agresta
 
TELUS Case Study: iVAULT implementation improved corporate intelligence
TELUS Case Study: iVAULT implementation improved corporate intelligence TELUS Case Study: iVAULT implementation improved corporate intelligence
TELUS Case Study: iVAULT implementation improved corporate intelligence
eventspat
 
Standardizing the Data Distribution Service (DDS) API for Modern C++
Standardizing the Data Distribution Service (DDS) API for Modern C++Standardizing the Data Distribution Service (DDS) API for Modern C++
Standardizing the Data Distribution Service (DDS) API for Modern C++
Sumant Tambe
 
Wallchart - Data Warehouse Documentation Roadmap
Wallchart - Data Warehouse Documentation RoadmapWallchart - Data Warehouse Documentation Roadmap
Wallchart - Data Warehouse Documentation Roadmap
David Walker
 
Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..
Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..
Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..
Odinot Stanislas
 

Ähnlich wie Data Tactics Unified Dataspace Architecture and Description (20)

Cncf kanister.pptx
Cncf kanister.pptxCncf kanister.pptx
Cncf kanister.pptx
 
TCUK 2012, Nolwenn Kerzreho, Metadata: Why Should Technical Communicators Care?
TCUK 2012, Nolwenn Kerzreho, Metadata: Why Should Technical Communicators Care?TCUK 2012, Nolwenn Kerzreho, Metadata: Why Should Technical Communicators Care?
TCUK 2012, Nolwenn Kerzreho, Metadata: Why Should Technical Communicators Care?
 
Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2
 
Data Mining
Data MiningData Mining
Data Mining
 
Mark logic Corporate Overview
Mark logic Corporate OverviewMark logic Corporate Overview
Mark logic Corporate Overview
 
Web 2.0 And The End Of DITA
Web 2.0 And The End Of DITAWeb 2.0 And The End Of DITA
Web 2.0 And The End Of DITA
 
TELUS Case Study: GIS for Telecommunications
TELUS Case Study: GIS for TelecommunicationsTELUS Case Study: GIS for Telecommunications
TELUS Case Study: GIS for Telecommunications
 
TELUS Case Study: iVAULT implementation improved corporate intelligence
TELUS Case Study: iVAULT implementation improved corporate intelligence TELUS Case Study: iVAULT implementation improved corporate intelligence
TELUS Case Study: iVAULT implementation improved corporate intelligence
 
Standardizing the Data Distribution Service (DDS) API for Modern C++
Standardizing the Data Distribution Service (DDS) API for Modern C++Standardizing the Data Distribution Service (DDS) API for Modern C++
Standardizing the Data Distribution Service (DDS) API for Modern C++
 
STI Summit 2011 - A personal look at the future of Semantic Technologies
STI Summit 2011 - A personal look at the future of Semantic TechnologiesSTI Summit 2011 - A personal look at the future of Semantic Technologies
STI Summit 2011 - A personal look at the future of Semantic Technologies
 
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
 
Learning from google megastore (Part-1)
Learning from google megastore (Part-1)Learning from google megastore (Part-1)
Learning from google megastore (Part-1)
 
Wallchart - Data Warehouse Documentation Roadmap
Wallchart - Data Warehouse Documentation RoadmapWallchart - Data Warehouse Documentation Roadmap
Wallchart - Data Warehouse Documentation Roadmap
 
DDS vs DDS4CCM
DDS vs DDS4CCMDDS vs DDS4CCM
DDS vs DDS4CCM
 
Open Calais For SF And LA Meetups
Open Calais For SF And LA MeetupsOpen Calais For SF And LA Meetups
Open Calais For SF And LA Meetups
 
Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..
Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..
Protect Your Big Data with Intel<sup>®</sup> Xeon<sup>®</sup> Processors a..
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
Big data and cloud
Big data and cloudBig data and cloud
Big data and cloud
 
Cloud Networking: Network aspects of the cloud
Cloud Networking: Network aspects of the cloudCloud Networking: Network aspects of the cloud
Cloud Networking: Network aspects of the cloud
 
MEDIN Discovery Metadata Standard
MEDIN Discovery Metadata StandardMEDIN Discovery Metadata Standard
MEDIN Discovery Metadata Standard
 

Mehr von DataTactics

Big Data Conference
Big Data ConferenceBig Data Conference
Big Data Conference
DataTactics
 
Discontinuities Demo
Discontinuities DemoDiscontinuities Demo
Discontinuities Demo
DataTactics
 
Analytics Brownbag
Analytics Brownbag Analytics Brownbag
Analytics Brownbag
DataTactics
 
Big Data Taxonomy 8/26/2013
Big Data Taxonomy 8/26/2013Big Data Taxonomy 8/26/2013
Big Data Taxonomy 8/26/2013
DataTactics
 
Ontology and Reports
Ontology and ReportsOntology and Reports
Ontology and Reports
DataTactics
 
Data Tactics Semantic and Interoperability Summit Feb 12, 2013
Data Tactics Semantic and Interoperability Summit Feb 12, 2013Data Tactics Semantic and Interoperability Summit Feb 12, 2013
Data Tactics Semantic and Interoperability Summit Feb 12, 2013
DataTactics
 
Horizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence DataHorizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence Data
DataTactics
 
Bill Ontology Summit (08 feb 1400hrs) v2
Bill Ontology Summit (08 feb 1400hrs) v2Bill Ontology Summit (08 feb 1400hrs) v2
Bill Ontology Summit (08 feb 1400hrs) v2
DataTactics
 
DT Company Overview January 2013
DT Company Overview January 2013DT Company Overview January 2013
DT Company Overview January 2013
DataTactics
 
Capabilities Brief Analytics
Capabilities Brief AnalyticsCapabilities Brief Analytics
Capabilities Brief Analytics
DataTactics
 
Data Tactics dhs introduction to cloud technologies wtc
Data Tactics dhs introduction to cloud technologies wtcData Tactics dhs introduction to cloud technologies wtc
Data Tactics dhs introduction to cloud technologies wtc
DataTactics
 
Multi Discipline Intelligence Production Teams 1
Multi Discipline Intelligence Production Teams 1Multi Discipline Intelligence Production Teams 1
Multi Discipline Intelligence Production Teams 1
DataTactics
 
Data Tactics and Nervve Integrated Big Data v3
Data Tactics and Nervve Integrated Big Data v3Data Tactics and Nervve Integrated Big Data v3
Data Tactics and Nervve Integrated Big Data v3
DataTactics
 
Data Tactics Open Source Brief
Data Tactics Open Source BriefData Tactics Open Source Brief
Data Tactics Open Source Brief
DataTactics
 

Mehr von DataTactics (19)

NETWORK CENTRALITY IN SUB-NATIONAL AREAS OF INTEREST USING GDELT DATA
NETWORK CENTRALITY IN SUB-NATIONAL AREAS OF INTEREST USING GDELT DATANETWORK CENTRALITY IN SUB-NATIONAL AREAS OF INTEREST USING GDELT DATA
NETWORK CENTRALITY IN SUB-NATIONAL AREAS OF INTEREST USING GDELT DATA
 
C Star Analytic Presentation
C Star Analytic PresentationC Star Analytic Presentation
C Star Analytic Presentation
 
Text Analysis Using Twitter: A Case Study in Dhaka
Text Analysis Using Twitter: A Case Study in Dhaka Text Analysis Using Twitter: A Case Study in Dhaka
Text Analysis Using Twitter: A Case Study in Dhaka
 
Data Science and Analytics Brown Bag
Data Science and Analytics Brown BagData Science and Analytics Brown Bag
Data Science and Analytics Brown Bag
 
Big Data Conference
Big Data ConferenceBig Data Conference
Big Data Conference
 
Discontinuities Demo
Discontinuities DemoDiscontinuities Demo
Discontinuities Demo
 
DLISA
DLISADLISA
DLISA
 
Analytics Brownbag
Analytics Brownbag Analytics Brownbag
Analytics Brownbag
 
Big Data Taxonomy 8/26/2013
Big Data Taxonomy 8/26/2013Big Data Taxonomy 8/26/2013
Big Data Taxonomy 8/26/2013
 
Ontology and Reports
Ontology and ReportsOntology and Reports
Ontology and Reports
 
Data Tactics Semantic and Interoperability Summit Feb 12, 2013
Data Tactics Semantic and Interoperability Summit Feb 12, 2013Data Tactics Semantic and Interoperability Summit Feb 12, 2013
Data Tactics Semantic and Interoperability Summit Feb 12, 2013
 
Horizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence DataHorizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence Data
 
Bill Ontology Summit (08 feb 1400hrs) v2
Bill Ontology Summit (08 feb 1400hrs) v2Bill Ontology Summit (08 feb 1400hrs) v2
Bill Ontology Summit (08 feb 1400hrs) v2
 
DT Company Overview January 2013
DT Company Overview January 2013DT Company Overview January 2013
DT Company Overview January 2013
 
Capabilities Brief Analytics
Capabilities Brief AnalyticsCapabilities Brief Analytics
Capabilities Brief Analytics
 
Data Tactics dhs introduction to cloud technologies wtc
Data Tactics dhs introduction to cloud technologies wtcData Tactics dhs introduction to cloud technologies wtc
Data Tactics dhs introduction to cloud technologies wtc
 
Multi Discipline Intelligence Production Teams 1
Multi Discipline Intelligence Production Teams 1Multi Discipline Intelligence Production Teams 1
Multi Discipline Intelligence Production Teams 1
 
Data Tactics and Nervve Integrated Big Data v3
Data Tactics and Nervve Integrated Big Data v3Data Tactics and Nervve Integrated Big Data v3
Data Tactics and Nervve Integrated Big Data v3
 
Data Tactics Open Source Brief
Data Tactics Open Source BriefData Tactics Open Source Brief
Data Tactics Open Source Brief
 

Data Tactics Unified Dataspace Architecture and Description

  • 1. Data Tactics Unified DataSpace WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 2. Cloud WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 3. Systems Engineering & Integration SYSTEMS ENGINEERING • Data Ingestion Frameworks (structured, unstructured, semi-structured) • Semantic DataSpace Enrichment SYSTEM INTEGRATION • Cloud Management Systems (CMS) • Ingestion • Cloudbase/Accumulo – Generalized Ingest / – Pig (Big Data) Plug-in NiagraFiles • Dissemination and Reporting Tools • Geospatial Capabilities • Data Mining, Exploitation, and Correlation Tools • Biometric Capabilities WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 4. Cloud Experience 17 Enclaves at SECRET//NOFORN 4 Enclaves for NATO ISAF • 3 in Tyson’s • 2 in Afghanistan • 1 at GISA, Ft. Bragg • 1 at GISA, Fort Bragg • 2 in Hawaii • 1 in Germany • 2 in Germany US BICES Cloud in Germany • 7 at Aberdeen Over a dozen at UNCLASS//FOUO • 2 in Afghanistan • Supporting real-world missions on 6 Enclaves at TS//SCI contract • AF TENCAP • At various levels of complexity • NRL • DARPA • INSCOM • DCGS-A • DHS OI&A Cloud Domains is where we live Data, is the Hard Problem WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 5. Data – The Hard Part WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 6. BigData Architecture Data Tactics has delivered solutions that manage PETABYTES of data and provide mission relevant analytics, metrics and user interfaces • DESIGN, DEVELOPMENT AND INTEGRATION OF REFERENCE ARCHITECTURES – Ghost Machine – Stratus • SECURE DATABASE ARCHITECTURES – Secure Entity Database (SED) – Defense Cross-Domain Analytic Capability (DCAC) • DATA MIGRATION, EXTRACTION, TRANSFORM AND PARSING • FEDERATED DATA MANAGEMENT – Federated Search, Multi-Source / Multi-Vendor Integration – Storage Cluster Management • DATA MINING AND FORENSIC ANALYSIS • SPATIAL, MULTI-DOMAIN, AND CLOUD DATA SERVICES WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 7. Unified DataSpace The Wild • Data sources with rich data & Segment 3 - Model Description semantic context locked in domain Data Rich semantic silos Models context • Data tightly coupled to data-models • Data-models Segment 2 - Data Description tightly coupled to Structured Integration Enrichment storage models Data Exploitation Exploration Silos isolated by Across all sources • Implementation Segment 1 - Artifact Description technology • Storage structure Unstructured Rich data • Data Data context representation • Data modality WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 8. Unified DataSpace High-Level Conceptual Model of the DataSpace and Ingest/Extraction Flows Segment 3 - Model Semantics . 2 . 2 CONCEPT CONCEPT_ASSOCIATION PREDICATE PREDICATE_ASSOCIATION . . . Uses Uses Segment 1 - Artifact Semantics Segment 2 - Data Semantics Semantics SOURCE . + 2 . 2 . Metadata ARTIFACT ARTIFACT_ASSOCIATION TERM . STATEMENT . . . . Data Uses + Metadata Metadata Segment 0 - Artifacts Ingest Extraction WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 9. Unified DataSpace •Segment 0 is an artifact store (i.e., binary representation of artifacts). High-Level Conceptual Model of the DataSpace and Ingest/Extraction Flows Segment 3 - Model Semantics •Segment 1 represents artifact semantics . 2 . 2 CONCEPT CONCEPT_ASSOCIATION PREDICATE PREDICATE_ASSOCIATION . . . and includes artifact metadata and Uses Uses associations between the artifacts. Indexing Segment 1 - Artifact Semantics Segment 2 - Data Semantics Semantics of Segment 1 supports search on text SOURCE . + 2 . 2 . Metadata ARTIFACT ARTIFACT_ASSOCIATION TERM . STATEMENT . . . content, geospatial, and artifact meta data. . Data Uses + Metadata Metadata Segment 0 - Artifacts •Segment 2 represents data and semantics Ingest Extraction of structured data elements extracted from artifacts. Indexing of Segment 2 supports search on properties of entities (e.g., Person, Location) based on their properties and relationships. •Segment 3 represents data-models extracted from artifacts and models used for aligning, disambiguating, and enriching the elements of Segments 1 and 2. WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 10. Data Description Framework • DDF – looks at data in the following ways – Mention: A chunk of data, either physically located within a tangible artifact, or contained within an analyst’s mind • “Washington” at offset x in file Y – Sign: A representation of all disambiguated mentions that are identical except for their indexicality • E.g., “Washington” – Concept: An abstract idea, defined explicitly or implicitly by a source data- model • E.g., City, Person, Name, Address, Photo – Predicate: An abstract idea used to express a relationship between “things” • E.g., isCity, isPerson, hasName, hasAddress, hasPhoto – Term: A disambiguated sign abstracted from the source artifact or asserting analyst • E.g., Washington Person; Washington Location – Statement: Encodes a binary relationship between a subject (term) and an object mediated by a predicate • E.g.,[Washington, Person] hasPhoto [GeorgeWashingtonImage.jpg] WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 11. Unified DataSpace WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 12. Data Model Example WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 13. DataSpace Workbench WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS
  • 14. Elastic Data Ingest Java Messaging Service Artifact Processor Persistence Index Manager Error Manager Queue Manager Queue Queue Queue Queue Artifact Persistence Index Error Loader Processor Manager Manager Manager UDS Components Lucene File System Custom Components Hadoop DFS Artifact Processor Persistence Modules Manager Modules BigTable WWW.DATA–TACTICS.COM © 2012 Data Tactics ARCHITECT – ENGINEER – INTEGRATE – SOLUTIONS