SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Downloaden Sie, um offline zu lesen
The Italian Integrated System of
Statistical Registers
Design and Implementation
of an Ontology-Based
Data Integration Architecture
Roberta Radini, Monica Scannapieco, Laura Tosco
Directorate for Methodology and Statistical Process Design, Istat
 The project: the Italian Integrated System of
Statistical Registers (ISSR)
 ISSR as an ontology-based data integration system
 Piloting activities
 Next steps
Outline
 Significant revision of
statistical production
processes based on an
Integrated System of
Statistical Registers (ISSR)
 Global view: Identification
and estimation for the whole
integrated system of units
and variables
 Single logical environment to
support the consistency of
statistical production
processes
The Project: ISSR
Individuals
and Families
Economic
Units
Places
Requirement: Need to integrate
concepts belonging to different
thematic areas
4
Adoption of the ontology-based
data management approach for
accessing, integrating and
managing data sources
ISSR as an ontology-based data integration system - 1
 Ontology-based Data Management (OBDM) is a new paradigm, rooted
on the idea of using Database Theory and Semantic Technologies for
data management.
 OBDM is characterized by the following principles:
 Let data reside where they are (no need to move data)
 Define a logic-based conceptual specification of the domain of interest
(ontology)
 Map the ontology to the concrete data sources
 Express services/queries over the ontology and automatically obtain
answers
ISSR as an ontology-based data integration system - 2
Ontology: formal, shared and
explicit representation of the
conceptualization of the domain of
interest expressed through the
formal language which makes it
“machine-actionable”
Individuals and
Families
Economic
Units
Places
Ontology
Mapping
Data sources: sources
heterogeneous both semantically
and technologically
Mapping: rules expressing the
correspondence between data
and concepts/attributes of the
domain of interest
6
Detailed benefits
Global ontology layer
 Allows the coexistence of
different definitions of a
concept according to
different contexts, allowing
consistent access to the
underlying data
 Offers reasoning capability
allowing to “infer” new
knowledge
Metadata
 Complex in terms of
hugeness and lack of a direct
control. Ontologies can cope
with such a complexity
 Represented and accessible
through an IT system: so far
statistical metadata models
are “not represented” in
formal languages
 Coupled with data
Integration layer
 Permits to virtualize data
sources
 Performs “on-the fly” query
answering
Tranparency and
flexibility
Quality
Automated
Metadata
Governance
7
Raw Data Area
Working Data Area
Validated Data Area
ISSR
Ontology for:
 Semantic Data
Integration
 Data Access
 Data Quality
Check
Ontology for:
 Semantic Data
Integration
 Data Access
Data Stack
8
ISSR Prototype: The Istat - UNIROMA 1 Experience
Persons, Families and
Cohabitation Ontology
Use of the Mastro system for OBDM:
http://www.obdasystems.com/mastro
Individuals and
Families
Mapping: used to semantically
link data at the sources to the
ontology
Domain: portion of Base Register of Individuals, Families and Cohabitations
related to persons, including residential data, and their family
relationships.
Ontology expressed in Graphol (http://www.obdasystems.com/graphol)
9
ISSR Prototype: The Istat - UNIROMA 1 Experience
- Tab_pers[idp, date_of_birth, address]
- Tab_family[idf, idp, head_flag]
Tab_pers(x,y,z)  Person(x)
Tab_pers(x,y,z) ∧ y ≠ NULL  date_of_birth(x,y)
Tab_pers(x,y,z) ∧ z ≠ NULL  address(x,z)
Tab_family(x,y,z)  Family(x)
Tab_family(x,y,z)  belongsTo(y,x)
Tab_family(x,y,z) ∧ z = TRUE  headOf(y,x)
The data sources connected to the ontology are those containing information
about persons and families.
We specified about 100 mapping assertions to link such data sources to
the ontology.
Tables in the data source
containing information on persons
and families
10
ISSR Prototype: The Istat - UNIROMA 1 Experience
 Example of Access Query: We ask for persons belonging to
the same family joined by a civil union
 This is expressed over the ontology by means of the
following query:
? x,y,z | Resident(x) ∧ ReferencePerson(y) ∧ Family(z)
∧ belongsTo(x,z) ∧ belongsTo(y,z) ∧ headOf(y,z)
∧civilPartnerOf(x,y)
11
ISSR Prototype: The Istat - UNIROMA 1 Experience
 Example of Quality Check: Persons belonging to the same
family cannot have different residential addresses
 This is expressed over the ontology by means of the
following constraint:
∀x,y,z,v,w Family(x) ∧ belongsTo(y,x) ∧ address(y,v)
∧ belongsTo(z,x) ∧ address(z, w) ∧ v ≠ w  ⊥
12
ISSR Prototype: The Istat - UNIROMA 1 Experience
 Blue cells represent the ontology - mediated
access
 Green cells represent the direct access
13
Use cases, users and ontology mediated/direct access to data
 From prototyping to production
 Ontology modelling for registers of ISSRs
 Architectural implementations: choice of production
systems, configuration and set up
 Training IT, statistical and thematic people
 So far addressed Processing phase GSBPM (Generic
Statistical Production Model)
 Need to start addressing Analyse and Dissemination phases
of GSBPM (Aggregated Data)
 Start with prototyping
14
Conclusions and next steps

Weitere ähnliche Inhalte

Ähnlich wie Session III Census and registers - M. Scannapieco,The Italian Integrated System of Statistical Registers: Design and Implementation of an Ontology-based Data

Horizontal integration of warfighter intelligence data
Horizontal integration of warfighter intelligence dataHorizontal integration of warfighter intelligence data
Horizontal integration of warfighter intelligence dataBarry Smith
 
Driving Deep Semantics in Middleware and Networks: What, why and how?
Driving Deep Semantics in Middleware and Networks: What, why and how?Driving Deep Semantics in Middleware and Networks: What, why and how?
Driving Deep Semantics in Middleware and Networks: What, why and how?Amit Sheth
 
Semantics in Financial Services -David Newman
Semantics in Financial Services -David NewmanSemantics in Financial Services -David Newman
Semantics in Financial Services -David NewmanPeter Berger
 
Ingredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksIngredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksOscar Corcho
 
Patient-Like-Mine
Patient-Like-MinePatient-Like-Mine
Patient-Like-MineSimon Yates
 
Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...Cuong Tran Van
 
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTIONA NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTIONijscai
 
A Naive Method For Ontology Construction
A Naive Method For Ontology Construction A Naive Method For Ontology Construction
A Naive Method For Ontology Construction IJSCAI Journal
 
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTIONA NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTIONijscai
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook OntologyStuart Chalk
 
Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...Fernando de Assis Rodrigues
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so farElena Simperl
 
Ontology engineering of automatic text processing methods
Ontology engineering of automatic text processing methodsOntology engineering of automatic text processing methods
Ontology engineering of automatic text processing methodsIJECEIAES
 
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...Artificial Intelligence Institute at UofSC
 
Research Interests : Their Dynamics, Structures and Applications in Personali...
Research Interests : Their Dynamics, Structures and Applications in Personali...Research Interests : Their Dynamics, Structures and Applications in Personali...
Research Interests : Their Dynamics, Structures and Applications in Personali...Yi Zeng
 
Enhancing Semantic Mining
Enhancing Semantic MiningEnhancing Semantic Mining
Enhancing Semantic MiningSanthosh Kumar
 
Data science innovations
Data science innovations Data science innovations
Data science innovations suresh sood
 
Semantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream DataSemantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream DataOscar Corcho
 
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity DebateIn Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity DebateNeuroscience Information Framework
 

Ähnlich wie Session III Census and registers - M. Scannapieco,The Italian Integrated System of Statistical Registers: Design and Implementation of an Ontology-based Data (20)

Horizontal integration of warfighter intelligence data
Horizontal integration of warfighter intelligence dataHorizontal integration of warfighter intelligence data
Horizontal integration of warfighter intelligence data
 
Driving Deep Semantics in Middleware and Networks: What, why and how?
Driving Deep Semantics in Middleware and Networks: What, why and how?Driving Deep Semantics in Middleware and Networks: What, why and how?
Driving Deep Semantics in Middleware and Networks: What, why and how?
 
Semantics in Financial Services -David Newman
Semantics in Financial Services -David NewmanSemantics in Financial Services -David Newman
Semantics in Financial Services -David Newman
 
Ingredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksIngredients for Semantic Sensor Networks
Ingredients for Semantic Sensor Networks
 
Patient-Like-Mine
Patient-Like-MinePatient-Like-Mine
Patient-Like-Mine
 
Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...
 
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTIONA NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
 
A Naive Method For Ontology Construction
A Naive Method For Ontology Construction A Naive Method For Ontology Construction
A Naive Method For Ontology Construction
 
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTIONA NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION
 
String.pptx
String.pptxString.pptx
String.pptx
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
Ontology engineering of automatic text processing methods
Ontology engineering of automatic text processing methodsOntology engineering of automatic text processing methods
Ontology engineering of automatic text processing methods
 
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
 
Research Interests : Their Dynamics, Structures and Applications in Personali...
Research Interests : Their Dynamics, Structures and Applications in Personali...Research Interests : Their Dynamics, Structures and Applications in Personali...
Research Interests : Their Dynamics, Structures and Applications in Personali...
 
Enhancing Semantic Mining
Enhancing Semantic MiningEnhancing Semantic Mining
Enhancing Semantic Mining
 
Data science innovations
Data science innovations Data science innovations
Data science innovations
 
Semantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream DataSemantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream Data
 
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity DebateIn Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
 

Mehr von Istituto nazionale di statistica

Mehr von Istituto nazionale di statistica (20)

Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
14a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica1414a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica14
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 

Kürzlich hochgeladen

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 

Kürzlich hochgeladen (20)

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 

Session III Census and registers - M. Scannapieco,The Italian Integrated System of Statistical Registers: Design and Implementation of an Ontology-based Data

  • 1. The Italian Integrated System of Statistical Registers Design and Implementation of an Ontology-Based Data Integration Architecture Roberta Radini, Monica Scannapieco, Laura Tosco Directorate for Methodology and Statistical Process Design, Istat
  • 2.  The project: the Italian Integrated System of Statistical Registers (ISSR)  ISSR as an ontology-based data integration system  Piloting activities  Next steps Outline
  • 3.  Significant revision of statistical production processes based on an Integrated System of Statistical Registers (ISSR)  Global view: Identification and estimation for the whole integrated system of units and variables  Single logical environment to support the consistency of statistical production processes The Project: ISSR Individuals and Families Economic Units Places
  • 4. Requirement: Need to integrate concepts belonging to different thematic areas 4 Adoption of the ontology-based data management approach for accessing, integrating and managing data sources ISSR as an ontology-based data integration system - 1  Ontology-based Data Management (OBDM) is a new paradigm, rooted on the idea of using Database Theory and Semantic Technologies for data management.  OBDM is characterized by the following principles:  Let data reside where they are (no need to move data)  Define a logic-based conceptual specification of the domain of interest (ontology)  Map the ontology to the concrete data sources  Express services/queries over the ontology and automatically obtain answers
  • 5. ISSR as an ontology-based data integration system - 2 Ontology: formal, shared and explicit representation of the conceptualization of the domain of interest expressed through the formal language which makes it “machine-actionable” Individuals and Families Economic Units Places Ontology Mapping Data sources: sources heterogeneous both semantically and technologically Mapping: rules expressing the correspondence between data and concepts/attributes of the domain of interest
  • 6. 6 Detailed benefits Global ontology layer  Allows the coexistence of different definitions of a concept according to different contexts, allowing consistent access to the underlying data  Offers reasoning capability allowing to “infer” new knowledge Metadata  Complex in terms of hugeness and lack of a direct control. Ontologies can cope with such a complexity  Represented and accessible through an IT system: so far statistical metadata models are “not represented” in formal languages  Coupled with data Integration layer  Permits to virtualize data sources  Performs “on-the fly” query answering Tranparency and flexibility Quality Automated Metadata Governance
  • 7. 7 Raw Data Area Working Data Area Validated Data Area ISSR Ontology for:  Semantic Data Integration  Data Access  Data Quality Check Ontology for:  Semantic Data Integration  Data Access Data Stack
  • 8. 8 ISSR Prototype: The Istat - UNIROMA 1 Experience Persons, Families and Cohabitation Ontology Use of the Mastro system for OBDM: http://www.obdasystems.com/mastro Individuals and Families Mapping: used to semantically link data at the sources to the ontology
  • 9. Domain: portion of Base Register of Individuals, Families and Cohabitations related to persons, including residential data, and their family relationships. Ontology expressed in Graphol (http://www.obdasystems.com/graphol) 9 ISSR Prototype: The Istat - UNIROMA 1 Experience
  • 10. - Tab_pers[idp, date_of_birth, address] - Tab_family[idf, idp, head_flag] Tab_pers(x,y,z)  Person(x) Tab_pers(x,y,z) ∧ y ≠ NULL  date_of_birth(x,y) Tab_pers(x,y,z) ∧ z ≠ NULL  address(x,z) Tab_family(x,y,z)  Family(x) Tab_family(x,y,z)  belongsTo(y,x) Tab_family(x,y,z) ∧ z = TRUE  headOf(y,x) The data sources connected to the ontology are those containing information about persons and families. We specified about 100 mapping assertions to link such data sources to the ontology. Tables in the data source containing information on persons and families 10 ISSR Prototype: The Istat - UNIROMA 1 Experience
  • 11.  Example of Access Query: We ask for persons belonging to the same family joined by a civil union  This is expressed over the ontology by means of the following query: ? x,y,z | Resident(x) ∧ ReferencePerson(y) ∧ Family(z) ∧ belongsTo(x,z) ∧ belongsTo(y,z) ∧ headOf(y,z) ∧civilPartnerOf(x,y) 11 ISSR Prototype: The Istat - UNIROMA 1 Experience
  • 12.  Example of Quality Check: Persons belonging to the same family cannot have different residential addresses  This is expressed over the ontology by means of the following constraint: ∀x,y,z,v,w Family(x) ∧ belongsTo(y,x) ∧ address(y,v) ∧ belongsTo(z,x) ∧ address(z, w) ∧ v ≠ w  ⊥ 12 ISSR Prototype: The Istat - UNIROMA 1 Experience
  • 13.  Blue cells represent the ontology - mediated access  Green cells represent the direct access 13 Use cases, users and ontology mediated/direct access to data
  • 14.  From prototyping to production  Ontology modelling for registers of ISSRs  Architectural implementations: choice of production systems, configuration and set up  Training IT, statistical and thematic people  So far addressed Processing phase GSBPM (Generic Statistical Production Model)  Need to start addressing Analyse and Dissemination phases of GSBPM (Aggregated Data)  Start with prototyping 14 Conclusions and next steps