SlideShare a Scribd company logo
1 of 36
MEETING TODAY‟S
DISSEMINATION CHALLENGES:
Implementing international standards in .Stat
Prepared by Jonathan Challener, OECD
For MSIS, April 2014 - Dublin, Ireland
Doesn’t non-standard power supplies make things difficult?
What happens when standards are not applied well?
Picture: ‘The day Sweden changed from left-hand drive to right’
Confusion entails
This all adds up…
…high costs…
and inefficiencies!
and inefficiencies!
“A little like the grade 8 student who doesn‟t
pay attention in class all year”.
WHAT IS .STAT?
What is .Stat?
.Stat is the central repository ("warehouse")
of validated statistics and related metadata
.Stat is the central hub connecting data production,
sharing & dissemination processes
It is the corporate source of data for
data sharing and dissemination purposes
What is .Stat?
.Stat is the central repository ("warehouse")
of validated statistics and related metadata
.Stat is the central hub connecting data production,
sharing & dissemination processes
It is the corporate source of data for
data sharing and dissemination purposes
“.Stat is now being used and shared with 10 organisations
including the OECD, as part of the Statistical Information
System Collaboration Community (SIS-CC)”.
.Stat Positioning in Statistical Information System
DATA DELIVERY
INTERNAL DATA
SHARING
DATA DISSEMINATION
DATA PRODUCTION
.STAT
.Stat Positioning in Statistical Information System
DATA DELIVERY
INTERNAL DATA
SHARING
DATA DISSEMINATION
DATA PRODUCTION
.STAT
“The diagram illustrates the .Stat contribution to the SIS processes.
.Stat’s core value-added lies in “Data Delivery”, a set of functions that
enable dissemination and data sharing, and “Data Upload”, a set of
functions interfacing data production processes into a single upload
mechanism to feed dissemination channels”.
.Stat Functional Representation
.STAT DATA DELIVERY ENGINE
DATA PRODUCTION
DATA SHARING DATA DISSEMINATION
SEARCH
ENGINES
DATA ANALYSIS
TOOLS
P
C
WEBSITES, APPS
PUBLICATIONS.STAT BROWSER
.STAT DATA
UPLOAD ENGINE
FILE
UPLOAD
SDMX
IMPORT
DATA PRODUCTION
TOOLS
TABLE & CHART
EXTRACTION SERVICES
RELEASE MGT
SERVICES
.STAT BROWSER
CONFIGURATION
DATA
EXTRACTION SERVICES
SDMX
INPUT
E
P
BATCH
UPLOAD
SDMX
GLOBAL
REGISTRY
PUBLISHING
BACK
OFFICE
DATA
MAPPING
SDMX
OUTPUT
X
X
.Stat
Component
Process
Human user
Data Producer
Data Editor
Data Consumer
API or
Webservice
Other
SDMX hubs
.Stat Functional Representation
.STAT DATA DELIVERY ENGINE
DATA PRODUCTION
DATA SHARING DATA DISSEMINATION
SEARCH
ENGINES
DATA ANALYSIS
TOOLS
P
C
WEBSITES, APPS
PUBLICATIONS.STAT BROWSER
.STAT DATA
UPLOAD ENGINE
FILE
UPLOAD
SDMX
IMPORT
DATA PRODUCTION
TOOLS
TABLE & CHART
EXTRACTION SERVICES
RELEASE MGT
SERVICES
.STAT BROWSER
CONFIGURATION
DATA
EXTRACTION SERVICES
SDMX
INPUT
E
P
BATCH
UPLOAD
SDMX
GLOBAL
REGISTRY
PUBLISHING
BACK
OFFICE
DATA
MAPPING
SDMX
OUTPUT
X
X
.Stat
Component
Process
Human user
Data Producer
Data Editor
Data Consumer
API or
Webservice
Other
SDMX hubs
“The grey shaded boxes in the figure below show a visual
representation of how .Stat fits within a broader Data Dissemination
Information System of organisations; the boxes with dotted lines
represent other components of the Data Dissemination Information
System that are not supported by .Stat but are enabled by it”.
.Stat Functional Representation
In particular, .Stat provides the following 3 key functional areas…
.Stat Functional Representation.Stat Data Upload Engine
.Stat Functional Representation.Stat Data Delivery Engine
.Stat Functional Representation.Stat Data Browser
.Stat Positioning in GSBPM Reference Model
.Stat contributes to
Planned additions
Archive incorporated into the over-
arching process of data and metadata
management
.Stat Positioning in GSBPM Reference Model
.Stat contributes to
Planned additions
Archive incorporated into the over-
arching process of data and metadata
management
“.Stat can be mapped today to the Generic Statistical Business
Process Model (GSBPM) under “Disseminate” and “Build”. In the
future it will also incorporate archive functions as part of the over-
arching process for data and metadata management”.
Multipurpose SDMX within .Stat…
For dissemination and data eXchange
SDMXWS and RESTful API
• SDMX 2.0 compliant
• SOAP + REST
• Pull
• SDMX-ML
• SDMX Structural metadata
created on the fly
For „Open Data‟ dissemination
SDMX-JSON (beta)
SDMX-TWG agreed in mid 2013 on proposal for data and their
structural metadata (inc. flat & sliced layouts) and referential
metadata (dataset, series, obs) as annotations.
Further enhancements to come: Complete data structures and
referential metadata
For data reporting
SDMX-Reference
Infrastructure (RI)*
• SDMX 2.0 and 2.1 compliant
• SOAP + REST
• SDMX Common APIs
(SdmxSource.NET)
• Pull + Push
• SDMX-ML, GESMES , CSV
• Structural metadata stored in
mini registry
• One web service - several
mapped database instances
Mapping
Store DB
XXX.Stat
Data
warehouse
SDMX-RI
Web Service
Dissemination
Mapping
Assistant
SDMX-RI
* The integration of SDMX-RI in .Stat is based on collaboration with Eurostat,
provider of the SDMX-RI component with ISTAT taking the lead on behalf of
the OECD‟s Statistical Information System Collaboration Community.
For internal data sharing
DirectAccess
• Restful SDMX query
• Flat data, flags, units
• Referential metadata
Excel-add-in
• DirectAccess (Rest SDMX)
• Native Excel pivot table
• Wizard to select data
For a decentralised publishing environment
DataHub*
• One interface to the publishing
tools
• Centralised reporting and auditing
• SDMX based structural metadata,
and referential metadata
management
• Flexible load tool that promotes
‘self publish’ for data custodians
• In-built checks and safeguards to
minimise errors
• Manages security and access
rights
• Can be extended to manage other
outputs and not limited to .Stat
* DataHub has been developed and integrated with .Stat by
Statistics NZ, with an additional connection to the Fusion Registry
for managing structural metadata through the definition of DSDs.
Future outlook…
Further SDMX artifact support
SDMX ingest (Import)
SDMX global registry API
SDMX-RDF data cube vocabulary pilot
SDMX-RDF data cube vocabulary pilot
“Explore further semantic web/linked data
opportunities (SDMX-RDF data cube
vocabulary). To be taken forward by ISTAT and
ABS under the SIS-CC umbrella”.
• Lower technology adoption costs
• Increased development consistency, simplicity and
predictability
• Improved code reuse
• Reduced cost, time and effort to transition between
different solutions
We all know the…
• Reduced focus on infrastructure
• Ability to create composite interfaces that are tailored to the needs of
specific task
• Improved application portability
• Enable faster time to market because it is easier to use off the shelf
components and applications that can integrate and provide features for the
solution
References
1. Operationalising .Stat in a decentralised publishing
environment (DataHub) by Tony Breen SNZ :
https://community.oecd.org/docs/DOC-68362
2. Building a scalable architecture (.Stat) by Jens Dosse OECD:
https://community.oecd.org/docs/DOC-68363
3. SDMX-RI and .Stat integration by Francesco Rizzo Istat:
https://community.oecd.org/docs/DOC-68696
4. SDMX-JSON API: http://stats.oecd.org/opendataapi/Index.htm
Jonathan Challener, OECD
jonathan.challener@oecd.org
@Challener
MSIS - Dublin, 14-16 April 2014
Meeting today’s dissemination challenges: Implementing international standards in .Stat
Thank you

More Related Content

What's hot

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo
 
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data VirtualizationMyth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data VirtualizationDenodo
 
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationWorkshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationZoltan Nagy
 
Cloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service OptionCloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service OptionDenodo
 
Cortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data CatalogCortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data CatalogMSAdvAnalytics
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroDenodo
 
Supporting Data Services Marketplace using Data Virtualization
Supporting Data Services Marketplace using Data VirtualizationSupporting Data Services Marketplace using Data Virtualization
Supporting Data Services Marketplace using Data VirtualizationDenodo
 
Integrating with Azure Data Lake
Integrating with Azure Data LakeIntegrating with Azure Data Lake
Integrating with Azure Data LakeRuman Khan
 
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationFilling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationPentaho
 
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013DKAN Drupal Distribution Presentation at Drupal Gov Days 2013
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013Andrew Hoppin
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionDenodo
 
Data Virtualization: From Zero to Hero (Middle East)
Data Virtualization: From Zero to Hero (Middle East)Data Virtualization: From Zero to Hero (Middle East)
Data Virtualization: From Zero to Hero (Middle East)Denodo
 
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...Denodo
 
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and RoadmapDenodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and RoadmapDenodo
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationDenodo
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Denodo
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationDenodo
 

What's hot (20)

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
 
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data VirtualizationMyth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
 
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationWorkshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
 
Cloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service OptionCloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service Option
 
Cortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data CatalogCortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data Catalog
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to Hero
 
Supporting Data Services Marketplace using Data Virtualization
Supporting Data Services Marketplace using Data VirtualizationSupporting Data Services Marketplace using Data Virtualization
Supporting Data Services Marketplace using Data Virtualization
 
Integrating with Azure Data Lake
Integrating with Azure Data LakeIntegrating with Azure Data Lake
Integrating with Azure Data Lake
 
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationFilling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
 
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013DKAN Drupal Distribution Presentation at Drupal Gov Days 2013
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
Data Virtualization: From Zero to Hero (Middle East)
Data Virtualization: From Zero to Hero (Middle East)Data Virtualization: From Zero to Hero (Middle East)
Data Virtualization: From Zero to Hero (Middle East)
 
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
 
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and RoadmapDenodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 
Big Data Landscape 2016
Big Data Landscape 2016Big Data Landscape 2016
Big Data Landscape 2016
 
tecFinal 451 webinar deck
tecFinal 451 webinar decktecFinal 451 webinar deck
tecFinal 451 webinar deck
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
 

Viewers also liked

The oecd delta project – providing easier access to data through api's
The oecd delta project – providing easier access to data through api'sThe oecd delta project – providing easier access to data through api's
The oecd delta project – providing easier access to data through api'sJonathan Challener
 
The future of charting in .Stat
The future of charting in .StatThe future of charting in .Stat
The future of charting in .StatJonathan Challener
 
Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...
Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...
Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...Jonathan Challener
 
Speech presentation amber palassis
Speech presentation  amber palassisSpeech presentation  amber palassis
Speech presentation amber palassisAmber Palassis
 
Volkswagen aftersales crm_architecture
Volkswagen aftersales crm_architectureVolkswagen aftersales crm_architecture
Volkswagen aftersales crm_architectureAlexander Zhironkin
 
Lovers that never were - a story in 5 pictures
Lovers that never were -  a story in 5 picturesLovers that never were -  a story in 5 pictures
Lovers that never were - a story in 5 picturesAzam Noon
 
Community capacity building and process improvements
Community capacity building and process improvementsCommunity capacity building and process improvements
Community capacity building and process improvementsJonathan Challener
 
Vw dsg tender presentation 14042014
Vw dsg tender presentation 14042014Vw dsg tender presentation 14042014
Vw dsg tender presentation 14042014Alexander Zhironkin
 
The building blocks for a reusable front end - #imaodbc2015
The building blocks for a reusable front end - #imaodbc2015The building blocks for a reusable front end - #imaodbc2015
The building blocks for a reusable front end - #imaodbc2015Jonathan Challener
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigmJonathan Challener
 
Introduccion a la peluqueria canina 1
Introduccion a la peluqueria canina 1Introduccion a la peluqueria canina 1
Introduccion a la peluqueria canina 1krolligan
 

Viewers also liked (13)

The oecd delta project – providing easier access to data through api's
The oecd delta project – providing easier access to data through api'sThe oecd delta project – providing easier access to data through api's
The oecd delta project – providing easier access to data through api's
 
The future of charting in .Stat
The future of charting in .StatThe future of charting in .Stat
The future of charting in .Stat
 
Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...
Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...
Being open, accessible, and understandable by Jonathan Challener, OECD - #ima...
 
Nfl
NflNfl
Nfl
 
Speech presentation amber palassis
Speech presentation  amber palassisSpeech presentation  amber palassis
Speech presentation amber palassis
 
Volkswagen aftersales crm_architecture
Volkswagen aftersales crm_architectureVolkswagen aftersales crm_architecture
Volkswagen aftersales crm_architecture
 
Lovers that never were - a story in 5 pictures
Lovers that never were -  a story in 5 picturesLovers that never were -  a story in 5 pictures
Lovers that never were - a story in 5 pictures
 
Community capacity building and process improvements
Community capacity building and process improvementsCommunity capacity building and process improvements
Community capacity building and process improvements
 
Vw dsg tender presentation 14042014
Vw dsg tender presentation 14042014Vw dsg tender presentation 14042014
Vw dsg tender presentation 14042014
 
The building blocks for a reusable front end - #imaodbc2015
The building blocks for a reusable front end - #imaodbc2015The building blocks for a reusable front end - #imaodbc2015
The building blocks for a reusable front end - #imaodbc2015
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigm
 
Introduccion a la peluqueria canina 1
Introduccion a la peluqueria canina 1Introduccion a la peluqueria canina 1
Introduccion a la peluqueria canina 1
 
Philippine National Heroes
Philippine National HeroesPhilippine National Heroes
Philippine National Heroes
 

Similar to Meeting today’s dissemination challenges – Implementing International Standards in OECD.Stat MSIS Dublin 14-16 April 2014

Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationDenodo
 
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data LakeMetroStar
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Denodo
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchSheetal Pratik
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptxElsonPaul2
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016StampedeCon
 
Trivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Denodo
 
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT IntegrationDenodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT IntegrationDenodo
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...Denodo
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakeseccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data LakesLinked Enterprise Date Services
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesData Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesDenodo
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesDenodo
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Denodo
 
Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jrJonathan Raspaud
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItDenodo
 

Similar to Meeting today’s dissemination challenges – Implementing International Standards in OECD.Stat MSIS Dublin 14-16 April 2014 (20)

Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbench
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
Trivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data Lake
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT IntegrationDenodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakeseccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesData Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
 
Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jr
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need It
 

Recently uploaded

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Recently uploaded (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Meeting today’s dissemination challenges – Implementing International Standards in OECD.Stat MSIS Dublin 14-16 April 2014

  • 1. MEETING TODAY‟S DISSEMINATION CHALLENGES: Implementing international standards in .Stat Prepared by Jonathan Challener, OECD For MSIS, April 2014 - Dublin, Ireland
  • 2. Doesn’t non-standard power supplies make things difficult?
  • 3. What happens when standards are not applied well?
  • 4. Picture: ‘The day Sweden changed from left-hand drive to right’ Confusion entails
  • 8. and inefficiencies! “A little like the grade 8 student who doesn‟t pay attention in class all year”.
  • 10. What is .Stat? .Stat is the central repository ("warehouse") of validated statistics and related metadata .Stat is the central hub connecting data production, sharing & dissemination processes It is the corporate source of data for data sharing and dissemination purposes
  • 11. What is .Stat? .Stat is the central repository ("warehouse") of validated statistics and related metadata .Stat is the central hub connecting data production, sharing & dissemination processes It is the corporate source of data for data sharing and dissemination purposes “.Stat is now being used and shared with 10 organisations including the OECD, as part of the Statistical Information System Collaboration Community (SIS-CC)”.
  • 12. .Stat Positioning in Statistical Information System DATA DELIVERY INTERNAL DATA SHARING DATA DISSEMINATION DATA PRODUCTION .STAT
  • 13. .Stat Positioning in Statistical Information System DATA DELIVERY INTERNAL DATA SHARING DATA DISSEMINATION DATA PRODUCTION .STAT “The diagram illustrates the .Stat contribution to the SIS processes. .Stat’s core value-added lies in “Data Delivery”, a set of functions that enable dissemination and data sharing, and “Data Upload”, a set of functions interfacing data production processes into a single upload mechanism to feed dissemination channels”.
  • 14. .Stat Functional Representation .STAT DATA DELIVERY ENGINE DATA PRODUCTION DATA SHARING DATA DISSEMINATION SEARCH ENGINES DATA ANALYSIS TOOLS P C WEBSITES, APPS PUBLICATIONS.STAT BROWSER .STAT DATA UPLOAD ENGINE FILE UPLOAD SDMX IMPORT DATA PRODUCTION TOOLS TABLE & CHART EXTRACTION SERVICES RELEASE MGT SERVICES .STAT BROWSER CONFIGURATION DATA EXTRACTION SERVICES SDMX INPUT E P BATCH UPLOAD SDMX GLOBAL REGISTRY PUBLISHING BACK OFFICE DATA MAPPING SDMX OUTPUT X X .Stat Component Process Human user Data Producer Data Editor Data Consumer API or Webservice Other SDMX hubs
  • 15. .Stat Functional Representation .STAT DATA DELIVERY ENGINE DATA PRODUCTION DATA SHARING DATA DISSEMINATION SEARCH ENGINES DATA ANALYSIS TOOLS P C WEBSITES, APPS PUBLICATIONS.STAT BROWSER .STAT DATA UPLOAD ENGINE FILE UPLOAD SDMX IMPORT DATA PRODUCTION TOOLS TABLE & CHART EXTRACTION SERVICES RELEASE MGT SERVICES .STAT BROWSER CONFIGURATION DATA EXTRACTION SERVICES SDMX INPUT E P BATCH UPLOAD SDMX GLOBAL REGISTRY PUBLISHING BACK OFFICE DATA MAPPING SDMX OUTPUT X X .Stat Component Process Human user Data Producer Data Editor Data Consumer API or Webservice Other SDMX hubs “The grey shaded boxes in the figure below show a visual representation of how .Stat fits within a broader Data Dissemination Information System of organisations; the boxes with dotted lines represent other components of the Data Dissemination Information System that are not supported by .Stat but are enabled by it”.
  • 16. .Stat Functional Representation In particular, .Stat provides the following 3 key functional areas…
  • 18. .Stat Functional Representation.Stat Data Delivery Engine
  • 20. .Stat Positioning in GSBPM Reference Model .Stat contributes to Planned additions Archive incorporated into the over- arching process of data and metadata management
  • 21. .Stat Positioning in GSBPM Reference Model .Stat contributes to Planned additions Archive incorporated into the over- arching process of data and metadata management “.Stat can be mapped today to the Generic Statistical Business Process Model (GSBPM) under “Disseminate” and “Build”. In the future it will also incorporate archive functions as part of the over- arching process for data and metadata management”.
  • 23. For dissemination and data eXchange SDMXWS and RESTful API • SDMX 2.0 compliant • SOAP + REST • Pull • SDMX-ML • SDMX Structural metadata created on the fly
  • 24. For „Open Data‟ dissemination SDMX-JSON (beta) SDMX-TWG agreed in mid 2013 on proposal for data and their structural metadata (inc. flat & sliced layouts) and referential metadata (dataset, series, obs) as annotations. Further enhancements to come: Complete data structures and referential metadata
  • 25. For data reporting SDMX-Reference Infrastructure (RI)* • SDMX 2.0 and 2.1 compliant • SOAP + REST • SDMX Common APIs (SdmxSource.NET) • Pull + Push • SDMX-ML, GESMES , CSV • Structural metadata stored in mini registry • One web service - several mapped database instances Mapping Store DB XXX.Stat Data warehouse SDMX-RI Web Service Dissemination Mapping Assistant SDMX-RI * The integration of SDMX-RI in .Stat is based on collaboration with Eurostat, provider of the SDMX-RI component with ISTAT taking the lead on behalf of the OECD‟s Statistical Information System Collaboration Community.
  • 26. For internal data sharing DirectAccess • Restful SDMX query • Flat data, flags, units • Referential metadata Excel-add-in • DirectAccess (Rest SDMX) • Native Excel pivot table • Wizard to select data
  • 27. For a decentralised publishing environment DataHub* • One interface to the publishing tools • Centralised reporting and auditing • SDMX based structural metadata, and referential metadata management • Flexible load tool that promotes ‘self publish’ for data custodians • In-built checks and safeguards to minimise errors • Manages security and access rights • Can be extended to manage other outputs and not limited to .Stat * DataHub has been developed and integrated with .Stat by Statistics NZ, with an additional connection to the Fusion Registry for managing structural metadata through the definition of DSDs.
  • 32. SDMX-RDF data cube vocabulary pilot
  • 33. SDMX-RDF data cube vocabulary pilot “Explore further semantic web/linked data opportunities (SDMX-RDF data cube vocabulary). To be taken forward by ISTAT and ABS under the SIS-CC umbrella”.
  • 34. • Lower technology adoption costs • Increased development consistency, simplicity and predictability • Improved code reuse • Reduced cost, time and effort to transition between different solutions We all know the… • Reduced focus on infrastructure • Ability to create composite interfaces that are tailored to the needs of specific task • Improved application portability • Enable faster time to market because it is easier to use off the shelf components and applications that can integrate and provide features for the solution
  • 35. References 1. Operationalising .Stat in a decentralised publishing environment (DataHub) by Tony Breen SNZ : https://community.oecd.org/docs/DOC-68362 2. Building a scalable architecture (.Stat) by Jens Dosse OECD: https://community.oecd.org/docs/DOC-68363 3. SDMX-RI and .Stat integration by Francesco Rizzo Istat: https://community.oecd.org/docs/DOC-68696 4. SDMX-JSON API: http://stats.oecd.org/opendataapi/Index.htm
  • 36. Jonathan Challener, OECD jonathan.challener@oecd.org @Challener MSIS - Dublin, 14-16 April 2014 Meeting today’s dissemination challenges: Implementing international standards in .Stat Thank you