SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Going Local with a World-Class Data
Infrastructure: Enabling SDMX for
Research Support
   Rob Grim
   Head Research Support/Research Data Specialist
   Executive Manager Open Data Foundation (ODaF)
   Library & IT Services, Tilburg University (Netherlands)
   IASSIST 2012, June 8 Washington
Research Data Support? With SDMX?

•   Why should we support
    researchers anyway?
•   Why should a university use a
    complex set of standards such as
    SDMX to support research?
•   CARDS World Taxation Indicators                       Curation

    project
•   Collaborative research
                                                   SDMX
•   Workflow support
•   Infrastructure development                                   Capture

•   Metadata management
•   What does it take get SDMX up
    and running?


                                       13-6-2012                           2
Research Data Support (Tilburg University)

1.   Archive research data
     and supplementary
     materials
2.   Register data sources
     used and provenance                     Dataset available !
     information
3.   Assist with dataset
     description to improve
     accessibility of datasets
4.   Integrated library and
     data catalogue
5.   Subject portals e.g.
     „European Values Study‟
                                 DDI and RDF in metadata record (hidden)
6.   Financial Data Support



                                 13-6-2012                           3
Research (Data) Support

 1.   “Research Support”, often              Landscape tools
      used as a synonym for IT
      support
 2.   Current research data                         Dataverse Network (DVN)
      services focus on data
      archiving, DMPs, curation
                                     Archiving + Access Management
 3.   Simple approaches to data
      sharing
 4.   Portfolio of research data                 SDMX
      tools needed to support
      academic practices
                                     Metadata Repository               Questasy
 5.   Potential of metadata
      management undervalued                            Survey documentation

                                   SDMX Data Repository
 Aim for “Need to have”
 instead of “Nice to Have”

                                                13-6-2012                         4
Metadata Management
         Metadata Registry                           Capture




 Dissemination                          Tools

                                                                 Capture




                             Capture
                             Time-series data                        Source: Eurostat
 Source: OECD.Stat

                                                13-6-2012                               5
                                                            Tools needed!
Why SDMX?

1.   SDMX allows us to capture and manage
     „data intelligence‟ in a formalized and
     structured way
                                                                          Curation
2.   SDMX information model useful to
     describe time-series data from different
     disciplines                                                   SDMX

3.   SDMX offers means to prevent
     unnecessary replication of data                                             Capture


4.   SDMX offers means to deal with
     confidential data and IPR
5.   The standard is well used, training
     materials, tutorials available
6.   SDMX IT tools are available for different               FAO
     platforms: Java .NET
7.   FAO OpenSDMX initiative (D4Science)
8.   Researchers want „something‟ like
     OECD.Stat

                                                 OECD.Stat
                                                 13-6-2012                                 6
Workflow

                                                    Existing tools




                                        Time series metadata:
 Verbs:                                 concepts, dimensions,
 …Extract from PDF, CSV                 attributes
 …Convert toSDMX-ML
 …Code 4 Registry         Table: Overview of registry structures
                 Agency                 ECB    FAOStat   GIST   IAEG    ISO    SDMX
                 Agency Scheme            0       0         0      0       0     1
                 Categorisations          0       0         0      0       0     0
                 Category schemes         1       0         0      0       0     1
                 Codelists                2       8         2      10      0     9
URL: WTI
                 Concept schemes          1       1         0      2       1     1
                 DSD                      3       1         0      1       0     0
                 Dataflows                        1         1
                 Data provider scheme             1         1
                 Provision agreement                        2

                                                    13-6-2012                         7
Where we are now?
•   Production workflow for SDMX
•   Populating the metadata registry
     •   Enter (hierachichal)
         codelists
     •   Concept IDs
     •   Concept Schemes
     •   DSDs
     •   Dataflows
     •   SDMX ML Generic format
•   WTI Fusion Registry
•   SDMX data repository
     •   Keep data in the original
         formats (csv, txt, Stata)
     •   Convert data from a
         database to SDMX              Source: SDMX Information Model
     •   Specific purpose database
         for SDMX compliant system
     •   Other: Collaborate with
         FAO, Open SDMX?

                                                      13-6-2012         8
Metadata registry: Fusion Registry




            Code Values



Codelists
Concept Scheme




Titelpresentatie in Footer   13-6-2012           10
Category Scheme
CARDS-project World Taxation Indicators

1.   Georgia State University, International
     Center for Public Policy, World Tax
     Indicators Portal
2.   Tilburg University, prof. Jenny Ligthart
3.   Lack of data on personal income tax
     (PIT), corporate income tax (CIT), Value
     Added Tax (VAT) and other tax
     indicators
4.   Incomplete series, missing countries,
     tax data difficult to access
     (addendums), difficult to compare
5.   Work WTI group: statutory tax rates.
     Tilburg: effective tax rates, corporate
     income tax.
6.   The „raw „data stem from the IMF/GFS
     and the OECD/Revenue statistics.


                                                13-6-2012   12
Lessons learnt so far

• Support of senior management is needed to get beyond the
  project/pilot stage
• SDMX standards are complex: steep learning curve
• Capacity building is a must (Tip: Eurostat SDMX tutorials)
• SDMX data repository: collaborate with other organizations
• Focus on DSDs, full target and partial identifiers, hierarchical code
  lists
• Fusion Registry upgrade
• Additional (academic) partners welcome to leverage the macro
  economic time series registry and repository




                                          13-6-2012                       13
Acknowledgements

      • CARDS was funded SURF.     Final Thought
        The CARDS project was
                                           Don‟t forget!
        undertaken in 2011 in the
        framework of the SURFshare         Before you ask:
        programme – Access to
        Research Data                      “What you can do for your
      • WTI group and prof. Jenny                                     country “, ask yourself:
        Ligthart
                                                                      “What metadata management
                                                                      can do for you”
References
1.   Burgi-Schmelz, A. (2009). Data to the rescue. Why improved statistical information will be key for
     prevention of future crises. Finance and Development, 46(1), 31-43.
2.   Peter, K. S., Buttrick, S., & Duncan, D. Data appendix to “global reform of personal income taxation, 1981-
     2005: Evidence from 189 countries”
3.   Peter, K. S., Steve Buttrick, & Duncan, D. (2010). Global reform of personal income taxation, 1981-2005:
     Evidence from 189 countries. National Tax Journal, 6(3).


                                                                   13-6-2012                               14

Weitere ähnliche Inhalte

Was ist angesagt?

Research data management & planning: an introduction
Research data management & planning: an introductionResearch data management & planning: an introduction
Research data management & planning: an introductionMaggie Neilson
 
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...EUDAT
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and LibariesRob Grim
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT
 
Developing research data management policy & services
Developing research data management policy & servicesDeveloping research data management policy & services
Developing research data management policy & servicesSarah Jones
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | EUDAT
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Gautier Poupeau
 
SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science Robert H. McDonald
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementSarah Jones
 
Providing support and services for researchers in good data governance
Providing support and services for researchers in good data governanceProviding support and services for researchers in good data governance
Providing support and services for researchers in good data governanceRobin Rice
 

Was ist angesagt? (20)

General concepts: DDI
General concepts: DDIGeneral concepts: DDI
General concepts: DDI
 
Research data management & planning: an introduction
Research data management & planning: an introductionResearch data management & planning: an introduction
Research data management & planning: an introduction
 
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu |
 
Metadata Standards
Metadata StandardsMetadata Standards
Metadata Standards
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 
NISO/DCMI Webinar: Metadata for Managing Scientific Research Data
NISO/DCMI Webinar: Metadata for Managing Scientific Research DataNISO/DCMI Webinar: Metadata for Managing Scientific Research Data
NISO/DCMI Webinar: Metadata for Managing Scientific Research Data
 
Developing research data management policy & services
Developing research data management policy & servicesDeveloping research data management policy & services
Developing research data management policy & services
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...
 
DMP in 5 minutes
DMP in 5 minutesDMP in 5 minutes
DMP in 5 minutes
 
Metadata: A concept
Metadata: A conceptMetadata: A concept
Metadata: A concept
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Data mining
Data miningData mining
Data mining
 
MANTRA Research Data Lifecycle
MANTRA Research Data LifecycleMANTRA Research Data Lifecycle
MANTRA Research Data Lifecycle
 
SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Providing support and services for researchers in good data governance
Providing support and services for researchers in good data governanceProviding support and services for researchers in good data governance
Providing support and services for researchers in good data governance
 
P1 capitulo 5
P1 capitulo 5P1 capitulo 5
P1 capitulo 5
 

Ähnlich wie Going local with a world-class data infrastructure: Enabling SDMX for research support

Simon Hodson
Simon HodsonSimon Hodson
Simon HodsonEduserv
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATOpenAIRE
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...OpenAIRE
 
Data Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factorData Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factorMartin Donnelly
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata managementOpen Data Support
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE
 
Big Data A Review
Big Data A ReviewBig Data A Review
Big Data A Reviewijtsrd
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
 
Data Management Plans: a gentle introduction
Data Management Plans: a gentle introductionData Management Plans: a gentle introduction
Data Management Plans: a gentle introductionMartin Donnelly
 
2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning WorkshopLizzy_Rolando
 
First they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and UsedFirst they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and UsedRensselaer Polytechnic Institute
 
Metadata for digital long-term preservation
Metadata for digital long-term preservationMetadata for digital long-term preservation
Metadata for digital long-term preservationMichael Day
 

Ähnlich wie Going local with a world-class data infrastructure: Enabling SDMX for research support (20)

Bosch, Wackerow: Linked data on the web
Bosch, Wackerow: Linked data on the web Bosch, Wackerow: Linked data on the web
Bosch, Wackerow: Linked data on the web
 
Simon Hodson
Simon HodsonSimon Hodson
Simon Hodson
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
 
What is a DMP
What is a DMPWhat is a DMP
What is a DMP
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
 
Data Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factorData Management Planning at the DCC: a human factor
Data Management Planning at the DCC: a human factor
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata management
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
The necessity of metadata for linked open data and its contribution to policy...
The necessity of metadata for linked open data and its contribution to policy...The necessity of metadata for linked open data and its contribution to policy...
The necessity of metadata for linked open data and its contribution to policy...
 
Big Data A Review
Big Data A ReviewBig Data A Review
Big Data A Review
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Data Management Plans: a gentle introduction
Data Management Plans: a gentle introductionData Management Plans: a gentle introduction
Data Management Plans: a gentle introduction
 
2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop
 
First they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and UsedFirst they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and Used
 
Metadata for digital long-term preservation
Metadata for digital long-term preservationMetadata for digital long-term preservation
Metadata for digital long-term preservation
 

Kürzlich hochgeladen

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Kürzlich hochgeladen (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Going local with a world-class data infrastructure: Enabling SDMX for research support

  • 1. Going Local with a World-Class Data Infrastructure: Enabling SDMX for Research Support Rob Grim Head Research Support/Research Data Specialist Executive Manager Open Data Foundation (ODaF) Library & IT Services, Tilburg University (Netherlands) IASSIST 2012, June 8 Washington
  • 2. Research Data Support? With SDMX? • Why should we support researchers anyway? • Why should a university use a complex set of standards such as SDMX to support research? • CARDS World Taxation Indicators Curation project • Collaborative research SDMX • Workflow support • Infrastructure development Capture • Metadata management • What does it take get SDMX up and running? 13-6-2012 2
  • 3. Research Data Support (Tilburg University) 1. Archive research data and supplementary materials 2. Register data sources used and provenance Dataset available ! information 3. Assist with dataset description to improve accessibility of datasets 4. Integrated library and data catalogue 5. Subject portals e.g. „European Values Study‟ DDI and RDF in metadata record (hidden) 6. Financial Data Support 13-6-2012 3
  • 4. Research (Data) Support 1. “Research Support”, often Landscape tools used as a synonym for IT support 2. Current research data Dataverse Network (DVN) services focus on data archiving, DMPs, curation Archiving + Access Management 3. Simple approaches to data sharing 4. Portfolio of research data SDMX tools needed to support academic practices Metadata Repository Questasy 5. Potential of metadata management undervalued Survey documentation SDMX Data Repository Aim for “Need to have” instead of “Nice to Have” 13-6-2012 4
  • 5. Metadata Management Metadata Registry Capture Dissemination Tools Capture Capture Time-series data Source: Eurostat Source: OECD.Stat 13-6-2012 5 Tools needed!
  • 6. Why SDMX? 1. SDMX allows us to capture and manage „data intelligence‟ in a formalized and structured way Curation 2. SDMX information model useful to describe time-series data from different disciplines SDMX 3. SDMX offers means to prevent unnecessary replication of data Capture 4. SDMX offers means to deal with confidential data and IPR 5. The standard is well used, training materials, tutorials available 6. SDMX IT tools are available for different FAO platforms: Java .NET 7. FAO OpenSDMX initiative (D4Science) 8. Researchers want „something‟ like OECD.Stat OECD.Stat 13-6-2012 6
  • 7. Workflow Existing tools Time series metadata: Verbs: concepts, dimensions, …Extract from PDF, CSV attributes …Convert toSDMX-ML …Code 4 Registry Table: Overview of registry structures Agency ECB FAOStat GIST IAEG ISO SDMX Agency Scheme 0 0 0 0 0 1 Categorisations 0 0 0 0 0 0 Category schemes 1 0 0 0 0 1 Codelists 2 8 2 10 0 9 URL: WTI Concept schemes 1 1 0 2 1 1 DSD 3 1 0 1 0 0 Dataflows 1 1 Data provider scheme 1 1 Provision agreement 2 13-6-2012 7
  • 8. Where we are now? • Production workflow for SDMX • Populating the metadata registry • Enter (hierachichal) codelists • Concept IDs • Concept Schemes • DSDs • Dataflows • SDMX ML Generic format • WTI Fusion Registry • SDMX data repository • Keep data in the original formats (csv, txt, Stata) • Convert data from a database to SDMX Source: SDMX Information Model • Specific purpose database for SDMX compliant system • Other: Collaborate with FAO, Open SDMX? 13-6-2012 8
  • 9. Metadata registry: Fusion Registry Code Values Codelists
  • 10. Concept Scheme Titelpresentatie in Footer 13-6-2012 10
  • 12. CARDS-project World Taxation Indicators 1. Georgia State University, International Center for Public Policy, World Tax Indicators Portal 2. Tilburg University, prof. Jenny Ligthart 3. Lack of data on personal income tax (PIT), corporate income tax (CIT), Value Added Tax (VAT) and other tax indicators 4. Incomplete series, missing countries, tax data difficult to access (addendums), difficult to compare 5. Work WTI group: statutory tax rates. Tilburg: effective tax rates, corporate income tax. 6. The „raw „data stem from the IMF/GFS and the OECD/Revenue statistics. 13-6-2012 12
  • 13. Lessons learnt so far • Support of senior management is needed to get beyond the project/pilot stage • SDMX standards are complex: steep learning curve • Capacity building is a must (Tip: Eurostat SDMX tutorials) • SDMX data repository: collaborate with other organizations • Focus on DSDs, full target and partial identifiers, hierarchical code lists • Fusion Registry upgrade • Additional (academic) partners welcome to leverage the macro economic time series registry and repository 13-6-2012 13
  • 14. Acknowledgements • CARDS was funded SURF. Final Thought The CARDS project was Don‟t forget! undertaken in 2011 in the framework of the SURFshare Before you ask: programme – Access to Research Data “What you can do for your • WTI group and prof. Jenny country “, ask yourself: Ligthart “What metadata management can do for you” References 1. Burgi-Schmelz, A. (2009). Data to the rescue. Why improved statistical information will be key for prevention of future crises. Finance and Development, 46(1), 31-43. 2. Peter, K. S., Buttrick, S., & Duncan, D. Data appendix to “global reform of personal income taxation, 1981- 2005: Evidence from 189 countries” 3. Peter, K. S., Steve Buttrick, & Duncan, D. (2010). Global reform of personal income taxation, 1981-2005: Evidence from 189 countries. National Tax Journal, 6(3). 13-6-2012 14

Hinweis der Redaktion

  1. Leverage existing data if we curate for machinesAd 5 local perspective: infrastructure development. From SDMX infrastructure perspective: leverage existing infrastructure.
  2. Metadata recordsSupplementary materials (code please!)Curate for machinesMetadata: Conceptueel (wat)Methodologisch (hoe)Quality~~Bruikbaarheid (wow!)