SlideShare ist ein Scribd-Unternehmen logo
1 von 13
Research on the quality of registers to make
data decisions in the Dutch Virtual Census


    Presentation at the 58th World Statistics Congress of the
           International Statistical Institute in Dublin
                         23 August 2011


        Eric Schulte Nordholt (e.schultenordholt@cbs.nl),
Saskia Ossen (sjl.ossen@cbs.nl) and Piet Daas (pjh.daas@cbs.nl)
Combining research



                                                                      Data decisions
Development of                                                        on secundary
a quality                                                             sources in the
framework for                                                         Dutch Virtual
administrative                                                        Census of 2011
data




  Research on the quality of registers to make data decisions in the Dutch
  Virtual Census                                                                  1
Contents

           • Data considerations in the Dutch Census of 2011

           • Introduction to the quality framework

           • Results Source hyper dimension

           • Results Metadata hyper dimension

           • Results Data hyper dimension

           • Conclusions


Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             2
Data considerations in the Dutch
           Census of 2011 (1)
           Last traditional census: 1971

           Unwillingness (nonresponse) and reduction of
           expenses  no more traditional censuses

           Alternative: virtual census
           1981 and 1991: Population Register and surveys
           Development 90’s: more registers →
           2001 and 2011:
           integrated set of registers and surveys
           European Census Act → hypercubes

Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             3
Data considerations in the Dutch
           Census of 2011 (2)
           Registers:
           • Population Register (PR), >16.6 million records
           • Jobs file, containing all employees
           • Self-employed file, containing all self-employed
           • Unemployment Benefit Register (UR)
           • Social Security Register (SR)
           • Education Register (ER)
           • New Housing Register (HR)
           Surveys:
           • Survey on Employment and Earnings (SEE) stopped
           • Labour Force Survey (LFS)
           • Housing survey
Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             4
Introduction to the quality framewok


                                              METADATA:
                                               Focuses on the
         SOURCE: - Focus on data source as a whole
                                               (availability of the)
                - Contact information related  information required to
                - Delivery related aspects     understand and use the
                - and more                     data in the data source




                                                                       DATA:
                                                                       - Technical checks
                                                                       - Accuracy related
                                                                         issues
Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                                       5
Results Source hyper dimension




                                                                           Important
                                Suffers                                    variables
   Low frequency                                        Purpose
                                seriously from                             are
   of delivery                                          dataprovider
                                selective                                  missing
                                                        unclear
                                undercoverage

Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                                     6
Results Metadata hyperdimension




Time period in                    Time differences                         Unique
source can’t be                   in reporting                             keys can’t
transferred easily                periods                                  be easily
to the time point                                                          used for
needed                                                                     linking

Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                                   7
Results Data hyper dimension -
                                      completeness




Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             8
Results Data hyper dimension –
                                           accuracy




Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             9
Results Data hyper dimension – accuracy




Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             10
Conclusions
           • The virtual census has proved to be a successful
           concept in the Netherlands

           • The quality framework is a useful tool for making
           data decisions in the virtual census

           • The “quality study” started in this paper will be
           extended to be able to determine how all census
           variables will be derived




Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             11
Time for questions and discussion
Research on the quality of registers to make data decisions in the Dutch
Virtual Census                                                             12

Weitere ähnliche Inhalte

Ähnlich wie Research on the Quality of Registers To Make Data Decisions in the Dutch Virtual Census.

Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...Health Informatics New Zealand
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesRafael C. Jimenez
 
How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive Louise Corti
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris
 
Data warehouse Vs Big Data
Data warehouse Vs Big Data Data warehouse Vs Big Data
Data warehouse Vs Big Data Lisette ZOUNON
 
To Preserve Or Not To Preserve?
To Preserve Or Not To Preserve?To Preserve Or Not To Preserve?
To Preserve Or Not To Preserve?pbajcsy
 
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...ASIS&T
 
Proposal for a quality framework for the evaluation of administrative and sur...
Proposal for a quality framework for the evaluation of administrative and sur...Proposal for a quality framework for the evaluation of administrative and sur...
Proposal for a quality framework for the evaluation of administrative and sur...Piet J.H. Daas
 
Big Data Session Presentations
Big Data Session PresentationsBig Data Session Presentations
Big Data Session PresentationsePSI Platform
 
Presentation
PresentationPresentation
Presentationejoffe
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12ASIS&T
 
Improving Healthcare Delivery
Improving Healthcare DeliveryImproving Healthcare Delivery
Improving Healthcare DeliveryDave DeBonis
 
Effective Internal Investigations
Effective Internal InvestigationsEffective Internal Investigations
Effective Internal InvestigationsDaegis
 
Linking Nbs To Ehr 130410
Linking Nbs To Ehr 130410Linking Nbs To Ehr 130410
Linking Nbs To Ehr 130410Kari Klossner
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?Graham Pryor
 

Ähnlich wie Research on the Quality of Registers To Make Data Decisions in the Dutch Virtual Census. (20)

Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
 
Sharing data
Sharing dataSharing data
Sharing data
 
Cloud MRM
Cloud MRM Cloud MRM
Cloud MRM
 
Medical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructuresMedical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructures
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciences
 
How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive
 
PEDSnet DQA CHOP Symposium
PEDSnet DQA CHOP SymposiumPEDSnet DQA CHOP Symposium
PEDSnet DQA CHOP Symposium
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
 
Data warehouse Vs Big Data
Data warehouse Vs Big Data Data warehouse Vs Big Data
Data warehouse Vs Big Data
 
To Preserve Or Not To Preserve?
To Preserve Or Not To Preserve?To Preserve Or Not To Preserve?
To Preserve Or Not To Preserve?
 
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
 
Proposal for a quality framework for the evaluation of administrative and sur...
Proposal for a quality framework for the evaluation of administrative and sur...Proposal for a quality framework for the evaluation of administrative and sur...
Proposal for a quality framework for the evaluation of administrative and sur...
 
Big Data Session Presentations
Big Data Session PresentationsBig Data Session Presentations
Big Data Session Presentations
 
Presentation
PresentationPresentation
Presentation
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12
 
Improving Healthcare Delivery
Improving Healthcare DeliveryImproving Healthcare Delivery
Improving Healthcare Delivery
 
Effective Internal Investigations
Effective Internal InvestigationsEffective Internal Investigations
Effective Internal Investigations
 
Linking Nbs To Ehr 130410
Linking Nbs To Ehr 130410Linking Nbs To Ehr 130410
Linking Nbs To Ehr 130410
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?
 

Mehr von Piet J.H. Daas

Big Data and official statistics with examples of their use
Big Data and official statistics with examples of their useBig Data and official statistics with examples of their use
Big Data and official statistics with examples of their usePiet J.H. Daas
 
IT infrastructure for Big Data and Data Science at Statistics Netherlands
IT infrastructure for Big Data and Data Science at Statistics NetherlandsIT infrastructure for Big Data and Data Science at Statistics Netherlands
IT infrastructure for Big Data and Data Science at Statistics NetherlandsPiet J.H. Daas
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)Piet J.H. Daas
 
EMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniquesEMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniquesPiet J.H. Daas
 
Use of social media for official statistics
Use of social media for official statisticsUse of social media for official statistics
Use of social media for official statisticsPiet J.H. Daas
 
Isi 2017 presentation on Big Data and bias
Isi 2017 presentation on Big Data and biasIsi 2017 presentation on Big Data and bias
Isi 2017 presentation on Big Data and biasPiet J.H. Daas
 
Responsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics NetherlandsResponsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics NetherlandsPiet J.H. Daas
 
CBS lecture at the opening of Data Science Campus of ONS
CBS lecture at the opening of Data Science Campus of ONSCBS lecture at the opening of Data Science Campus of ONS
CBS lecture at the opening of Data Science Campus of ONSPiet J.H. Daas
 
Ntts2017 presentation 45
Ntts2017 presentation 45Ntts2017 presentation 45
Ntts2017 presentation 45Piet J.H. Daas
 
Big Data presentation Mannheim
Big Data presentation MannheimBig Data presentation Mannheim
Big Data presentation MannheimPiet J.H. Daas
 
Extracting information from ' messy' social media data
Extracting information from ' messy' social media dataExtracting information from ' messy' social media data
Extracting information from ' messy' social media dataPiet J.H. Daas
 
Big data cbs_piet_daas
Big data cbs_piet_daasBig data cbs_piet_daas
Big data cbs_piet_daasPiet J.H. Daas
 
Gebruik van sociale media voor de officiële statistiek
Gebruik van sociale media voor de officiële statistiekGebruik van sociale media voor de officiële statistiek
Gebruik van sociale media voor de officiële statistiekPiet J.H. Daas
 
Profiling Big Data sources to assess their selectivity
Profiling Big Data sources to assess their selectivityProfiling Big Data sources to assess their selectivity
Profiling Big Data sources to assess their selectivityPiet J.H. Daas
 
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data MethodologyUsing Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data MethodologyPiet J.H. Daas
 
Big Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in EindhovenBig Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in EindhovenPiet J.H. Daas
 
Big Data presentation for Statistics Canada
Big Data presentation for Statistics CanadaBig Data presentation for Statistics Canada
Big Data presentation for Statistics CanadaPiet J.H. Daas
 
Quality challenges in modernising business statistics
Quality challenges in modernising business statisticsQuality challenges in modernising business statistics
Quality challenges in modernising business statisticsPiet J.H. Daas
 
Quality Approaches to Big Data
Quality Approaches to Big DataQuality Approaches to Big Data
Quality Approaches to Big DataPiet J.H. Daas
 

Mehr von Piet J.H. Daas (20)

Big Data and official statistics with examples of their use
Big Data and official statistics with examples of their useBig Data and official statistics with examples of their use
Big Data and official statistics with examples of their use
 
IT infrastructure for Big Data and Data Science at Statistics Netherlands
IT infrastructure for Big Data and Data Science at Statistics NetherlandsIT infrastructure for Big Data and Data Science at Statistics Netherlands
IT infrastructure for Big Data and Data Science at Statistics Netherlands
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
 
EMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniquesEMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniques
 
Use of social media for official statistics
Use of social media for official statisticsUse of social media for official statistics
Use of social media for official statistics
 
Isi 2017 presentation on Big Data and bias
Isi 2017 presentation on Big Data and biasIsi 2017 presentation on Big Data and bias
Isi 2017 presentation on Big Data and bias
 
Responsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics NetherlandsResponsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics Netherlands
 
CBS lecture at the opening of Data Science Campus of ONS
CBS lecture at the opening of Data Science Campus of ONSCBS lecture at the opening of Data Science Campus of ONS
CBS lecture at the opening of Data Science Campus of ONS
 
Ntts2017 presentation 45
Ntts2017 presentation 45Ntts2017 presentation 45
Ntts2017 presentation 45
 
Big Data presentation Mannheim
Big Data presentation MannheimBig Data presentation Mannheim
Big Data presentation Mannheim
 
Extracting information from ' messy' social media data
Extracting information from ' messy' social media dataExtracting information from ' messy' social media data
Extracting information from ' messy' social media data
 
Big data cbs_piet_daas
Big data cbs_piet_daasBig data cbs_piet_daas
Big data cbs_piet_daas
 
Gebruik van sociale media voor de officiële statistiek
Gebruik van sociale media voor de officiële statistiekGebruik van sociale media voor de officiële statistiek
Gebruik van sociale media voor de officiële statistiek
 
Big Data @ CBS
Big Data @ CBSBig Data @ CBS
Big Data @ CBS
 
Profiling Big Data sources to assess their selectivity
Profiling Big Data sources to assess their selectivityProfiling Big Data sources to assess their selectivity
Profiling Big Data sources to assess their selectivity
 
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data MethodologyUsing Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
 
Big Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in EindhovenBig Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in Eindhoven
 
Big Data presentation for Statistics Canada
Big Data presentation for Statistics CanadaBig Data presentation for Statistics Canada
Big Data presentation for Statistics Canada
 
Quality challenges in modernising business statistics
Quality challenges in modernising business statisticsQuality challenges in modernising business statistics
Quality challenges in modernising business statistics
 
Quality Approaches to Big Data
Quality Approaches to Big DataQuality Approaches to Big Data
Quality Approaches to Big Data
 

Research on the Quality of Registers To Make Data Decisions in the Dutch Virtual Census.

  • 1. Research on the quality of registers to make data decisions in the Dutch Virtual Census Presentation at the 58th World Statistics Congress of the International Statistical Institute in Dublin 23 August 2011 Eric Schulte Nordholt (e.schultenordholt@cbs.nl), Saskia Ossen (sjl.ossen@cbs.nl) and Piet Daas (pjh.daas@cbs.nl)
  • 2. Combining research Data decisions Development of on secundary a quality sources in the framework for Dutch Virtual administrative Census of 2011 data Research on the quality of registers to make data decisions in the Dutch Virtual Census 1
  • 3. Contents • Data considerations in the Dutch Census of 2011 • Introduction to the quality framework • Results Source hyper dimension • Results Metadata hyper dimension • Results Data hyper dimension • Conclusions Research on the quality of registers to make data decisions in the Dutch Virtual Census 2
  • 4. Data considerations in the Dutch Census of 2011 (1) Last traditional census: 1971 Unwillingness (nonresponse) and reduction of expenses  no more traditional censuses Alternative: virtual census 1981 and 1991: Population Register and surveys Development 90’s: more registers → 2001 and 2011: integrated set of registers and surveys European Census Act → hypercubes Research on the quality of registers to make data decisions in the Dutch Virtual Census 3
  • 5. Data considerations in the Dutch Census of 2011 (2) Registers: • Population Register (PR), >16.6 million records • Jobs file, containing all employees • Self-employed file, containing all self-employed • Unemployment Benefit Register (UR) • Social Security Register (SR) • Education Register (ER) • New Housing Register (HR) Surveys: • Survey on Employment and Earnings (SEE) stopped • Labour Force Survey (LFS) • Housing survey Research on the quality of registers to make data decisions in the Dutch Virtual Census 4
  • 6. Introduction to the quality framewok METADATA: Focuses on the SOURCE: - Focus on data source as a whole (availability of the) - Contact information related information required to - Delivery related aspects understand and use the - and more data in the data source DATA: - Technical checks - Accuracy related issues Research on the quality of registers to make data decisions in the Dutch Virtual Census 5
  • 7. Results Source hyper dimension Important Suffers variables Low frequency Purpose seriously from are of delivery dataprovider selective missing unclear undercoverage Research on the quality of registers to make data decisions in the Dutch Virtual Census 6
  • 8. Results Metadata hyperdimension Time period in Time differences Unique source can’t be in reporting keys can’t transferred easily periods be easily to the time point used for needed linking Research on the quality of registers to make data decisions in the Dutch Virtual Census 7
  • 9. Results Data hyper dimension - completeness Research on the quality of registers to make data decisions in the Dutch Virtual Census 8
  • 10. Results Data hyper dimension – accuracy Research on the quality of registers to make data decisions in the Dutch Virtual Census 9
  • 11. Results Data hyper dimension – accuracy Research on the quality of registers to make data decisions in the Dutch Virtual Census 10
  • 12. Conclusions • The virtual census has proved to be a successful concept in the Netherlands • The quality framework is a useful tool for making data decisions in the virtual census • The “quality study” started in this paper will be extended to be able to determine how all census variables will be derived Research on the quality of registers to make data decisions in the Dutch Virtual Census 11
  • 13. Time for questions and discussion Research on the quality of registers to make data decisions in the Dutch Virtual Census 12