SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Richard Zijdeman [richard.zijdeman at iisg.nl]
Kathrin Dentler
Rinke Hoekstra
Albert Meroño-Peñuela
Advancing the comparability of
occupational data through
Linked Open Data
HISCO workshop
Historical Population Database of Transylvania
Cluj, Romania
June 18, 2016
... it is market position, and especially position in the occupational
division of labour, which is fundamental to the generation of
structured inequalities. The life chances of individuals and
families are largely determined by their position in the market and
occupation is taken to be its central indicator ... .
(Rose and Harrison, 2010)
2
3
Occupations are important as dependent variables (occupational
attainment studies) and independent variables (occupation
stratification studies) in educational (and occupational) status
attainment, health, voting, consumption, marriage etc.
(Ganzeboom, 2008)
Occupations are one of the few indicators of social position that
are available in:
• large quantities
• different time periods
• various societies
• at the individual level (smallest level of detail)
4
Lack of comparability
• Many different occupational classifications
• Differences in mobility studies could results
from different classification methods
(Kaelble 1985)
5
Charles Booth (1886-1903)
HISCO
• Historical International Standard Classification of Occupations
• Put together by a large number of institutes
• Based on ILO’s ISCO ’68
• Occupations retrieved from registers
• 1675 occupational codes
6
Current solution: 2-step procedure
Code into the concept, first:
• Classify into the concept (HISCO)
• Link the measure of stratification to the concept (e.g. SOCPO,
HISCAM)
7
New problems
1. What concept?
• Historical International Standard Classification (HISCO)
• OCCHISCO
• PST
2. Not all measures link to all concepts
• E.g. no link between OCCHISCO and HISCAM
3. Adaptability of concepts (new versions)
8
Is this a substantive problem?
Illustrative example:
• Subset of SAME occupational titles from NAPP and HISCO
• Link these occupations to HISCAM
• For HISCO directly provided by HISCAM people
• For OCCHISCO indirectly through a mapping
9
10
occupations
OCCHISCO
HISCO
HISCAMCross-
walk
E.g.: necessary for a comparison between Norway and the Netherlands
11
12
So yes, this is problematic
• ‘Lost’ 41% explained variance
• Cf. regression models: usually not above 30%
• HISCAM often both as dependent and independent variable
13
New problems
1. What concept?
• Historical International Standard Classification (HISCO)
• OCCHISCO
• PST
2. Not all measures link to all concepts
• E.g. no link between OCCHISCO and HISCAM
3. Adaptability of concepts (new versions)
14
Towards a solution
• Linked Data (Berners-Lee, 2006)
• Define Resources (books, respondents, etc.) with a URI
• Present URI’s as URL’s
• Describe Resources using so called ’triples’
15
An example of a triple
16
Margaret Miner
works as
PropertyResource Value
17
Miner
occupation
is of type
Resource
Property
Value
18
Miner
occupation
is of type
Margaret Miner
works as
19
miner
50.56
71105
71120
has
occhisco
has hisco
has hiscam
Occupational title
Source
PST: 123
OCCHISCO: 123
HISCO: 12345
HISCO: 54321
Was
DerivedFrom
HISCAM: 88
codedByMappingFile
Provenance
21
HISCO vocabulary
22
• hisco:entry for
‘occupational titles’
• transitivity between
category, unit, minor and
major group
Case study: DBpedia
- Structured data behind Wikipedia
- Information on all kinds of topics, also occupations
- Add HISCO codes to DBpedia occupations
- Let’s try and do this live: http://yasgui.org/short/VJfZvnx6x
23
Caveats
• We did not check the technique on a really big scale
(e.g. NAPP data)
• Sharing code remains a collective action problem
(but less of a coordination problem)
24
Conclusions
Linked Data
• Enhances comparative occupational research
• Adds visibility of heterogeneity in coding practices
25
Outlook
• Linkage to texts (occupations in newspapers)
• Linkage to public resources: Wikipedia
• Combine Machine Learning and Linked Data for automated
occupational coding
26
Thank you
richard.zijdeman@iisg.nl
27

Weitere ähnliche Inhalte

Ähnlich wie Advancing the comparability of occupational data through Linked Open Data

Edited volume on ESP didactics: DidASP GERAS 2018
Edited volume on ESP didactics: DidASP GERAS 2018Edited volume on ESP didactics: DidASP GERAS 2018
Edited volume on ESP didactics: DidASP GERAS 2018
Shona Whyte
 
Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...
Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...
Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...
Dr Wayne Barry
 
SAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docx
SAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docxSAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docx
SAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docx
rtodd599
 
Turnover intentions among home workers and European Union migrants A proposal...
Turnover intentions among home workers and European Union migrants A proposal...Turnover intentions among home workers and European Union migrants A proposal...
Turnover intentions among home workers and European Union migrants A proposal...
Jaroslav Aleksandrovic
 
The challenges of using education as a means of addressing persistent unemplo...
The challenges of using education as a means of addressing persistent unemplo...The challenges of using education as a means of addressing persistent unemplo...
The challenges of using education as a means of addressing persistent unemplo...
network_trainers
 

Ähnlich wie Advancing the comparability of occupational data through Linked Open Data (20)

Informal work in a post-transition country - some evidence from Poland
Informal work in a post-transition country - some evidence from PolandInformal work in a post-transition country - some evidence from Poland
Informal work in a post-transition country - some evidence from Poland
 
Seminar Basque & Iceland Connetion Calzada PhD & Casado PhD University of Ice...
Seminar Basque & Iceland Connetion Calzada PhD & Casado PhD University of Ice...Seminar Basque & Iceland Connetion Calzada PhD & Casado PhD University of Ice...
Seminar Basque & Iceland Connetion Calzada PhD & Casado PhD University of Ice...
 
Edited volume on ESP didactics: DidASP GERAS 2018
Edited volume on ESP didactics: DidASP GERAS 2018Edited volume on ESP didactics: DidASP GERAS 2018
Edited volume on ESP didactics: DidASP GERAS 2018
 
Phd transfer seminar July 2016
Phd transfer seminar July 2016Phd transfer seminar July 2016
Phd transfer seminar July 2016
 
The role of theory in research on the education and learning of adults
The role of theory in research on the education and learning of adultsThe role of theory in research on the education and learning of adults
The role of theory in research on the education and learning of adults
 
The Complexity of Data: Computer Simulation and “Everyday” Social Science
The Complexity of Data: Computer Simulation and “Everyday” Social ScienceThe Complexity of Data: Computer Simulation and “Everyday” Social Science
The Complexity of Data: Computer Simulation and “Everyday” Social Science
 
Studying young people’s online social practices
Studying young people’s online social practicesStudying young people’s online social practices
Studying young people’s online social practices
 
Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...
Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...
Crossing Epistemic Boundaries – The Professional Learning of Academics in Hig...
 
TL_Thompson.pptx.ppt
TL_Thompson.pptx.pptTL_Thompson.pptx.ppt
TL_Thompson.pptx.ppt
 
SAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docx
SAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docxSAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docx
SAMPLE ANSWERPublic Attributions for Poverty in Canada – Reutt.docx
 
Qualitative methods in CSCW research
Qualitative methods in CSCW researchQualitative methods in CSCW research
Qualitative methods in CSCW research
 
Turnover intentions among home workers and European Union migrants A proposal...
Turnover intentions among home workers and European Union migrants A proposal...Turnover intentions among home workers and European Union migrants A proposal...
Turnover intentions among home workers and European Union migrants A proposal...
 
SDS Networking Event breakout session slides - PhD overview
SDS Networking Event breakout session slides - PhD overviewSDS Networking Event breakout session slides - PhD overview
SDS Networking Event breakout session slides - PhD overview
 
Individualization
IndividualizationIndividualization
Individualization
 
The challenges of using education as a means of addressing persistent unemplo...
The challenges of using education as a means of addressing persistent unemplo...The challenges of using education as a means of addressing persistent unemplo...
The challenges of using education as a means of addressing persistent unemplo...
 
Trainers in Europe: Community IT Centres
Trainers in Europe: Community IT CentresTrainers in Europe: Community IT Centres
Trainers in Europe: Community IT Centres
 
Paolo Landri - Actor Network Theory and the Investigation of Education Policy...
Paolo Landri - Actor Network Theory and the Investigation of Education Policy...Paolo Landri - Actor Network Theory and the Investigation of Education Policy...
Paolo Landri - Actor Network Theory and the Investigation of Education Policy...
 
Euraxess ERD2018 Presentation on a JSPS Usability & eHealth Project
Euraxess ERD2018 Presentation on a JSPS Usability & eHealth Project Euraxess ERD2018 Presentation on a JSPS Usability & eHealth Project
Euraxess ERD2018 Presentation on a JSPS Usability & eHealth Project
 
Menendez - Policies and preferences of academic actors
Menendez - Policies and preferences of academic actorsMenendez - Policies and preferences of academic actors
Menendez - Policies and preferences of academic actors
 
Occupational Mobility of Routine Workers
Occupational Mobility of Routine WorkersOccupational Mobility of Routine Workers
Occupational Mobility of Routine Workers
 

Mehr von Richard Zijdeman

Mehr von Richard Zijdeman (12)

Linked Data: Een extra ontstluitingslaag op archieven
Linked Data: Een extra ontstluitingslaag op archieven Linked Data: Een extra ontstluitingslaag op archieven
Linked Data: Een extra ontstluitingslaag op archieven
 
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
 
grlc. store, share and run sparql queries
grlc. store, share and run sparql queriesgrlc. store, share and run sparql queries
grlc. store, share and run sparql queries
 
Rijpma's Catasto meets SPARQL dhb2017_workshop
Rijpma's Catasto meets SPARQL dhb2017_workshopRijpma's Catasto meets SPARQL dhb2017_workshop
Rijpma's Catasto meets SPARQL dhb2017_workshop
 
Data legend dh_benelux_2017.key
Data legend dh_benelux_2017.keyData legend dh_benelux_2017.key
Data legend dh_benelux_2017.key
 
Toogdag 2017
Toogdag 2017Toogdag 2017
Toogdag 2017
 
Basic introduction into R
Basic introduction into RBasic introduction into R
Basic introduction into R
 
work in a globalized world
work in a globalized worldwork in a globalized world
work in a globalized world
 
Introduction into R for historians (part 3: examine and import data)
Introduction into R for historians (part 3: examine and import data)Introduction into R for historians (part 3: examine and import data)
Introduction into R for historians (part 3: examine and import data)
 
Introduction into R for historians (part 1: introduction)
Introduction into R for historians (part 1: introduction)Introduction into R for historians (part 1: introduction)
Introduction into R for historians (part 1: introduction)
 
Historical occupational classification and stratification schemes (lecture)
Historical occupational classification and stratification schemes (lecture)Historical occupational classification and stratification schemes (lecture)
Historical occupational classification and stratification schemes (lecture)
 
Using HISCO and HISCAM to code and analyze occupations
Using HISCO and HISCAM to code and analyze occupationsUsing HISCO and HISCAM to code and analyze occupations
Using HISCO and HISCAM to code and analyze occupations
 

Kürzlich hochgeladen

Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
Silpa
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 

Kürzlich hochgeladen (20)

GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 

Advancing the comparability of occupational data through Linked Open Data

  • 1. Richard Zijdeman [richard.zijdeman at iisg.nl] Kathrin Dentler Rinke Hoekstra Albert Meroño-Peñuela Advancing the comparability of occupational data through Linked Open Data HISCO workshop Historical Population Database of Transylvania Cluj, Romania June 18, 2016
  • 2. ... it is market position, and especially position in the occupational division of labour, which is fundamental to the generation of structured inequalities. The life chances of individuals and families are largely determined by their position in the market and occupation is taken to be its central indicator ... . (Rose and Harrison, 2010) 2
  • 3. 3 Occupations are important as dependent variables (occupational attainment studies) and independent variables (occupation stratification studies) in educational (and occupational) status attainment, health, voting, consumption, marriage etc. (Ganzeboom, 2008)
  • 4. Occupations are one of the few indicators of social position that are available in: • large quantities • different time periods • various societies • at the individual level (smallest level of detail) 4
  • 5. Lack of comparability • Many different occupational classifications • Differences in mobility studies could results from different classification methods (Kaelble 1985) 5 Charles Booth (1886-1903)
  • 6. HISCO • Historical International Standard Classification of Occupations • Put together by a large number of institutes • Based on ILO’s ISCO ’68 • Occupations retrieved from registers • 1675 occupational codes 6
  • 7. Current solution: 2-step procedure Code into the concept, first: • Classify into the concept (HISCO) • Link the measure of stratification to the concept (e.g. SOCPO, HISCAM) 7
  • 8. New problems 1. What concept? • Historical International Standard Classification (HISCO) • OCCHISCO • PST 2. Not all measures link to all concepts • E.g. no link between OCCHISCO and HISCAM 3. Adaptability of concepts (new versions) 8
  • 9. Is this a substantive problem? Illustrative example: • Subset of SAME occupational titles from NAPP and HISCO • Link these occupations to HISCAM • For HISCO directly provided by HISCAM people • For OCCHISCO indirectly through a mapping 9
  • 10. 10 occupations OCCHISCO HISCO HISCAMCross- walk E.g.: necessary for a comparison between Norway and the Netherlands
  • 11. 11
  • 12. 12
  • 13. So yes, this is problematic • ‘Lost’ 41% explained variance • Cf. regression models: usually not above 30% • HISCAM often both as dependent and independent variable 13
  • 14. New problems 1. What concept? • Historical International Standard Classification (HISCO) • OCCHISCO • PST 2. Not all measures link to all concepts • E.g. no link between OCCHISCO and HISCAM 3. Adaptability of concepts (new versions) 14
  • 15. Towards a solution • Linked Data (Berners-Lee, 2006) • Define Resources (books, respondents, etc.) with a URI • Present URI’s as URL’s • Describe Resources using so called ’triples’ 15
  • 16. An example of a triple 16 Margaret Miner works as PropertyResource Value
  • 20. Occupational title Source PST: 123 OCCHISCO: 123 HISCO: 12345 HISCO: 54321 Was DerivedFrom HISCAM: 88 codedByMappingFile Provenance
  • 22. 22 • hisco:entry for ‘occupational titles’ • transitivity between category, unit, minor and major group
  • 23. Case study: DBpedia - Structured data behind Wikipedia - Information on all kinds of topics, also occupations - Add HISCO codes to DBpedia occupations - Let’s try and do this live: http://yasgui.org/short/VJfZvnx6x 23
  • 24. Caveats • We did not check the technique on a really big scale (e.g. NAPP data) • Sharing code remains a collective action problem (but less of a coordination problem) 24
  • 25. Conclusions Linked Data • Enhances comparative occupational research • Adds visibility of heterogeneity in coding practices 25
  • 26. Outlook • Linkage to texts (occupations in newspapers) • Linkage to public resources: Wikipedia • Combine Machine Learning and Linked Data for automated occupational coding 26