Connecting European
archaeology datasets:
prospects and challenges
Kate Fernie, 2Culture Associates
Big Data in Archaeology: Practicalities and Possibilities
27-28 March 2019
• CARARE
• A brief history
• Datasets and their diversity
• Metadata and schemas
• Challenges
• Possibilities
Introduction
CARARE
Connecting Archaeology and Architecture in Europe
• Began as an EU-funded best practice network in 2010
• Established as a membership association in 2016
• Objective: Advancing professional practice and fostering appreciation
of the digital archaeological and architectural heritage
• Areas:
• Good practices, advice and guidance
• Services to enable data sharing
• CARARE metadata schema
• Promoting re-use
http://www.carare.eu/
Steps on the way to CARARE
• A shared vision
• International collaborations on
heritage data (CIDOC, Arena,
Aquarelle, DARIAH, INSPIRE,
Europeana, etc.)
• Digitisation and use of digital
technologies
• GIS
• Technical infrastructures
A brief history
Who is collecting archaeological and architectural heritage data?
• State agencies
• inventories of protected sites, monuments and buildings
• conservation records, field investigations, surveys
• Museums – finds and excavation archives
• Research Institutions & researchers
• Libraries
Datasets
Image: Swedish National Heritage Board
CARARE and related projects have aggregated over 6 million digital
objects from 20+ countries for Europeana.eu
Many different types of object
• Inventory records, reports, photographs, drawings, books, videos, objects,
aerial photos, GIS datasets, 3D datasets, models, reconstructions, and more
Many different ways of recording objects
• Heritage agencies, museums, archives, libraries, researchers all have
different ways of describing objects
Many different languages, vocabularies, time periods and map systems
Rather diverse
Image: Royal motorcycle tournament in London, changing a
sidecar wheel while in motion, 1932.
Mondial Photo-Presse press agency.
We work with
the metadata
that’s provided
CARARE defined a metadata model for metadata aggregation
• Standards based: CIDOC core standards, MIDAS Heritage, LIDO and EDM
• Distinguishes between “heritage assets” (monument, building, painting, book,
image, film, 3D) and digital representations found online
• Allows for events (field activities, lab work) and collections
• Supports objects that are composed of other objects (complexes and
hierarchies)
• Is rich where the domain calls for it (e.g. time, space, monument character)
The schema mediates between native data (exports) and enables
their transformation into a common format
Combining datasets
Let’s see an example
MINT
• Metadata mapping (from
native to target schema)
• Preview
• Statistics
• Transformation (to target
schema(s))
Rijksdienst voor het Cultureel Erfgoed:
Rijksmonumenten
Making connections
[Diagram labels: Heritage asset; Has representation (twice); is related; Has Met]
Images: Instituto Universitario de Investigación en Arqueología Ibérica
“Hornos de Peal, Jaén”
Relationships between the main CARARE classes:
• Heritage asset, digital resources and events
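The relationships between the main classes can be pictured with a small data structure. This is only a sketch in Python, not the CARARE schema itself: the class names, field names and example URLs are assumptions made for illustration, and the "part" and "activity" entries are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DigitalResource:
    """An online representation (image, text, 3D model) of a heritage asset."""
    url: str
    media_type: str


@dataclass
class Activity:
    """An event such as a field activity or lab work."""
    name: str


@dataclass
class HeritageAsset:
    """A monument, building or object; it may be composed of other assets."""
    name: str
    representations: List[DigitalResource] = field(default_factory=list)
    related_activities: List[Activity] = field(default_factory=list)
    parts: List["HeritageAsset"] = field(default_factory=list)


def count_representations(asset: HeritageAsset) -> int:
    """Count the representations of an asset and, recursively, of its parts."""
    own = len(asset.representations)
    return own + sum(count_representations(part) for part in asset.parts)


# Hypothetical example data; the URLs are placeholders.
site = HeritageAsset(name="Hornos de Peal, Jaén")
site.representations.append(DigitalResource("http://example.org/img/1.jpg", "image"))
part = HeritageAsset(name="Hypothetical component of the site")
part.representations.append(DigitalResource("http://example.org/img/2.jpg", "image"))
site.parts.append(part)
site.related_activities.append(Activity("Hypothetical field survey"))
```

Keeping the asset separate from its representations means several images or models can point back to the same monument, and a complex can be walked as a hierarchy.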
Enriching metadata during mapping
Heritage asset
Images: Instituto Universitario de Investigación en Arqueología Ibérica
“Hornos de Peal, Jaén”
Adding constants: LOD AAT concepts
<car:heritageAssetType>http://vocab.getty.edu/aat/300054328</car:heritageAssetType>
<car:heritageAssetType>http://vocab.getty.edu/aat/300000810</car:heritageAssetType>
<car:heritageAssetType>http://vocab.getty.edu/aat/300305500</car:heritageAssetType>
Language identification
<car:heritageAssetType lang="es">Necrópolis</car:heritageAssetType>
Mapping the metadata gives an opportunity to
make some simple enrichments, by adding:
• Language of the metadata
• Name of the provider
• Country of provider
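Adding such constants during mapping could look roughly like the following Python sketch. It treats a record as a plain dictionary; the key names and the `enrich` function are assumptions for illustration and are not part of MINT or the CARARE schema.

```python
from typing import Dict


def enrich(record: Dict[str, str], language: str, provider: str, country: str) -> Dict[str, str]:
    """Return a copy of the record with constant values filled in.

    Values already present in the source record are left untouched.
    """
    enriched = dict(record)
    enriched.setdefault("language", language)
    enriched.setdefault("provider", provider)
    enriched.setdefault("country", country)
    return enriched


# Illustrative values only.
source = {"heritageAssetType": "Necrópolis"}
result = enrich(
    source,
    language="es",
    provider="Instituto Universitario de Investigación en Arqueología Ibérica",
    country="Spain",
)
```

Because the same constants apply to every record in a dataset, they can be set once in the mapping rather than edited record by record.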
There’s a difference between doing a schema mapping and a mapping to
transform real data.
Data issues can include:
• Data that doesn’t conform entirely to the scope of an element
• Multiple values within a single element (separators)
• Placeholder data inserted in mandatory elements (e.g. “n/a”)
• Lack of unique values
A good mapping can address some of these issues, e.g. by splitting
multiple subject concepts into separate elements.
(Issues can be fixed at source, but this can be time consuming with datasets that
include hundreds of thousands of records.)
Quality issues
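As an illustration of how a mapping can address two of these issues, the Python sketch below splits a multi-valued field into separate values and drops placeholders. The separator characters and the placeholder list are assumptions; real datasets would need their own rules.

```python
import re
from typing import List

# Assumed conventions: ";" or "|" separate values, and these strings mean "no value".
SEPARATORS = re.compile(r"\s*[;|]\s*")
PLACEHOLDERS = {"", "n/a", "-"}


def split_values(raw: str) -> List[str]:
    """Split one source element into distinct values, skipping placeholders."""
    seen = set()
    values = []
    for part in SEPARATORS.split(raw.strip()):
        key = part.casefold()
        if key in PLACEHOLDERS or key in seen:
            continue  # ignore "n/a"-style fillers and case-insensitive repeats
        seen.add(key)
        values.append(part)
    return values


subjects = split_values("necropolis; tomb ;Necropolis | n/a")
```

Each returned value can then be written to its own target element, which is usually cheaper than correcting hundreds of thousands of records at source.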
Transformation: some semantic gains
Through transformation to a
common schema, we achieve
interoperability between
disparate datasets
• Enabling cross searches
(what, when, where, who)
• Open licensing of the
metadata and APIs enables
reuse in various applications
http://eculturemap.eculturelab.eu/eCulture14m/Map.html?
• Metadata mapping is rarely easy
• Metadata models are complex, with subtle differences in world view
• Statistical metrics can reveal diverging recording practices and other
quality issues
• Native metadata is designed to serve specific purposes
• Local context, audiences and questions
• Merging metadata from various organisations in different
countries/languages poses special challenges
Some challenges
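One simple statistical metric is field completeness, i.e. the share of records in which an element is actually filled. The Python sketch below is an assumption about how such a figure might be computed and is not the statistics function of MINT; the record keys are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List


def completeness(records: Iterable[dict], fields: Iterable[str]) -> Dict[str, float]:
    """Return, per field, the fraction of records with a non-empty value."""
    record_list: List[dict] = list(records)
    field_list: List[str] = list(fields)
    filled = Counter()
    for record in record_list:
        for name in field_list:
            value = record.get(name)
            if value not in (None, "", []):
                filled[name] += 1
    total = len(record_list)
    return {name: (filled[name] / total if total else 0.0) for name in field_list}


# Hypothetical records from one provider.
sample = [
    {"title": "A", "place": "Jaén"},
    {"title": "B", "place": ""},
    {"title": "C"},
    {"title": "", "place": "X"},
]
stats = completeness(sample, ["title", "place"])
```

Comparing such figures between providers gives a quick indication of where recording practices diverge before any detailed mapping work starts.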
Aggregators like CARARE enable transformation of metadata into a
common model and have some services to enable further work
• Language labelling
• Adding Linked Open Data
• Automatic enrichment
• Crowdsourcing
Aggregating and enriching
MORe
One of the big challenges in searching across datasets in Europe is
dealing with data in different languages
Linguistic resources and translation tools are increasingly available, but they
first need to know which language the data is in
• Language labels are often missing
• Language identification and labelling microservices
Interfaces, displays and search services can adapt to users’ preferred
language and in this way return results which are relevant but which have
been catalogued in unfamiliar languages.
Why add language information to data?
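To make the idea of language identification concrete, here is a deliberately naive Python sketch that guesses a language by counting common function words. It is not the CARARE microservice; the word lists, the threshold and the function name are assumptions, and a production service would use a trained language identifier.

```python
from typing import Optional

# Tiny, hand-picked function-word lists; purely illustrative.
STOPWORDS = {
    "en": {"the", "and", "of", "in", "with"},
    "es": {"el", "la", "de", "y", "con", "los"},
    "fr": {"le", "la", "de", "et", "avec", "les", "une", "en"},
}


def guess_language(text: str, min_hits: int = 2) -> Optional[str]:
    """Return a language code, or None when the evidence is weak or tied."""
    tokens = text.casefold().split()
    scores = {
        lang: sum(token in words for token in tokens)
        for lang, words in STOPWORDS.items()
    }
    ranked = sorted(scores.values(), reverse=True)
    best = max(scores, key=scores.get)
    if scores[best] < min_hits or ranked[0] == ranked[1]:
        return None  # too little text to label safely
    return best


label = guess_language("Necrópolis de la Edad del Hierro con los túmulos")
```

A single term such as "Necrópolis" yields no label with this approach, which illustrates why short metadata values are hard to label automatically and why labels supplied at source are valuable.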
CARARE microservices include:
• Natural language processing techniques to enable subject concepts
and names to be extracted from text
• Geocoding services to add coordinates for named places
• Vocabulary matching services
• Geo conversion, inversion and normalization services
Automated enrichment
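Vocabulary matching can be sketched as a lookup from normalised local terms to concept URIs. The Python below is a minimal illustration under stated assumptions: the URIs are placeholders on example.org rather than real AAT identifiers, and the label lists are invented for the example.

```python
import unicodedata
from typing import Dict, List, Optional


def normalise(term: str) -> str:
    """Lower-case a term, trim it and strip diacritics for comparison."""
    decomposed = unicodedata.normalize("NFKD", term.strip().casefold())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def build_index(vocabulary: Dict[str, List[str]]) -> Dict[str, str]:
    """Map every normalised label to the URI of its concept."""
    index = {}
    for uri, labels in vocabulary.items():
        for concept_label in labels:
            index[normalise(concept_label)] = uri
    return index


def match_term(term: str, index: Dict[str, str]) -> Optional[str]:
    """Return the concept URI for a local term, or None if nothing matches."""
    return index.get(normalise(term))


# Placeholder vocabulary; URIs and labels are hypothetical.
VOCAB = {
    "http://example.org/concept/necropolis": ["necropolis", "necrópolis", "nécropole"],
    "http://example.org/concept/kiln": ["kiln", "horno", "four"],
}
INDEX = build_index(VOCAB)
matched = match_term("NECRÓPOLIS ", INDEX)
```

Exact matching after normalisation misses plurals and spelling variants ("Hornos" is not matched here), so unmatched terms still need manual or fuzzy follow-up.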
Location case study
• Location is important for archaeology but place information is often
missing, especially for content from library, archive and museum
collections
• Automated extraction techniques can identify place names in data, but
place names are not unique
• The process requires quality control
• Crowdsourcing is one way of harnessing the knowledge of individuals
to check the results of automated enrichment and place objects
correctly on the map
• One such service was developed by the LoCloud project
Crowdsourcing
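The quality-control step can be pictured as a geocoder that accepts only unambiguous matches and sends the rest for human review. The Python sketch below is not the LoCloud service; the gazetteer is a tiny hypothetical table with approximate coordinates, and the status names are assumptions.

```python
from typing import Dict, List, Tuple

Coordinates = Tuple[float, float]  # (latitude, longitude)

# Hypothetical gazetteer; coordinates are approximate and for illustration only.
GAZETTEER: Dict[str, List[Tuple[str, Coordinates]]] = {
    "newport": [
        ("Newport, Wales", (51.59, -3.00)),
        ("Newport, Isle of Wight", (50.70, -1.29)),
    ],
    "jaén": [("Jaén, Spain", (37.77, -3.79))],
}


def geocode(place_name: str) -> dict:
    """Accept a unique match, flag ambiguous names, report unknown ones."""
    candidates = GAZETTEER.get(place_name.strip().casefold(), [])
    if len(candidates) == 1:
        label, coords = candidates[0]
        return {"status": "accepted", "label": label, "coordinates": coords}
    if candidates:
        # More than one place shares the name: leave the choice to a reviewer.
        return {"status": "needs_review", "candidates": [label for label, _ in candidates]}
    return {"status": "not_found"}


ambiguous = geocode("Newport")
unique = geocode(" Jaén ")
```

Records flagged "needs_review" are the natural queue for a crowdsourcing tool, where a person who knows the object can pick the right place on a map.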
Map tools
The content aggregated by CARARE is in Europeana
Take a look: www.europeana.eu
Is it big data?
• Volume – 2-4 million assets aggregated by CARARE
• Includes the national heritage inventories for several
countries, which are individually quite large datasets
• Europeana includes another 1 million+ assets relevant for
archaeology aggregated by other projects
• Includes museum and library collections, film archives,
newspaper reports
• Quite big?
• New research would be great!
kfernie27@gmail.com
Any questions?
www.carare.eu
