by 
Keith May @Keith_May
Ceri Binding & Prof Doug Tudhope

Faculty of Advanced Technology

University of South Wales

To Boldly or Bravely Go? Experiences of
using Semantic Technologies for
Archaeological Resources
Excavation record data modelling
• CRM-EH focuses on the common ‘core’ concepts of our archaeological processes
• Stratigraphic relationships (e.g. Harris matrix) are crucial for relating individual records
• Only a limited degree of the minute archaeological detail is mapped to CIDOC CRM
• Different broad categories of contexts (Deposits, Masonry, Timber, etc.) are handled by separate forms but modelled together
• Model is already "complex" enough - most archaeologists find it a little daunting
Details of Context on recording form
What about comparing records across different countries?
With thanks to Anja Masur
Documentation
• Different excavation methods bring differing documentation
• Comparison of different documentation sheets
Similarities and Differences
Context
Locus
Excavation Unit
Lot
Level
Stratum
Behälter (Troy) (Basket)
Semantics
One language - one meaning – different terms
Stratigraphic Unit
With thanks to Gerald Hiebel
English Heritage Recording Manual
English Heritage Recording Manual with CRM-EH 'Extensions'
German - e.g. Göttingen & Bayern
Befunde - Stratigraphic Unit / Context
1. Bayern - Befundbuch (positive deposit?)
Bodenbefunde (soil SU)
Baubefunde (built SU, e.g. walls)
Befundkomplex - Feature (Group)
Planum = multi-context plans by level?
With thanks to Gerald Hiebel
Bavarian Recording Manual
Çatalhöyük - Hodder's 'Post-Processual' excavation recording
Units - Stratigraphic units, similar to Contexts
Features - groupings of units or more complex structures, similar to MoLA Groups
French - e.g. ???? Please !!!!
Examples using Single Context Recording methodology?
INRAP, n'est-ce pas?
Other excavation methodologies?
Prototype Controlled Vocabulary searching
▪ Controlled vocabularies online
  ▪ Vocabularies from EH, RCAHMS, RCAHMW
  ▪ Conversion to a common standard format (SKOS)
  ▪ Persistent globally unique identifiers for every concept
  ▪ Made available online as Linked Open Data
  ▪ Also downloadable data files and listings
▪ Web services
  ▪ Facilitate concept searching, browsing, suggestion, validation
▪ Tools to use controlled vocabularies
  ▪ Browser-based ‘widget’ user interface controls
  ▪ Search, browse, suggest, select concepts
▪ Case studies
  ▪ Legacy data to thesaurus alignment
  ▪ Thesaurus to thesaurus alignment
  ▪ Third party use of project outcomes
STELLAR Project Tools - SKOS Template
SKOS = Simple Knowledge Organisation System
Using SKOS - W3C standard for Web-based Terminologies
skos:Concept Castle:c789 and skos:Concept Motte:c456, linked by skos:broader / skos:narrower
skos:Concept Bailey:c789 and skos:Concept Motte:c456, linked by skos:related (in both directions)
skos:Concept Motte:c456 and skos:ConceptScheme Monument:s123, linked by skos:inScheme
SKOS_CONCEPTS – scheme_id, broader_id, related_id
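The template idea can be illustrated with a minimal Python sketch that turns flat table rows into SKOS statements. This is not the STELLAR tool itself: only scheme_id, broader_id and related_id come from the slide, while the concept_id and label columns, the example.org namespace and the identifier c321 are assumptions made for illustration.

```python
# Sketch only: flat rows (SKOS_CONCEPTS-style) to SKOS triples.
# Assumed for illustration: concept_id and label columns, the example.org
# namespace, and the identifier c321. Not the actual STELLAR template.
BASE = "http://example.org/"  # hypothetical namespace

rows = [
    {"concept_id": "c789", "label": "Castle", "scheme_id": "s123",
     "broader_id": None, "related_id": None},
    {"concept_id": "c456", "label": "Motte", "scheme_id": "s123",
     "broader_id": "c789", "related_id": "c321"},
    {"concept_id": "c321", "label": "Bailey", "scheme_id": "s123",
     "broader_id": "c789", "related_id": None},
]


def skos_triples(rows, base=BASE):
    """Return (subject, predicate, object) triples for every row."""
    triples = []
    for row in rows:
        subject = f"<{base}{row['concept_id']}>"
        triples.append((subject, "rdf:type", "skos:Concept"))
        triples.append((subject, "skos:prefLabel", f'"{row["label"]}"@en'))
        triples.append((subject, "skos:inScheme", f"<{base}{row['scheme_id']}>"))
        if row.get("broader_id"):
            parent = f"<{base}{row['broader_id']}>"
            # broader and narrower are inverses, so both directions are stated
            triples.append((subject, "skos:broader", parent))
            triples.append((parent, "skos:narrower", subject))
        if row.get("related_id"):
            other = f"<{base}{row['related_id']}>"
            # skos:related is symmetric
            triples.append((subject, "skos:related", other))
            triples.append((other, "skos:related", subject))
    return triples


for s, p, o in skos_triples(rows):
    print(s, p, o, ".")
```

Stating both directions of each relationship keeps the output usable by consumers that do not apply SKOS inference.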
Vocabulary Widgets – e.g. for OASIS
▪ Scheme list
▪ Scheme details
▪ Top concepts
▪ Composite control
More Widget details on HeritageData.org
LOD Heritage Vocabularies: http://www.heritagedata.org
Thesaurus searching and browsing
SENESCHAL - Semantic ENrichment Enabling Sustainability of arCHAeological Links
Early adoption (continued)
▪ Clwyd-Powys Archaeological Trust (SENESCHAL widgets embedded into HER application and mobile field recording app)
British Oceanographic Data Centre - LOD
EH Thesauri of Maritime Craft
With thanks to Adam Leadbetter
Typical alignment problems encountered
▪ Simple spelling errors
  ▪ “POSTHLOLE”, “CESS PITT”, “FURRROWS”, “FLINT SCRAPPER”
▪ Alternate word forms
  ▪ “BOUNDARY”/“BOUNDARIES”, “GULLEY”/“GULLIES”
▪ Prefixes / suffixes
  ▪ “RED HILL (POSSIBLE)”, “TRACKWAY (COBBLED)”, “CROFT?”, “CAIRN (POSSIBLE)”, “PORTAL DOLMEN (RE-ERECTED)”
▪ Nested delimiters
  ▪ “POTTERY, CERAMIC TILE, IRON OBJECTS, GLASS”
▪ Terms not intended for indexing
  ▪ “NONE”, “UNIDENTIFIED OBJECT”, “N/A”, “NA”, “INCOHERENT”
▪ Terms that would not be in (any) thesauri
  ▪ “WOTSITS PACKET”, “CHARLES 2ND COIN”, “ROMAN STRUCTURE POSSIBLY A VILLA”, “ST GUTHLACS BENEDICTINE PRIORY”, “WORCESTER-BIRMINGHAM CANAL”, “KUNGLIGA SLOTTET”, “SUB-FOSSIL BEETLES”
▪ More specific phrases
  ▪ “SIDE WALL OF POT WITH LUG”, “BRICK-LINED INDUSTRIAL WELL OR MINE SHAFT”, “ALIGNMENT OF PLATFORMS AND STONES”
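Some of these problems could be reduced by a clean-up pass before matching. The Python sketch below is an assumption about how that might look, not the project's actual pipeline: the function name, the stop list and the rules (split on commas, strip bracketed qualifiers and trailing question marks, drop non-index values) are all illustrative. Note that the bracketed qualifier itself, e.g. "(COBBLED)", is discarded here.

```python
import re

# Illustrative stop list built from the non-index examples on the slide.
NON_TERMS = {"NONE", "N/A", "NA", "UNIDENTIFIED OBJECT", "INCOHERENT"}


def normalise(value):
    """Split a raw field value into (term, uncertain) candidate pairs."""
    results = []
    for part in value.split(","):          # nested delimiters
        term = part.strip().upper()
        # Keep a flag for values the recorder marked as doubtful.
        uncertain = term.endswith("?") or "(POSSIBLE)" in term
        # Remove bracketed qualifiers and any trailing question marks.
        term = re.sub(r"\s*\([^)]*\)", "", term).rstrip("?").strip()
        if term and term not in NON_TERMS:
            results.append((term, uncertain))
    return results


print(normalise("RED HILL (POSSIBLE)"))
print(normalise("POTTERY, CERAMIC TILE, IRON OBJECTS, GLASS"))
print(normalise("N/A"))
```

Cleaning of this kind does nothing for terms that are absent from every thesaurus or for longer descriptive phrases; those still need human review.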
Data alignment - R&D approach
▪ Levenshtein edit distance algorithm
  ▪ Measures the minimum number of character edits required to change one string into another
  ▪ Accommodates small spelling differences/errors
▪ Bulk alignment process
  ▪ Compares each value to all terms from a specified thesaurus to obtain the best textual match
  ▪ Similarity threshold introduced to suppress low-scoring matches: the Levenshtein algorithm will always produce a match, even if it is a bad one!
  ▪ Periods require an additional approach due to mixed formats (named periods, numeric ranges etc.)
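The bulk alignment idea can be sketched in Python as follows. The edit distance is the standard Levenshtein algorithm; the normalised similarity score, the 0.8 threshold, the function names and the three-term thesaurus are assumptions for illustration and are not taken from the project.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions or
    substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a                      # keep the working row short
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def similarity(a, b):
    """Assumed normalisation: 1.0 for identical strings, 0.0 for no overlap."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def best_match(value, terms, threshold=0.8):
    """Return (term, score); term is None when the best score is below
    the threshold. Assumes terms is a non-empty list."""
    score, term = max((similarity(value.upper(), t.upper()), t) for t in terms)
    return (term, score) if score >= threshold else (None, score)


thesaurus = ["POSTHOLE", "CESS PIT", "FURROW"]   # hypothetical term list
print(best_match("POSTHLOLE", thesaurus))
print(best_match("WOTSITS PACKET", thesaurus))
```

Without the threshold the second call would still return whichever term happens to score highest, which is the "bad match" the slide warns about.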
Data Alignment R&D Results – Monument Types
Needs some level of human verification by domain experts.
Do we need semantic wiki-style interfaces to enable that?
Conclusions and Challenges - Do you want to share Open Archaeological Data somewhere on or over the horizon?
Different archaeological recording systems share common conceptual frameworks and semantic relationships
By conceptualising common relationships in our different data sets at a broad (metadata) level and aligning vocabularies of shared reference terms we can cross-search data with more semantic accuracy to find patterns and answers to related research questions
The technologies are being developed in other domains, but is there a common will for sharing archaeological data openly in the interests of improving research methods?
References

Catalin Pavel. "Describing and Interpreting the Past"

Tudhope, May, Binding, Vlachidis. "Connecting Archaeological Data and Grey Literature via Semantic Cross Search" - Internet Archaeology Vol 30

Contact:
Keith.May@english-heritage.org.uk
@Keith_May

CAA 2014 - To Boldly or Bravely Go? Experiences of using Semantic Technologies for Archaeological Resources

 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Kürzlich hochgeladen (20)

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

CAA 2014 - To Boldly or Bravely Go? Experiences of using Semantic Technologies for Archaeological Resources

  • 1. 
 by Keith May @Keith_May Ceri Binding & Prof Doug Tudhope
 Faculty of Advanced Technology
 University of South Wales
 To Boldly or Bravely Go? Experiences of using Semantic Technologies for Archaeological Resources
  • 2. Excavation record data modelling
     • CRM-EH focuses on common ‘core’ concepts of our archaeological processes
     • Stratigraphic relationships (e.g. Harris matrix) are crucial for relating individual records
     • Mapped only a limited degree of the minute archaeological detail to CIDOC CRM
     • Different broad categories of contexts (Deposits, Masonry, Timber, etc.) are handled by separate forms but modelled together
     • Model already "complex" enough - most archaeologists find it a little daunting
     [Figure: Details of Context on recording form]
  • 3. What about comparing records across different countries? With thanks to Anja Masur
  • 4. Documentation • Different excavation methods bring differing documentation • Comparison of different documentation sheets Similarities and Differences
  • 6. With thanks to Gerald Hiebel English Heritage Recording Manual
  • 7. English Heritage Recording Manual with CRM-EH 'Extensions'
  • 8. German - e.g. Göttingen & Bayer
     Befunde - Stratigraphic Unit / Context
     1. Bayer - Befundbuch (positive deposit?)
     Bodenbefunde (soil SU)
     Baubefunde (built SU, e.g. walls)
     BefundeKomplex - Feature (Group)
     Planum = Multi-context plans by level?
  • 9. With thanks to Gerald Hiebel Bavarian Recording Manual
  • 10. Catalhoyuk - Hodder's 'Post-Processual' excavation recording Units - Stratigraphic units, similar to Contexts Features - groupings of units or more complex structures, similar to MoLA Groups
  • 11. French - e.g. ???? Please! Examples using Single Context Recording methodology? INRAP, n'est-ce pas? Other excavation methodologies?
  • 14.
     ▪ Controlled vocabularies online
        ▪ Vocabularies from EH, RCAHMS, RCAHMW
        ▪ Conversion to a common standard format (SKOS)
        ▪ Persistent globally unique identifiers for every concept
        ▪ Made available online as Linked Open Data
        ▪ Also downloadable data files and listings
     ▪ Web services
        ▪ Facilitate concept searching, browsing, suggestion, validation
     ▪ Tools to use controlled vocabularies
        ▪ Browser-based ‘widget’ user interface controls
        ▪ Search, browse, suggest, select concepts
     ▪ Case studies
        ▪ Legacy data to thesaurus alignment
        ▪ Thesaurus to thesaurus alignment
        ▪ Third party use of project outcomes
  • 15. STELLAR Project Tools - SKOS Template SKOS = Simple Knowledge Organisation System Using SKOS - W3C standard for Web-based Terminologies
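To make the SKOS conversion concrete, here is a minimal sketch of how one thesaurus term might be written out as a SKOS concept in Turtle syntax. It is not the STELLAR template itself: the function name, the example.org URIs and the labels are all hypothetical. Only the SKOS namespace and the properties `skos:prefLabel`, `skos:broader` and `skos:inScheme` come from the W3C standard.

```python
# Illustrative only: renders one concept as SKOS Turtle with plain string formatting.
# The URIs passed in are placeholders, not real heritagedata.org identifiers.
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."


def concept_to_turtle(uri, pref_label, scheme_uri, broader_uri=None, lang="en"):
    """Return a Turtle description of a single skos:Concept."""
    # Escape backslashes and double quotes so the label is a valid Turtle literal.
    label = pref_label.replace("\\", "\\\\").replace('"', '\\"')
    lines = [
        f"<{uri}> a skos:Concept ;",
        f'    skos:prefLabel "{label}"@{lang} ;',
    ]
    if broader_uri:
        # Link to the parent concept in the thesaurus hierarchy.
        lines.append(f"    skos:broader <{broader_uri}> ;")
    lines.append(f"    skos:inScheme <{scheme_uri}> .")
    return "\n".join([SKOS_PREFIX, ""] + lines)


if __name__ == "__main__":
    print(concept_to_turtle(
        "http://example.org/concept/post-hole",      # hypothetical concept URI
        "POST HOLE",
        "http://example.org/scheme/monument-types",  # hypothetical scheme URI
        broader_uri="http://example.org/concept/hole",
    ))
```

Giving every concept its own URI is what lets other datasets point at the same term unambiguously, which is the role the persistent identifiers play in the Linked Open Data vocabularies.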
  • 17. Vocabulary Widgets – e.g. for OASIS ▪ Scheme list ▪ Scheme details ▪ Top concepts ▪ Composite control. More widget details on HeritageData.org
  • 18. LOD Heritage Vocabularies: http://www.heritagedata.org
  • 20. SENESCHAL - Semantic ENrichment Enabling Sustainability of arCHAeological Links. Early adoption (continued) ▪ Clwyd-Powys Archaeological Trust (SENESCHAL widgets embedded into HER application and mobile field recording app)
  • 21. British Oceanographic Data Centre - LOD EH Thesauri of Maritime Craft With Thanks to Adam Leadbetter
  • 22. Typical alignment problems encountered
     ▪ Simple spelling errors
        ▪ “POSTHLOLE”, “CESS PITT”, “FURRROWS”, “FLINT SCRAPPER”
     ▪ Alternate word forms
        ▪ “BOUNDARY”/“BOUNDARIES”, “GULLEY”/“GULLIES”
     ▪ Prefixes / suffixes
        ▪ “RED HILL (POSSIBLE)”, “TRACKWAY (COBBLED)”, “CROFT?”, “CAIRN (POSSIBLE)”, “PORTAL DOLMEN (RE-ERECTED)”
     ▪ Nested delimiters
        ▪ “POTTERY, CERAMIC TILE, IRON OBJECTS, GLASS”
     ▪ Terms not intended for indexing
        ▪ “NONE”, “UNIDENTIFIED OBJECT”, “N/A”, “NA”, “INCOHERENT”
     ▪ Terms that would not be in (any) thesauri
        ▪ “WOTSITS PACKET”, “CHARLES 2ND COIN”, “ROMAN STRUCTURE POSSIBLY A VILLA”, “ST GUTHLACS BENEDICTINE PRIORY”, “WORCESTER-BIRMINGHAM CANAL”, “KUNGLIGA SLOTTET”, “SUB-FOSSIL BEETLES”
     ▪ More specific phrases
        ▪ “SIDE WALL OF POT WITH LUG”, “BRICK-LINED INDUSTRIAL WELL OR MINE SHAFT”, “ALIGNMENT OF PLATFORMS AND STONES”
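Several of the problems above (suffix qualifiers, nested delimiters, terms not intended for indexing) could be reduced by a cleaning pass before any matching is attempted. The following is a minimal sketch of such a pass, not the project's actual code: the function name and the stop list are assumptions, with the stop list taken from the examples on this slide.

```python
import re

# Hypothetical stop list, built from the "not intended for indexing" examples.
NON_INDEX_TERMS = {"NONE", "N/A", "NA", "UNIDENTIFIED OBJECT", "INCOHERENT"}


def normalise_terms(raw):
    """Split a legacy field value into cleaned candidate terms."""
    cleaned = []
    # Nested delimiters: one field may hold several comma-separated terms.
    for part in raw.split(","):
        # Suffix qualifiers: drop bracketed notes such as "(POSSIBLE)".
        term = re.sub(r"\s*\([^)]*\)", "", part)
        # Drop uncertainty markers such as the "?" in "CROFT?".
        term = term.replace("?", "")
        # Collapse repeated whitespace and use upper case throughout.
        term = " ".join(term.split()).upper()
        if term and term not in NON_INDEX_TERMS:
            cleaned.append(term)
    return cleaned


if __name__ == "__main__":
    for value in ["RED HILL (POSSIBLE)", "CROFT?",
                  "POTTERY, CERAMIC TILE, IRON OBJECTS, GLASS", "N/A"]:
        print(repr(value), "->", normalise_terms(value))
```

This sketch simply discards qualifiers such as "(POSSIBLE)". A real alignment would probably want to keep them in a separate field, since they record the certainty of the original identification.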
  • 23. Data alignment - R&D approach
     ▪ Levenshtein edit distance algorithm
        ▪ Measures the minimum number of character edits required to change one string into another
        ▪ Accommodates small spelling differences/errors
     ▪ Bulk alignment process
        ▪ Compares each value to all terms from a specified thesaurus to obtain the best textual match
        ▪ Similarity threshold introduced to suppress low-scoring matches. The Levenshtein algorithm will always produce a match, even if it is a bad one!
     ▪ Periods require an additional approach due to mixed formats (named periods, numeric ranges etc.)
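The bulk alignment idea can be sketched in a few lines: compute the edit distance from each legacy value to every thesaurus term, keep the best match, and reject it if it falls below a threshold. This is an illustration under stated assumptions rather than the SENESCHAL implementation. The normalisation of distance by the longer string's length, the 0.8 threshold and the three-term thesaurus are all hypothetical choices.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions or substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def similarity(a, b):
    """Scale edit distance to 0..1 (an assumed normalisation, not from the slides)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def best_match(value, thesaurus, threshold=0.8):
    """Return (term, score) for the closest term, or None if below the threshold."""
    value = value.strip().upper()
    scored = [(similarity(value, term.upper()), term) for term in thesaurus]
    if not scored:
        return None
    score, term = max(scored)
    return (term, score) if score >= threshold else None


if __name__ == "__main__":
    terms = ["POSTHOLE", "CESS PIT", "FURROW"]  # hypothetical thesaurus extract
    for value in ["POSTHLOLE", "CESS PITT", "WOTSITS PACKET"]:
        print(value, "->", best_match(value, terms))
```

Without the threshold, "WOTSITS PACKET" would still be assigned to whichever term happens to be least distant, which is exactly the bad match the slide warns about. Where to set the threshold is a trade-off between missed alignments and false ones, so borderline cases still need human review.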
  • 24. Data Alignment R&D Results – Monument Types. Needs some level of human verification by domain experts. Do we need semantic wiki-style interfaces to enable that?
  • 25. Conclusions and Challenges - Do you want to share Open Archaeological Data somewhere on or over the horizon?
     ▪ Different archaeological recording systems share common conceptual frameworks and semantic relationships.
     ▪ By conceptualising common relationships in our different data sets at a broad (metadata) level, and aligning vocabularies of shared reference terms, we can cross-search data with more semantic accuracy to find patterns and answers to related research questions.
     ▪ The technologies are being developed in other domains, but is there a common will for sharing archaeological data openly in the interests of improving research methods?
  • 26. References
     ▪ Catalin Pavel. "Describing and Interpreting the Past"
     ▪ Tudhope, May, Binding, Vlachidis. "Connecting Archaeological Data and Grey Literature via Semantic Cross Search" - Internet Archaeology Vol 30
     Contact: Keith.May@english-heritage.org.uk @Keith_May