DURAARK
Preserving Architectural Knowledge
DEDICATE – Final Seminar
Glasgow, October 21st 2013

Michelle Lindlar (LUH / TIB)
1 / 23

21 / 10 / 13
TIB (Technische Informationsbibliothek)
is the German National Library of Science
and Technology
Why architectural data?
subjects: engineering, architecture, chemistry,
computer science, mathematics and physics
Competence centre for non-textual materials (KNM)
2007 – 2011: DFG-funded PROBADO3D project
metadata- and content-based search for digital
architectural 3D models
http://www.probado.de/en_3d.html
Why digital preservation?
2009-2011: Goportis digital preservation pilot project,
together with our Goportis partners ZB MED and ZBW
Since 2012:
Goportis digital preservation system hosted by TIB

A few words about TIB
DURAARK (DURAble Architectural Knowledge)
FP7 – ICT – Digital Preservation (STReP)
February 2013 – January 2016

Goal
Develop methods and tools for sustainable long-term
preservation of building data (3D and BIM models,
metadata, related knowledge & Web data)
Scope
• address all layers of digital preservation (bit,
logical, semantic)
• interlinked curation and preservation workflows
• focus on two file formats: IFC and E57
• incorporate an existing OAIS-compliant digital
preservation system

Project overview
Tangible outcomes
Semantic enrichment: Vocabularies for
description of built structures and
enrichment techniques based on a unified
and sustainable naming scheme
Tailored workflows: Thoroughly investigate the
requirements of institutional stakeholders
(libraries/archives) and SMEs for long-term
archiving. Develop corresponding workflows.
Sustainability of file formats: Address the problem of
digital decay by using Industry Foundation
Classes (IFC) and E57 as open and already
well-established file formats suited for
long-term preservation. Ensure the availability
of characterization tools for those formats.

Goal and Tangible Outcomes
DURAARK – an interdisciplinary project
UBO: Universität Bonn
- Technical Coordinator
- WP4/WP5: change management, shape
recognition
Luleå University of Technology
- WP8 leader, dissemination/exploitation

CITA, Center for Information Technology
and Architecture Copenhagen
- WP7 leader, evaluation, test
TUE, Department of the Built Environment,
Eindhoven University of Technology
- WP3 leader, semantics & metadata
Catenda, SME
- User perspective, market requirements, evaluation
Fraunhofer Austria
- WP2 leader, system specification
& integration

Consortium

Jakob Beetz (Eindhoven University of Technology)

LUH: German National Library of
Science and Technology (TIB) &
L3S Research Center Hannover
- Coordinator
- WP3 Semantic Enrichment
- WP6 leader, long-term preservation
3 layers of a digital object
risks:
• media obsolescence
• technical failure
• human error
• DRM
http://commons.wikimedia.org/wiki/File:Compact_Floppy.jpg

possible actions:
• media migration, refreshing, replication
• technological redundancy, ideally with geographic spread
• error detection, monitoring, recovery & disaster planning
• controlled storage with regular maintenance
• security and trust

Solved through "good IT practice" (which, of course,
needs to be implemented …)

1. Bit(stream) [Physical] preservation layer
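
The "error detection" and "replication" actions above are commonly implemented as fixity checks. The following is a minimal sketch, assuming SHA-256 as the checksum and a hypothetical list of replica paths; it is an illustration, not the procedure used by the Goportis system or by DURAARK.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_damaged_copies(copies: list[Path], expected: str) -> list[Path]:
    """Return the replicas that are missing or whose checksum differs.

    `copies` and `expected` are hypothetical inputs: the replica locations
    and the digest recorded at ingest time.
    """
    return [
        copy
        for copy in copies
        if not copy.is_file() or sha256_of(copy) != expected
    ]
```

Run on a schedule, a check like this flags replicas that need to be restored from an intact copy.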
risks:
• software / file format obsolescence
• software → OS → hardware dependencies
• additionally: configuration / package dependencies
• lack of compliance with format standards ("malformed objects")
• DRM
possible actions:
• migration, emulation, normalization
• "hardware museum"
• data/information extraction
• extensive technical metadata capturing
• definition of significant properties (what to preserve)

Established basic processes … but they
require adaptation for new formats.

http://www.flickr.com/photos/89771128@N02/8451172304/in/pool-2121762@N23

2. Logical [object] preservation layer
risks:
• terminology and concepts change over time
• context and provenance may be lost
(purpose, setting, limitations, cultural context,
related objects)
possible actions:
• semantic enrichment
• tracing of metadata
• audit trail capturing
• migration at semantic level
• documentation of context
• document intended meaning / interpretation

Least developed area of digital
preservation

3. Semantic [interpretability] preservation
layer
DURAARK Stack
Use Cases (1/2)
producers
long-term data stewards
consumers

DURAARK stakeholders
producer / consumer: creates data to be preserved by the long-term data steward
long-term data steward: actions need to meet requirements of the producer / consumer

DCC Curation Lifecycle Model
http://www.dcc.ac.uk/resources/curation-lifecycle-model

Curation and Preservation
Consumer Use Cases
• result of stakeholder analysis
• describe desired use, re-use, access
• will be addressed in the geometric and
semantic enrichment processing layer

→ Knowing why something should be
preserved helps us in evaluating the
characteristics to be preserved

Use Cases (2/2)
http://public.ccsds.org/publications/archive/650x0m2.pdf

OAIS: Information Object
Metadata: Technical
"Metadata that describes the technical state of and process used to create a file.
Often closely related either to its file format or the original software used to
create the file, e.g. scanning equipment and settings used to create or modify a
digital object."
http://www.digitalpreservation.gov/ndsa/ndsa-glossary.html
→ Information needed in order to maintain access to the file

Significant properties:
criteria which an institution
considers important factors of
an object's quality, structure
or behaviour, which should be
preserved over time,
i.e. over the course of digital
preservation actions.

http://public.ccsds.org/publications/archive/650x0m2.pdf

Technical Metadata
Existing tools for various file
formats:
JHOVE, Apache Tika, FIDO, FITS,
DROID, …

Few existing tools for IFC
and E57:
E57 validator, IFC validator

File format characterization
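
Format identification is the first step of characterization. A minimal sketch for the two DURAARK formats, assuming that IFC files in STEP encoding start with the text `ISO-10303-21;` and that E57 files start with the bytes `ASTM-E57`. The function name and the signature table are illustrative and not taken from any of the tools listed above. The STEP signature only shows that a file is a STEP physical file; confirming that it is IFC requires reading the schema declared in its header.

```python
from pathlib import Path
from typing import Optional

# Leading byte signatures assumed for this sketch.
SIGNATURES = {
    b"ISO-10303-21;": "STEP physical file (possibly IFC)",
    b"ASTM-E57": "E57",
}


def identify_format(path: Path) -> Optional[str]:
    """Return a format label based on the first bytes of the file, or None."""
    with path.open("rb") as handle:
        head = handle.read(64).lstrip()  # tolerate leading whitespace
    for magic, label in SIGNATURES.items():
        if head.startswith(magic):
            return label
    return None
```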
National Library of Australia: Testing Software Tools of Potential Interest for Digital Preservation
http://www.openplanetsfoundation.org/system/files/Digital%20Preservation%20Project%20Report%20-%20Testing%20Software%20Tools.pdf
IFC extraction:
geometry types
schema version
implementation level
application
version of application
measurement units
MVD
geotagged
gross area
number of stories
…

E57 extraction:
geo-referenced (yes/no)
total square metres
number of floors
resolution settings
quality settings
sensor model, sensor serial number, …
total number of scans
total number of points
intensity (yes/no)
colour (yes/no)
reasons for spatial disturbance: distribution
of detected elements
sub quality parameters (positioning) – in %
e.g., distance error matched references;
occupied quadrants
sub quality parameters (references) – in %
e.g., point drift, longitudinal mismatch
…

Potential candidates for technical metadata
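
Some of the IFC candidates above (schema version, application, MVD) can be read from the header of a STEP-encoded IFC file. The following is a minimal sketch, assuming the usual `FILE_DESCRIPTION`, `FILE_NAME` and `FILE_SCHEMA` header entries; the regular expressions are a heuristic, not a full STEP parser, and the function name is hypothetical. Values such as gross area or number of stories require parsing the model data itself and are not covered here.

```python
import re
from typing import Dict, Optional


def extract_ifc_header(text: str) -> Dict[str, Optional[str]]:
    """Pull a few technical metadata candidates out of an IFC STEP header."""
    header_match = re.search(r"HEADER;(.*?)ENDSEC;", text, re.DOTALL)
    if header_match is None:
        raise ValueError("no STEP HEADER section found")
    header = header_match.group(1)

    schema = re.search(r"FILE_SCHEMA\s*\(\s*\(\s*'([^']*)'", header)
    view = re.search(r"ViewDefinition\s*\[([^\]]*)\]", header)

    # FILE_NAME ends with ..., originating_system, authorization.
    # Heuristic: take the second-to-last quoted string as the application.
    application = None
    file_name = re.search(r"FILE_NAME\s*\((.*?)\)\s*;", header, re.DOTALL)
    if file_name:
        strings = re.findall(r"'((?:[^']|'')*)'", file_name.group(1))
        if len(strings) >= 2:
            application = strings[-2]

    return {
        "schema_version": schema.group(1) if schema else None,
        "mvd": view.group(1).strip() if view else None,
        "application": application,
    }


# Invented sample header, for illustration only.
SAMPLE = """ISO-10303-21;
HEADER;
FILE_DESCRIPTION(('ViewDefinition [CoordinationView]'),'2;1');
FILE_NAME('example.ifc','2013-10-21T10:00:00',('Author'),('Org'),
  'Preprocessor','ExampleCAD 1.0','');
FILE_SCHEMA(('IFC2X3'));
ENDSEC;
DATA;
ENDSEC;
END-ISO-10303-21;
"""

if __name__ == "__main__":
    print(extract_ifc_header(SAMPLE))
```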
Currently developing stakeholder questionnaire
covering the following areas:
– data holdings (formats, SW, produced internally / externally)
– data storage / management (data carriers, backup practices, archiving
practices)
– access (when, for what reason)
– experience with data loss (yes/no, reasons)

Looking for interested institutions
and multipliers!

Want to help?
michelle.lindlar@tib.uni-hannover.de
Thank you. Questions? Suggestions?
