Benchmarks for
Digital Preservation Tools



Kresimir Duretec, Artur Kulmukhametov, Andreas Rauber and Christoph Becker

Vienna University of Technology and University of Toronto







IPRES 2015, Chapel Hill, North Carolina, USA
November 3, 2015

Overview
1. Need for evidence in digital preservation
•Software tools in digital preservation and evidence around their quality
2. Software benchmarks in digital preservation
•What, Why, How
•Software quality
3. Benchmark proposals
•Photo migration
•Property extraction from documents
4. Are we ready? Next steps
In case you get bored: https://goo.gl/ps6x8I

Evidence in digital preservation
Important topic!!!

Software tools in digital preservation
•many tools (>400 tools in COPTR registry, http://coptr.digipres.org)
•categories
  •identification, validation & characterisation
  •migration
  •emulation
  •…
•high quality required
[Figure: Droid]

However!!!
•Blog post at OPF by paul on 19th Oct 2012: http://openpreservation.org/blog/2012/10/19/practitioners-have-spoken-we-need-better-characterisation/
•Blog post at OPF by johan on 31st Jan 2014: http://openpreservation.org/blog/2014/01/31/why-cant-we-have-digital-preservation-tools-just-work/
•Panel at IPRES 2014

Evidence of software quality in digital preservation
•the need for evaluation is not new!
•testbeds and numerous individual evaluation initiatives
•often take-up is missing
Timeline
•2002: Dutch Digital Preservation Testbed
•2006: DELOS Digital Preservation Testbed
•2010: Planets Testbed
•?

Software benchmarks
•Proposal: start creating software benchmarks to extend the evidence base around software quality of digital preservation tools
•major characteristics
  •collaborative
  •open
  •public

Software benchmarks - definition
Source: SPEC glossary
Definition: “a "benchmark" is a test, or set of tests, designed to compare the performance of one computer system against the performance of others”
Key words: test, compare, performance

Source: IEEE Glossary
Definition: “1. a standard against which measurements or comparisons can be made. 2. a procedure, problem, or test that can be used to compare systems or components to each other or to a standard. 3. a recovery file”
Key words: measurements, comparison, test

Source: W. Tichy
Definition: “a benchmark is a task domain sample executed by a computer or by a human and computer. During execution, the human or computer records well-defined performance measurements. A benchmark provides a level playing field for competing ideas, and (assuming the benchmark is sufficiently representative) allows repeatable and objective comparisons”
Key words: task domain sample, performance measurements, repeatable objective comparison

Source: Sim et al.
Definition: “a test or set of tests used to compare the performance of alternative tools or techniques”
Key words: test, compare

DP Software Benchmark is a set of tests which enable rigorous comparison of software solutions for a specific task in digital preservation

Benchmarks in other fields
•Software Engineering
  •transactions
  •virtualisation
  •databases
  •…
•Information Retrieval
  •text
  •video
  •music
  •…
[Diagram: iterative process connecting community, specifications, tools & methods that need benchmarks, yearly meeting, results]

Benchmark components
Software engineering
•Motivating comparison
•Task sample
•Performance measures
Information retrieval
•Tasks
•Datasets
•Answersets
•Measures
•Representation formats
Digital preservation
•Motivating comparison
•Function
•Dataset
•Ground truth
•Performance measures
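
One way to make the five digital preservation components concrete is to write a benchmark specification down as a small data structure. The Python sketch below is only an illustration: the class name `BenchmarkSpec`, its field names, and the helper `missing_ground_truth` are hypothetical and do not come from the slides. It assumes that ground truth is optional and keyed by dataset object.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class BenchmarkSpec:
    """Hypothetical container for the five benchmark components."""
    motivating_comparison: str   # what is compared, and which quality aspect
    function: str                # the task the tool is expected to perform
    dataset: List[str]           # identifiers of the digital objects
    # measure name -> function(expected, actual) returning a score
    performance_measures: Dict[str, Callable[[object, object], float]]
    # optional, machine-readable correct answers keyed by dataset object
    ground_truth: Optional[Dict[str, object]] = None

    def missing_ground_truth(self) -> List[str]:
        """Return the dataset objects that have no ground-truth entry."""
        if self.ground_truth is None:
            return list(self.dataset)
        return [obj for obj in self.dataset if obj not in self.ground_truth]


# Example values are invented for illustration only.
spec = BenchmarkSpec(
    motivating_comparison="correctness of extracted property values",
    function="extract property values from a document",
    dataset=["a.docx", "b.docx"],
    performance_measures={
        "exact_match": lambda expected, actual: float(expected == actual),
    },
    ground_truth={"a.docx": {"page_count": 3}},
)
print(spec.missing_ground_truth())  # ['b.docx']
```

Keeping ground truth as a separate optional field mirrors the idea that a benchmark can exist before its correct answers are fully collected.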

Motivating comparison
•What is the benchmark supposed to compare?
•Which quality aspects are addressed?
•What would be the benefits?
•software quality
  •many aspects
  •we need a model
  •ISO SQuaRE

Function
•the specific task a tool is expected to perform

Dataset
•set of digital objects
  •images
  •audio
  •video
•relevant
•accurate
•complete

Ground truth
•correct answers
•optional element
•machine readable

Performance measures
•fitness of the benchmarked tool
•type
  •quantitative
  •qualitative
•calculated
  •by human
  •by machine

RAW Photo migration benchmark
Motivating comparison
•various RAW file formats
•migrate to DNG
•many tools should be tested
Function
•migrate from RAW to DNG
Dataset
•collection of photographs in RAW format
Ground truth
•(none listed)
Performance measures
•SSIM
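
SSIM (structural similarity index) is the only performance measure named for this benchmark. The Python sketch below shows the core formula in a deliberately simplified form: it computes a single global SSIM value over two flattened grayscale pixel sequences. Common implementations instead average SSIM over local windows. The function name `global_ssim` is hypothetical, and the sketch assumes that the original RAW file and the migrated DNG file have already been rendered to comparable pixel values, a step the slides do not describe.

```python
from typing import Sequence


def global_ssim(x: Sequence[float], y: Sequence[float],
                dynamic_range: float = 255.0) -> float:
    """Single-window SSIM over two equally sized grayscale pixel sequences."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("images must be non-empty and have the same size")
    n = len(x)
    # Standard default stabilising constants (K1 = 0.01, K2 = 0.03).
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Invented 2x2 "images", flattened row by row.
original = [10.0, 200.0, 30.0, 120.0]
migrated = [12.0, 198.0, 30.0, 121.0]
print(global_ssim(original, original))  # close to 1.0 for identical input
print(global_ssim(original, migrated))  # slightly below 1.0
```

A score near 1 indicates that the two renderings are structurally almost identical. Which threshold counts as an acceptable migration would have to be agreed in the benchmark specification.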

Property values extraction benchmark
Motivating comparison
•different document properties
•coverage of needed properties
•correctness of extracted values
Function
•extract property values from a document
Dataset
•collection of documents in MS Word 2007 file format
Ground truth
•correct values
Performance measures
•coverage
•accuracy
•exact match ratio
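
The slide names coverage, accuracy and exact match ratio but does not define them. The Python sketch below uses one plausible reading, stated here as an assumption: coverage is the share of ground-truth properties for which a tool reports any value, accuracy is the share whose reported value equals the correct value, and exact match ratio is the share of documents for which every property is correct. All function names and the sample data are hypothetical.

```python
from typing import Mapping

Properties = Mapping[str, object]


def coverage(truth: Properties, extracted: Properties) -> float:
    """Share of ground-truth properties for which any value was reported."""
    if not truth:
        return 0.0
    return sum(1 for name in truth if name in extracted) / len(truth)


def accuracy(truth: Properties, extracted: Properties) -> float:
    """Share of ground-truth properties whose reported value is correct."""
    if not truth:
        return 0.0
    correct = sum(
        1 for name, value in truth.items()
        if name in extracted and extracted[name] == value
    )
    return correct / len(truth)


def exact_match_ratio(truth_by_doc: Mapping[str, Properties],
                      extracted_by_doc: Mapping[str, Properties]) -> float:
    """Share of documents for which every ground-truth property is correct."""
    if not truth_by_doc:
        return 0.0
    exact = 0
    for doc, truth in truth_by_doc.items():
        extracted = extracted_by_doc.get(doc, {})
        if all(n in extracted and extracted[n] == v for n, v in truth.items()):
            exact += 1
    return exact / len(truth_by_doc)


# Invented ground truth and tool output for two documents.
truth = {
    "a.docx": {"page_count": 3, "author": "A"},
    "b.docx": {"page_count": 1, "author": "B"},
}
tool_output = {
    "a.docx": {"page_count": 3, "author": "A"},
    "b.docx": {"page_count": 2},
}
print(coverage(truth["b.docx"], tool_output["b.docx"]))  # 0.5
print(accuracy(truth["b.docx"], tool_output["b.docx"]))  # 0.0
print(exact_match_ratio(truth, tool_output))             # 0.5
```

Reporting coverage separately from accuracy distinguishes a tool that stays silent about a property from one that reports a wrong value.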

Benchmarking - community driven process
•benchmarks emerge from community consensus
•community participation needs to be enabled
•based on established results where possible
•community needs to incur the costs
•continuous evolution of benchmarks is necessary
•Are we ready?

Is the community ready? Indicators
•projects
•workshops
•own conference
•several other journals and conferences
•concerns
•vocabularies
•metrics

Conclusion

Next step
create benchmark specifications
Benchmarking forum workshop, Friday 6th November, 9:00 - 17:00

Questions?
In case you were not bored: https://goo.gl/ps6x8I

INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
 
Application of GIS in Landslide Disaster Response.pptx
Application of GIS in Landslide Disaster Response.pptxApplication of GIS in Landslide Disaster Response.pptx
Application of GIS in Landslide Disaster Response.pptx
 
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comSaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
 
Internship Presentation | PPT | CSE | SE
Internship Presentation | PPT | CSE | SEInternship Presentation | PPT | CSE | SE
Internship Presentation | PPT | CSE | SE
 
Engaging Eid Ul Fitr Presentation for Kindergartners.pptx
Engaging Eid Ul Fitr Presentation for Kindergartners.pptxEngaging Eid Ul Fitr Presentation for Kindergartners.pptx
Engaging Eid Ul Fitr Presentation for Kindergartners.pptx
 
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
 
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
 
proposal kumeneger edited.docx A kumeeger
proposal kumeneger edited.docx A kumeegerproposal kumeneger edited.docx A kumeeger
proposal kumeneger edited.docx A kumeeger
 
Chizaram's Women Tech Makers Deck. .pptx
Chizaram's Women Tech Makers Deck.  .pptxChizaram's Women Tech Makers Deck.  .pptx
Chizaram's Women Tech Makers Deck. .pptx
 

Benchmarks for Digital Preservation tools. Kresimir Duretec, Artur Kulmukhametov, Andreas Rauber and Christoph Becker

  • 1. Benchmarks for Digital Preservation Tools
 Kresimir Duretec, Artur Kulmukhametov, Andreas Rauber and Christoph Becker
 Vienna University of Technology and University of Toronto
 IPRES 2015, Chapel Hill, North Carolina, USA, November 3, 2015
  • 2. Overview
     1. Need for evidence in digital preservation
        • Software tools in digital preservation and evidence around their quality
     2. Software benchmarks in digital preservation
        • What, Why, How
        • Software quality
     3. Benchmark proposals
        • Photo migration
        • Property extraction from documents
     4. Are we ready? Next steps
     https://goo.gl/ps6x8I (in case you get bored)
  • 4. Evidence in digital preservation
     Important topic!
  • 6. Software tools in digital preservation
     • many tools (>400 tools in COPTR registry, http://coptr.digipres.org)
     • categories
        • identification, validation & characterisation
        • migration
        • emulation
        • …
     • high quality required
     [Image: Droid]
  • 7. However!
     Blog post at OPF by paul on 19th Oct 2012: http://openpreservation.org/blog/2012/10/19/practitioners-have-spoken-we-need-better-characterisation/
  • 8. However!
     Blog post at OPF by johan on 31st Jan 2014: http://openpreservation.org/blog/2014/01/31/why-cant-we-have-digital-preservation-tools-just-work/
  • 9. However!
     Panel at IPRES 2014
  • 10. Evidence of software quality in digital preservation
     • the need for evaluation is not new!
     • testbeds and numerous individual evaluation initiatives
     • often take-up is missing
     Timeline: 2002 Dutch Digital Preservation Testbed; 2006 DELOS Digital Preservation Testbed; 2010 Planets Testbed; ?
  • 11. Overview
     1. Need for evidence in digital preservation
        • Software tools in digital preservation and evidence around their quality
     2. Software benchmarks in digital preservation
        • What, Why, How
        • Software quality
     3. Benchmark proposals
        • Photo migration
        • Property extraction from documents
     4. Are we ready? Next steps
     https://goo.gl/ps6x8I (in case you are already bored)
  • 12. Software benchmarks
     • Proposal: start creating software benchmarks to extend the evidence base around software quality of digital preservation tools
     • major characteristics
        • collaborative
        • open
        • public
  • 14. Software benchmarks - definition
     • Source: SPEC glossary
       Definition: “a "benchmark" is a test, or set of tests, designed to compare the performance of one computer system against the performance of others”
       Key words: test, compare, performance
     • Source: IEEE Glossary
       Definition: “1. a standard against which measurements or comparisons can be made. 2. a procedure, problem, or test that can be used to compare systems or components to each other or to a standard. 3. a recovery file”
       Key words: measurements, comparison, test
     • Source: W. Tichy
       Definition: “a benchmark is a task domain sample executed by a computer or by a human and computer. During execution, the human or computer records well-defined performance measurements. A benchmark provides a level playing field for competing ideas, and (assuming the benchmark is sufficiently representative) allows repeatable and objective comparisons”
       Key words: task domain sample, performance measurements, repeatable objective comparison
     • Source: Sim et al.
       Definition: “a test or set of tests used to compare the performance of alternative tools or techniques”
       Key words: test, compare
     DP Software Benchmark is a set of tests which enable rigorous comparison of software solutions for a specific task in digital preservation
  • 15. Benchmarks in other fields
     • Software Engineering
        • transactions
        • virtualisation
        • databases
        • …
     • Information Retrieval
        • text
        • video
        • music
        • …
  • 16. Benchmarks in other fields
     [Diagram labels: community; specifications; tools & methods that need benchmarks; yearly meeting; results; iterative process]
  • 19. Benchmark components
     • Software engineering: Motivating comparison, Task sample, Performance measures
     • Information retrieval: Tasks, Datasets, Answersets, Measures, Representation formats
     • Digital preservation: Motivating comparison, Function, Dataset, Ground truth, Performance measures
  • 20. Motivating comparison
     • What is the benchmark supposed to compare?
     • Which quality aspects are addressed?
     • What would be the benefits?
  • 21. Motivating comparison
     • software quality
     • many aspects
     • we need a model
     • ISO SQuaRE
  • 22. Function
     • the specific task the tool is expected to perform
  • 23. Dataset
     • set of digital objects
        • images
        • audio
        • video
     • relevant
     • accurate
     • complete
  • 24. Ground truth
     • correct answers
     • optional element
     • machine readable
  • 25. Performance measures
     • fitness of the benchmarked tool
     • type
        • quantitative
        • qualitative
     • calculated
        • by human
        • by machine
  • 26. Overview
     1. Need for evidence in digital preservation
        • Software tools in digital preservation and evidence around their quality
     2. Software benchmarks in digital preservation
        • What, Why, How
        • Software quality
     3. Benchmark proposals
        • Raw Photo migration
        • Property extraction from documents
     4. Are we ready? Next steps
     https://goo.gl/ps6x8I (in case you are bored)
  • 27. RAW Photo migration benchmark
     • Motivating comparison: various RAW file formats; migrate to DNG; many tools should be tested
     • Function: migrate from RAW to DNG
     • Dataset: collection of photographs in RAW format
     • Performance measures: SSIM
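The slide names SSIM (the structural similarity index) as the performance measure but does not show how it is computed. As a rough illustration only, the sketch below computes a single-window (global) SSIM for two equally sized grayscale images, using the standard SSIM formula with the commonly used constants 0.01 and 0.03. The function name `global_ssim`, the flat-list image representation, and the use of one global window instead of averaged local windows are assumptions made for this example, not details from the talk.

```python
def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM over two equally sized grayscale images.

    x and y are flat sequences of pixel intensities. Practical SSIM
    implementations average the index over local windows; this sketch
    uses one global window to keep the formula visible.
    """
    if len(x) != len(y) or not x:
        raise ValueError("images must be non-empty and the same size")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Population variance and covariance of the pixel intensities.
    var_x = sum((a - mean_x) * (a - mean_x) for a in x) / n
    var_y = sum((b - mean_y) * (b - mean_y) for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Stabilising constants from the standard SSIM definition.
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    numerator = (2 * mean_x * mean_y + c1) * (2 * cov_xy + c2)
    denominator = (mean_x * mean_x + mean_y * mean_y + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Hypothetical pixel data standing in for a rendered RAW file and its DNG migration.
original = [0, 64, 128, 255]
migrated_identical = list(original)
migrated_inverted = [255 - p for p in original]

print(global_ssim(original, migrated_identical))
print(global_ssim(original, migrated_inverted))
```

In a benchmark along these lines, each tool's DNG output would be rendered and compared against a rendering of the source RAW file; a score near 1 suggests the migration preserved the image structure, while lower scores flag differences worth inspecting.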
  • 28. Property values extraction benchmark
     • Motivating comparison: different document properties; coverage of needed properties; correctness of extracted values
     • Function: extract property values from a document
     • Dataset: collection of documents in MS Word 2007 file format
     • Ground truth: correct values
     • Performance measures: coverage; accuracy; exact match ratio
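The slide lists coverage, accuracy and exact match ratio as measures without defining them. The sketch below shows one plausible reading, under assumptions that are not from the talk: coverage is the share of needed properties for which a tool reports any value, accuracy is the share of reported values that agree with the ground truth, and exact match ratio is the share of documents for which every ground-truth property is reported correctly. All function names, property names and values are hypothetical.

```python
def coverage(needed, extracted):
    """Share of needed properties for which the tool reported any value."""
    if not needed:
        raise ValueError("needed must not be empty")
    return sum(1 for prop in needed if prop in extracted) / len(needed)


def accuracy(ground_truth, extracted):
    """Share of reported properties whose value equals the ground truth."""
    compared = [prop for prop in extracted if prop in ground_truth]
    if not compared:
        return 0.0
    correct = sum(1 for prop in compared if extracted[prop] == ground_truth[prop])
    return correct / len(compared)


def exact_match_ratio(ground_truths, extractions):
    """Share of documents where every ground-truth property is reported correctly."""
    if not ground_truths or len(ground_truths) != len(extractions):
        raise ValueError("need one extraction per ground-truth document")
    matches = sum(
        1
        for truth, got in zip(ground_truths, extractions)
        if all(prop in got and got[prop] == value for prop, value in truth.items())
    )
    return matches / len(ground_truths)


# Hypothetical ground truth and tool output for two documents.
truth_a = {"page_count": 3, "author": "A. Author", "title": "Report"}
truth_b = {"page_count": 1, "author": "B. Writer", "title": "Memo"}
tool_a = {"page_count": 3, "author": "Unknown"}
tool_b = {"page_count": 1, "author": "B. Writer", "title": "Memo"}

print(coverage(truth_a.keys(), tool_a))
print(accuracy(truth_a, tool_a))
print(exact_match_ratio([truth_a, truth_b], [tool_a, tool_b]))
```

Keeping the ground truth as plain key-value mappings reflects the earlier slide's point that ground truth should be machine readable, so the measures can be calculated by machine.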
  • 30. Benchmarking - community driven process
     • benchmarks emerge from the community consensus
     • community participation needs to be enabled
     • based on established results where possible
     • community needs to incur the costs
     • continuous evolution of benchmarks is necessary
     • Are we ready?
  • 31. Is the community ready?
     • Indicators
        • projects
        • workshops
        • own conference
        • several other journals and conferences
     • concerns
        • vocabularies
        • metrics
  • 32. Conclusion
  • 33. Next step
     • create benchmark specifications
     • Benchmarking forum workshop, Friday 6th November, 9:00 - 17:00
  • 34. Questions?
     https://goo.gl/ps6x8I (in case you were not bored)