SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Thomas Stensitzki, Principal Enterprise Consultant
Long Time Preservation
The Importance of Archiving
Agenda
Long-Term preservation
Why should/must items be archived?
What should/must items be archived?
How can archiving be done?
1
2
3
4
Terms
Outsourcing, Filing, Backup, Archiving
Outsourcing
- Data (e.g. of a specific period) is being exported from a source system and converted (if required)
- Outsourced data is not available in the source system
- Outsourced data can be backed up or archived
- Importing of outsourced data might require conversion, when the target data structure is different
Filing
- Storage of objects in a folder of the file system
- Filed objects can be backed up or archived depended on their file location
Terms
Outsourcing, Filing, Backup, Archiving
Backup
- Copy of existing objects to a storage medium to be able to restore data in the case of data
corruption or accidental deletion
- Performed periodically
- Storage medium is being overwritten in time, older version of an object can therefore not be restored
- Old versions of an object can be restored for a specific period only
Archiving
- Copy of a file or document to an external storage medium
- Standardized file format (tif, jpg) (if required)
- Storage for a longer period
Terms
Document management vs. Long-term preservation
Document management
- Management of documents being edited using Check-In, Check-Out and Versioning
- Documents can be found by attribute value search or full-text search
- Attributes and document links are managed by DMS
- Documents are stored in the file system or a DMS database
Terms
Document management vs. Long-term preservation
Long-term preservation
- Auditable and unchangeable storage of completed objects for a long time
- Copy of objects (e.g. files, documents) to an external storage medium
- Files and raw data are archived in original format
- Documents are converted and archived in standardized format (black/white = TIF, colour = JPEG
or PDF/A)
- Document lookup via index
- Archived files and raw data can be provided in original format
- Archived documents can be provided using a viewer software
Terms
Long-term preservation
Digital archiving
- Database-driven, long-term, secure and unchangeable storage of digital information objects
which are reproducible at any time
Digital long-term preservation
- Storage of digital information for a period longer than 10 years
Auditable digital archiving
- Storage of digital business-related information of in accordance to the requirements of
- Handelsgesetzbuch § 239, § 257 HGB
- Abgabeordnung § 146, §147, § 200 AO
- GoBS
- Secure and orderly storage of business-related documents with retention periods of six to ten
years
Why
Sources of documents/objects
Documents, lifecycle of documents
- Creation and editing documents: in process (e.g. DMS, SharePoint)
- Completed documents: final version of a document
- Additional editing creates new version
Other documents
- Correspondence, reports, rules, pictures, films, letters, invoices, quotations, certificates from
different sources
Workflows
- Information from workflow based systems (with digital signatures)
- Final document can be created from related data as the final workflow step
IT systems
- Raw data is usually available in databases or files
Why
Dealing with documents/objects
Documents
- Documents in process and/or final documents are stored in DMS, SharePoint or a disk drive (local
or network share)
- Documents stored on network shares are backup automatically
- Documents in SharePoint and emails in Outlook are deleted after retention period has expired
- Deleted documents on a network share cannot be restored after the backup period as exceeded
- Final documents signed by hand are archived in paper and/or scanned to PDF and stored as file
(attached to an email)
Why
Dealing with documents/objects
Other documents
- Emails are deleted from the inbox automatically after retention period has expired
- Reports, images, films, invoices, quotations, certificates, etc. available as files are be considered as
documents
- Documents in paper, e.g. correspondence, letters, certificates, etc. are stored in files
Why
Dealing with documents/objects
Workflow vs. documents
- Information created in workflow systems is stored with data of digital signatures in databases
- All data of a finalized workflow is stored digitally within the database (usually), final document can be
created using a template
- Print-out is treated as a copy of the original digital document
- Digitally signed documents are treated equally to documents signed by hand
IT systems vs. raw data
- Raw data is stored in databases or files which grow over time
- Data can be outsourced or exported to reduce the storage size, but the data is not instantly
accessible for the application
- Software manufacturers must guarantee that release changes do not impact the capability to import
outsourced data
Why
Legal and regulatory requirements for archiving
Legal requirements for business documents
- Handelsgesetzbuch (HGB) § 257 regulates which business documents have to be archived
- Legal retention period for business letters is 6 years, for other documents 10 years
- Abgabenordnung (AO) §§ 146, 147 describe similar requirements for administrative regulations
- Digitally archiving of those documents must comply to the principles of proper accounting (GoB)
and GoBS which describe the requirements for process documentation
- Process documentation is the proof of correct operation of the system and describes the overall
organizational and technical process of archiving (collection, indexing, storage, retrieval,
protection against loss / corruption and reproduction of archived information)
Why
Legal and regulatory requirements for archiving
- Digitally signed documents are legally binding as well as conventional paper documents
- Each country has different requirements depending on the business of the company (e.g.
Sarbanes-Oxley Act regarding internal controlling)
- Subject to audits and inspections
Why
Legal and regulatory requirements for archiving
Industry-specific requirements for documentation / archiving
- Gefahrengutverordnung (GGAV)
- Environmental liability and product liability law
- Operational directives and regulations
- Good Practice quality guidelines and regulations
- etc.
Agree with internal departments (QS, Legal, Controlling) and maybe with authorities on the
archiving process
What
Retention policies for information life-cycle in Outlook and SharePoint
Recommendations
Outlook Retention period
Inbox 60 days
Other folders
Sent Items
Drafts
Outbox
2 years
Deleted items 7 days
Calendar
Tasks
2 years
Contacts Duration of
employment
Classes in SharePoint Retention period
Standard 2 years
Review 7 years
Long-Term 10 years
What
Which documents and data
Business units determine
- Which documents have to be archived how and for how long
(storage form, file plan, retention periods)
- Document classes (logical archive)
- Document types
- Index data
What
Requirements
Requirements for long-term preservation are specified by the business
- Processes, workflows, interfaces
- Documents, objects, source, meta data
- Archiving period
- Regulatory aspects
- Permissions, roles, user management, responsibilities
- Purpose of archiving (e.g. display of documents in 15 years)
- Confidentiality, data integrity, sensitive data, availability
- Capacity (data volume, number of users, performance)
- etc.
What
Meta data
Meta data provides structured index and search capabilities to archived objects
- Source of meta data (e.g. master data systems)
- Who maintains the master data?
- Shall meta data be selected or manually entered?
- Is meta data document-dependent?
- Is meta data transferred automatically from other systems?
- Is an audit-trail required? (Who has changed which meta-data, when, why)
Coordination of the meta data in early stages is highly recommended
What
Requirements
If raw data has to be archived
- Raw data is stored as is, bit-wise
- Primary goal is the ability to import raw data as 1:1 copy of the original data
- IT system generating raw data must be able to handle imported raw data even after a long time
- Format of raw data must be coordinated
- Software manufacturers must guarantee that release changes do not impact the capability to import
outsourced data
- Meta data must be defined
- Processing of long-term preserved raw data is the responsibility of the generating IT system,
not of the archiving system
How
Technical aspects
Selection of eligible file formats
- Should the document be displayed as original incl. embedded graphics?
- Should reproduce the original document properties (paper size, font size, header, footer, logos,
color, hand-written notes, etc.)?
- Should documents be archived in different formats but with same content (e.g. XML and graphic)?
- Legal requirements?
- Is “loss of information” acceptable when converting into graphical representations (jpeg)?
- Is the converting process revision-safe?
- Is the archived document format suitable for the archiving period?
How
BSI approved formats
Graphics
- TIFF, storage of screened black-white images
- JPEG, storage of colour and gray scale images
Structure formats
- XML, can be used for long-term preservation of digital documents
Schema and layout have to be archived as well
- PDF/A, subset of PDF, standardized for long-term preservation
Format with structure and layout information and graphical objects
Documents must be validated to be PDF/A compliant
Page  21
How
Storage media
Possible storage media
- Paper
- Microfilm
- Magnetic tapes, floppy disks
- Optical storage media (e.g. CD-R, CD-ROM, DVD, WORM)
- Hard drives
- etc.
Selected media types have a limited lifetime and durability. Long-term preserved
objects must be copied to new media unchanged, if required due to technology
related changes in the storage media.
How
Additional topics
- Storage of sensitive data
- Restart of the archiving system after system outage in a disaster
- Integration in current IT environment
- Migration of archived objects is expensive depending on data volume
- User management
- Usage of storage media must be regulated
- Firewall based separation of archiving system
- Long-Term archiving solution should be in use for a long time, supplier selection should be aware of
this
How
Pros & Cons
Pros
 Single storage of documents/objects
 Save storage space
 Documents/objects available to
authorized persons
 Documents/objects available from
every workplace
 Structured search of
documents/objects
Cons
 Usage of source documents must be
regulated
 Personal must be trained
(end-user, administrator)
 On-going maintenance costs
 Complex IT system and IT
infrastructure required
We would be happy to help.
Do You Have
Any Questions?
http://www.granikos.eu
info@granikos.eu
@Granikos_DE

Weitere ähnliche Inhalte

Andere mochten auch

Chef a la local [autosaved]
Chef a la local [autosaved]Chef a la local [autosaved]
Chef a la local [autosaved]Drakkar Jones
 
Hacking tips for public speaking & presentations
Hacking tips for public speaking & presentationsHacking tips for public speaking & presentations
Hacking tips for public speaking & presentationsGiorgos Varvaris
 
Investment_Attraction_Strategy_2016-2019 (1)
Investment_Attraction_Strategy_2016-2019 (1)Investment_Attraction_Strategy_2016-2019 (1)
Investment_Attraction_Strategy_2016-2019 (1)Kwabena Ansah
 
Real estate by Alpine Housing
Real estate by Alpine HousingReal estate by Alpine Housing
Real estate by Alpine HousingAlpineHousing
 
Enriquez adriana portfolio binder
Enriquez adriana portfolio binderEnriquez adriana portfolio binder
Enriquez adriana portfolio binderAdriana Enriquez
 
NetObjects Fusion 2015 Manual Book
NetObjects Fusion 2015 Manual BookNetObjects Fusion 2015 Manual Book
NetObjects Fusion 2015 Manual BookBrandon Taylor
 
Angela Houseknecht ABA Intervention Presentation copy
Angela Houseknecht ABA Intervention Presentation copyAngela Houseknecht ABA Intervention Presentation copy
Angela Houseknecht ABA Intervention Presentation copyAngela Kambic
 
Tellurian 2016 Corporate Diaries and Notebooks in Dubai, UAE
Tellurian 2016 Corporate Diaries and Notebooks in Dubai, UAETellurian 2016 Corporate Diaries and Notebooks in Dubai, UAE
Tellurian 2016 Corporate Diaries and Notebooks in Dubai, UAETellurian Book Production
 
Fabricio - Docker deploy automation
Fabricio - Docker deploy automationFabricio - Docker deploy automation
Fabricio - Docker deploy automationRinat Khabibiev
 
Speech Presentation :3
Speech Presentation :3Speech Presentation :3
Speech Presentation :3Ki-Pyon
 
Houseknecht Data Project Presentation
Houseknecht Data Project PresentationHouseknecht Data Project Presentation
Houseknecht Data Project PresentationAngela Kambic
 
Susan Mottram CV FCF
Susan Mottram CV FCFSusan Mottram CV FCF
Susan Mottram CV FCFSusan Mottram
 

Andere mochten auch (13)

Chef a la local [autosaved]
Chef a la local [autosaved]Chef a la local [autosaved]
Chef a la local [autosaved]
 
Hacking tips for public speaking & presentations
Hacking tips for public speaking & presentationsHacking tips for public speaking & presentations
Hacking tips for public speaking & presentations
 
Investment_Attraction_Strategy_2016-2019 (1)
Investment_Attraction_Strategy_2016-2019 (1)Investment_Attraction_Strategy_2016-2019 (1)
Investment_Attraction_Strategy_2016-2019 (1)
 
Real estate by Alpine Housing
Real estate by Alpine HousingReal estate by Alpine Housing
Real estate by Alpine Housing
 
Enriquez adriana portfolio binder
Enriquez adriana portfolio binderEnriquez adriana portfolio binder
Enriquez adriana portfolio binder
 
NetObjects Fusion 2015 Manual Book
NetObjects Fusion 2015 Manual BookNetObjects Fusion 2015 Manual Book
NetObjects Fusion 2015 Manual Book
 
Angela Houseknecht ABA Intervention Presentation copy
Angela Houseknecht ABA Intervention Presentation copyAngela Houseknecht ABA Intervention Presentation copy
Angela Houseknecht ABA Intervention Presentation copy
 
Tellurian 2016 Corporate Diaries and Notebooks in Dubai, UAE
Tellurian 2016 Corporate Diaries and Notebooks in Dubai, UAETellurian 2016 Corporate Diaries and Notebooks in Dubai, UAE
Tellurian 2016 Corporate Diaries and Notebooks in Dubai, UAE
 
Fabricio - Docker deploy automation
Fabricio - Docker deploy automationFabricio - Docker deploy automation
Fabricio - Docker deploy automation
 
Print
PrintPrint
Print
 
Speech Presentation :3
Speech Presentation :3Speech Presentation :3
Speech Presentation :3
 
Houseknecht Data Project Presentation
Houseknecht Data Project PresentationHouseknecht Data Project Presentation
Houseknecht Data Project Presentation
 
Susan Mottram CV FCF
Susan Mottram CV FCFSusan Mottram CV FCF
Susan Mottram CV FCF
 

Ähnlich wie Archiving Documents

Implementation of an enterprise level EDMS solution
Implementation of an enterprise level EDMS solutionImplementation of an enterprise level EDMS solution
Implementation of an enterprise level EDMS solutionJose Valdivieso
 
Islandora & Archivematica combined NDSA RAG poster for LITA
Islandora & Archivematica combined NDSA RAG poster for LITAIslandora & Archivematica combined NDSA RAG poster for LITA
Islandora & Archivematica combined NDSA RAG poster for LITAaaroncollie
 
Webinar : Key Challenges, Trends and Approaches in Extracting Data
Webinar : Key Challenges, Trends and Approaches in Extracting DataWebinar : Key Challenges, Trends and Approaches in Extracting Data
Webinar : Key Challenges, Trends and Approaches in Extracting DataSensiple Inc.,
 
Elements of Data Documentation
Elements of Data DocumentationElements of Data Documentation
Elements of Data Documentationssri-duke
 
Database Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention PeriodsDatabase Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention PeriodsCraig Mullins
 
RecordPoint Overview
RecordPoint OverviewRecordPoint Overview
RecordPoint OverviewIntergen
 
Document Control best practices and definitions
Document Control best practices and definitionsDocument Control best practices and definitions
Document Control best practices and definitionsMohamed Ahmed
 
BP302: Future Proofing Enterprise IT
BP302: Future Proofing Enterprise IT BP302: Future Proofing Enterprise IT
BP302: Future Proofing Enterprise IT panagenda
 
Utilizing PKI to Reduce Risk & Cost
Utilizing PKI to Reduce Risk & CostUtilizing PKI to Reduce Risk & Cost
Utilizing PKI to Reduce Risk & CostChin Wan Lim
 
Keep Calm and Curate
Keep Calm and CurateKeep Calm and Curate
Keep Calm and CurateGarethKnight
 
ConnectED 2015 BP302: Future-Proofing Enterprise IT
ConnectED 2015 BP302: Future-Proofing Enterprise ITConnectED 2015 BP302: Future-Proofing Enterprise IT
ConnectED 2015 BP302: Future-Proofing Enterprise ITDaniel Reimann
 
File organisation in system analysis and design
File organisation in system analysis and designFile organisation in system analysis and design
File organisation in system analysis and designMohitgauri
 

Ähnlich wie Archiving Documents (20)

ITFT- Dbms
ITFT- DbmsITFT- Dbms
ITFT- Dbms
 
Implementation of an enterprise level EDMS solution
Implementation of an enterprise level EDMS solutionImplementation of an enterprise level EDMS solution
Implementation of an enterprise level EDMS solution
 
Andrew waugh
Andrew waughAndrew waugh
Andrew waugh
 
Andrew Waugh presentation
Andrew Waugh   presentationAndrew Waugh   presentation
Andrew Waugh presentation
 
Islandora & Archivematica combined NDSA RAG poster for LITA
Islandora & Archivematica combined NDSA RAG poster for LITAIslandora & Archivematica combined NDSA RAG poster for LITA
Islandora & Archivematica combined NDSA RAG poster for LITA
 
Webinar : Key Challenges, Trends and Approaches in Extracting Data
Webinar : Key Challenges, Trends and Approaches in Extracting DataWebinar : Key Challenges, Trends and Approaches in Extracting Data
Webinar : Key Challenges, Trends and Approaches in Extracting Data
 
Elements of Data Documentation
Elements of Data DocumentationElements of Data Documentation
Elements of Data Documentation
 
Chap01 (ics12)
Chap01 (ics12)Chap01 (ics12)
Chap01 (ics12)
 
Database Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention PeriodsDatabase Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention Periods
 
RecordPoint Overview
RecordPoint OverviewRecordPoint Overview
RecordPoint Overview
 
DMS and FMS
DMS and FMSDMS and FMS
DMS and FMS
 
Digitization workflow
Digitization workflowDigitization workflow
Digitization workflow
 
Document Control best practices and definitions
Document Control best practices and definitionsDocument Control best practices and definitions
Document Control best practices and definitions
 
BP302: Future Proofing Enterprise IT
BP302: Future Proofing Enterprise IT BP302: Future Proofing Enterprise IT
BP302: Future Proofing Enterprise IT
 
Utilizing PKI to Reduce Risk & Cost
Utilizing PKI to Reduce Risk & CostUtilizing PKI to Reduce Risk & Cost
Utilizing PKI to Reduce Risk & Cost
 
20130222 kaptur training_goldsmiths
20130222 kaptur training_goldsmiths20130222 kaptur training_goldsmiths
20130222 kaptur training_goldsmiths
 
Keep Calm and Curate
Keep Calm and CurateKeep Calm and Curate
Keep Calm and Curate
 
ConnectED 2015 BP302: Future-Proofing Enterprise IT
ConnectED 2015 BP302: Future-Proofing Enterprise ITConnectED 2015 BP302: Future-Proofing Enterprise IT
ConnectED 2015 BP302: Future-Proofing Enterprise IT
 
Digitization
DigitizationDigitization
Digitization
 
File organisation in system analysis and design
File organisation in system analysis and designFile organisation in system analysis and design
File organisation in system analysis and design
 

Mehr von Granikos GmbH & Co. KG

Langzeitarchivierung - Warum ist Archivierung wichtig?
Langzeitarchivierung - Warum ist Archivierung wichtig?Langzeitarchivierung - Warum ist Archivierung wichtig?
Langzeitarchivierung - Warum ist Archivierung wichtig?Granikos GmbH & Co. KG
 
AD FS Workshop | Part 1 | Quick Overview
AD FS Workshop | Part 1 | Quick OverviewAD FS Workshop | Part 1 | Quick Overview
AD FS Workshop | Part 1 | Quick OverviewGranikos GmbH & Co. KG
 
Modern Anti-Spam Protection - Rejection, no sorting
Modern Anti-Spam Protection - Rejection, no sortingModern Anti-Spam Protection - Rejection, no sorting
Modern Anti-Spam Protection - Rejection, no sortingGranikos GmbH & Co. KG
 
Modernes Anti-Spam - Abweisen, nicht sortieren
Modernes Anti-Spam - Abweisen, nicht sortierenModernes Anti-Spam - Abweisen, nicht sortieren
Modernes Anti-Spam - Abweisen, nicht sortierenGranikos GmbH & Co. KG
 

Mehr von Granikos GmbH & Co. KG (6)

Langzeitarchivierung - Warum ist Archivierung wichtig?
Langzeitarchivierung - Warum ist Archivierung wichtig?Langzeitarchivierung - Warum ist Archivierung wichtig?
Langzeitarchivierung - Warum ist Archivierung wichtig?
 
AD FS Workshop | Part 2 | Deep Dive
AD FS Workshop | Part 2 | Deep DiveAD FS Workshop | Part 2 | Deep Dive
AD FS Workshop | Part 2 | Deep Dive
 
AD FS Workshop | Part 1 | Quick Overview
AD FS Workshop | Part 1 | Quick OverviewAD FS Workshop | Part 1 | Quick Overview
AD FS Workshop | Part 1 | Quick Overview
 
Exchange 2013 Site Mailboxes
Exchange 2013 Site MailboxesExchange 2013 Site Mailboxes
Exchange 2013 Site Mailboxes
 
Modern Anti-Spam Protection - Rejection, no sorting
Modern Anti-Spam Protection - Rejection, no sortingModern Anti-Spam Protection - Rejection, no sorting
Modern Anti-Spam Protection - Rejection, no sorting
 
Modernes Anti-Spam - Abweisen, nicht sortieren
Modernes Anti-Spam - Abweisen, nicht sortierenModernes Anti-Spam - Abweisen, nicht sortieren
Modernes Anti-Spam - Abweisen, nicht sortieren
 

Kürzlich hochgeladen

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 

Archiving Documents

  • 1. Thomas Stensitzki, Principal Enterprise Consultant Long Time Preservation The Importance of Archiving
  • 2. Agenda Long-Term preservation Why should/must items be archived? What should/must items be archived? How can archiving be done? 1 2 3 4
  • 3. Terms Outsourcing, Filing, Backup, Archiving Outsourcing - Data (e.g. of a specific period) is being exported from a source system and converted (if required) - Outsourced data is not available in the source system - Outsourced data can be backed up or archived - Importing of outsourced data might require conversion, when the target data structure is different Filing - Storage of objects in a folder of the file system - Filed objects can be backed up or archived depended on their file location
  • 4. Terms Outsourcing, Filing, Backup, Archiving Backup - Copy of existing objects to a storage medium to be able to restore data in the case of data corruption or accidental deletion - Performed periodically - Storage medium is being overwritten in time, older version of an object can therefore not be restored - Old versions of an object can be restored for a specific period only Archiving - Copy of a file or document to an external storage medium - Standardized file format (tif, jpg) (if required) - Storage for a longer period
  • 5. Terms Document management vs. Long-term preservation Document management - Management of documents being edited using Check-In, Check-Out and Versioning - Documents can be found by attribute value search or full-text search - Attributes and document links are managed by DMS - Documents are stored in the file system or a DMS database
  • 6. Terms Document management vs. Long-term preservation Long-term preservation - Auditable and unchangeable storage of completed objects for a long time - Copy of objects (e.g. files, documents) to an external storage medium - Files and raw data are archived in original format - Documents are converted and archived in standardized format (black/white = TIF, colour = JPEG or PDF/A) - Document lookup via index - Archived files and raw data can be provided in original format - Archived documents can be provided using a viewer software
  • 7. Terms Long-term preservation Digital archiving - Database-driven, long-term, secure and unchangeable storage of digital information objects which are reproducible at any time Digital long-term preservation - Storage of digital information for a period longer than 10 years Auditable digital archiving - Storage of digital business-related information of in accordance to the requirements of - Handelsgesetzbuch § 239, § 257 HGB - Abgabeordnung § 146, §147, § 200 AO - GoBS - Secure and orderly storage of business-related documents with retention periods of six to ten years
  • 8. Why Sources of documents/objects Documents, lifecycle of documents - Creation and editing documents: in process (e.g. DMS, SharePoint) - Completed documents: final version of a document - Additional editing creates new version Other documents - Correspondence, reports, rules, pictures, films, letters, invoices, quotations, certificates from different sources Workflows - Information from workflow based systems (with digital signatures) - Final document can be created from related data as the final workflow step IT systems - Raw data is usually available in databases or files
  • 9. Why Dealing with documents/objects Documents - Documents in process and/or final documents are stored in DMS, SharePoint or a disk drive (local or network share) - Documents stored on network shares are backup automatically - Documents in SharePoint and emails in Outlook are deleted after retention period has expired - Deleted documents on a network share cannot be restored after the backup period as exceeded - Final documents signed by hand are archived in paper and/or scanned to PDF and stored as file (attached to an email)
  • 10. Why Dealing with documents/objects Other documents - Emails are deleted from the inbox automatically after retention period has expired - Reports, images, films, invoices, quotations, certificates, etc. available as files are be considered as documents - Documents in paper, e.g. correspondence, letters, certificates, etc. are stored in files
  • 11. Why Dealing with documents/objects Workflow vs. documents - Information created in workflow systems is stored with data of digital signatures in databases - All data of a finalized workflow is stored digitally within the database (usually), final document can be created using a template - Print-out is treated as a copy of the original digital document - Digitally signed documents are treated equally to documents signed by hand IT systems vs. raw data - Raw data is stored in databases or files which grow over time - Data can be outsourced or exported to reduce the storage size, but the data is not instantly accessible for the application - Software manufacturers must guarantee that release changes do not impact the capability to import outsourced data
  • 12. Why Legal and regulatory requirements for archiving Legal requirements for business documents - Handelsgesetzbuch (HGB) § 257 regulates which business documents have to be archived - Legal retention period for business letters is 6 years, for other documents 10 years - Abgabenordnung (AO) §§ 146, 147 describe similar requirements for administrative regulations - Digitally archiving of those documents must comply to the principles of proper accounting (GoB) and GoBS which describe the requirements for process documentation - Process documentation is the proof of correct operation of the system and describes the overall organizational and technical process of archiving (collection, indexing, storage, retrieval, protection against loss / corruption and reproduction of archived information)
  • 13. Why Legal and regulatory requirements for archiving - Digitally signed documents are legally binding as well as conventional paper documents - Each country has different requirements depending on the business of the company (e.g. Sarbanes-Oxley Act regarding internal controlling) - Subject to audits and inspections
  • 14. Why Legal and regulatory requirements for archiving Industry-specific requirements for documentation / archiving - Gefahrengutverordnung (GGAV) - Environmental liability and product liability law - Operational directives and regulations - Good Practice quality guidelines and regulations - etc. Agree with internal departments (QS, Legal, Controlling) and maybe with authorities on the archiving process
  • 15. What Retention policies for information life-cycle in Outlook and SharePoint Recommendations Outlook Retention period Inbox 60 days Other folders Sent Items Drafts Outbox 2 years Deleted items 7 days Calendar Tasks 2 years Contacts Duration of employment Classes in SharePoint Retention period Standard 2 years Review 7 years Long-Term 10 years
  • 16. What Which documents and data Business units determine - Which documents have to be archived how and for how long (storage form, file plan, retention periods) - Document classes (logical archive) - Document types - Index data
  • 17. What Requirements Requirements for long-term preservation are specified by the business - Processes, workflows, interfaces - Documents, objects, source, meta data - Archiving period - Regulatory aspects - Permissions, roles, user management, responsibilities - Purpose of archiving (e.g. display of documents in 15 years) - Confidentiality, data integrity, sensitive data, availability - Capacity (data volume, number of users, performance) - etc.
  • 18. What Meta data Meta data provides structured index and search capabilities to archived objects - Source of meta data (e.g. master data systems) - Who maintains the master data? - Shall meta data be selected or manually entered? - Is meta data document-dependent? - Is meta data transferred automatically from other systems? - Is an audit-trail required? (Who has changed which meta-data, when, why) Coordination of the meta data in early stages is highly recommended
  • 19. What Requirements If raw data has to be archived - Raw data is stored as is, bit-wise - Primary goal is the ability to import raw data as 1:1 copy of the original data - IT system generating raw data must be able to handle imported raw data even after a long time - Format of raw data must be coordinated - Software manufacturers must guarantee that release changes do not impact the capability to import outsourced data - Meta data must be defined - Processing of long-term preserved raw data is the responsibility of the generating IT system, not of the archiving system
  • 20. How Technical aspects Selection of eligible file formats - Should the document be displayed as original incl. embedded graphics? - Should reproduce the original document properties (paper size, font size, header, footer, logos, color, hand-written notes, etc.)? - Should documents be archived in different formats but with same content (e.g. XML and graphic)? - Legal requirements? - Is “loss of information” acceptable when converting into graphical representations (jpeg)? - Is the converting process revision-safe? - Is the archived document format suitable for the archiving period?
  • 21. How BSI approved formats Graphics - TIFF, storage of screened black-white images - JPEG, storage of colour and gray scale images Structure formats - XML, can be used for long-term preservation of digital documents Schema and layout have to be archived as well - PDF/A, subset of PDF, standardized for long-term preservation Format with structure and layout information and graphical objects Documents must be validated to be PDF/A compliant Page  21
  • 22. How Storage media Possible storage media - Paper - Microfilm - Magnetic tapes, floppy disks - Optical storage media (e.g. CD-R, CD-ROM, DVD, WORM) - Hard drives - etc. Selected media types have a limited lifetime and durability. Long-term preserved objects must be copied to new media unchanged, if required due to technology related changes in the storage media.
  • 23. How Additional topics - Storage of sensitive data - Restart of the archiving system after system outage in a disaster - Integration in current IT environment - Migration of archived objects is expensive depending on data volume - User management - Usage of storage media must be regulated - Firewall based separation of archiving system - Long-Term archiving solution should be in use for a long time, supplier selection should be aware of this
  • 24. How Pros & Cons Pros  Single storage of documents/objects  Save storage space  Documents/objects available to authorized persons  Documents/objects available from every workplace  Structured search of documents/objects Cons  Usage of source documents must be regulated  Personal must be trained (end-user, administrator)  On-going maintenance costs  Complex IT system and IT infrastructure required
  • 25. We would be happy to help. Do You Have Any Questions? http://www.granikos.eu info@granikos.eu @Granikos_DE