SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Lucia Stefan
Archiva Ltd.
London, UK
 Definitia populara:
 depozitare intr-un spatiu exterior
 Scanare si depozitare filelor in
afara sistemului curent
 Arhive de tip zip
 ..... Etc
Se face confuzie intre custodie
si arhivare
 Arhivarea presupune analiza, selectia si
organizarea documentelor electronica de
arhivat, adica definirea:
 Materialul electronic de arhivat
 Procedeelor de arhivare si cautare
 Accesului la materialul arhivat
 Rolurilor participantilor la arhivare
 Definirea standardelor applicate
Prin arhivare, se urmareste ca informatia
sa pastreze urmatoarele caracteristici
(conform ISO 15489, partea 1):
 Autenticitate
 Credibilitate
 Integritate
 Utilizabilitate
 Ref:
Deteriorarea suportului:
 Suportul electronic gen CD, DVD se mentine
maximum 10 ani spre deosebire de piatra
(milenii), pergament (1000 ani), hartie (200 ani)
microfisa (100 ani)
Deteriorarea Hardware
Invechirea sistemului de operare si a
softurilor
Evolutia tehnologica
Invechirea formatelor electronice
Pierderea contextului
S-au propus trei solutii:
1.Muzeu al vechilor sisteme:
hardware, OS si software
2.Simularea (emulation)
3.Migrarea documentelor la intervale
regulate de timp
1 nu este solutie realista
2 si 3 optiuni viabile
Arhivistica traditionala: proces pasiv
Arhivistica electronica: proces activ
Arhivare pasiva
Selectia materialui
Transfer
Arhivare activa
Regasirea materialui
Prezentarea
materialui
Ingest – migrarea catre alte sisteme
- reconcilierea
Administrarea suportului electronic
(media)
- administrarea copiilor si back-upuriloe
- evitarea degradarii suportului si format
obsolescence
Administrarea formatului filei
- Conversia la alte formate
- Gestionarea copiilor in diverse formate ale
aceluiasi document
Mecanismele de autentificare
Securitatea
Audit si certificare: pentru arhive de
incredere
Identificarea materialului de arhiva
Controlul extern (legislatie, regulamete,
control managerial)
Definirea produsului final (in ce format,
media, etc) materialul va fi arhivat
Resurse necesare (umane, infrastructura,
tehnologie)
evitarea degradarii suportului si format
obsolescence
Criterii de alegere a formatului media:
-longevitate: optim 10 ani
-capacitate: adaptata informatiei de arhivat
-viabilitate: evitarea si detectarea erorilor
-adoptie: cat este acceptat de industrie
-cost
-rezistenta la distrugere fizica
Format nativ sau redare in alt format? (ex
format nativ word sau redare in pdf/a)
Criterii de alegere:
- adoptie
- platform independent
- open stadard
- transparenta
- Metadata support
In practica: formatele MS Office, XML, PDF/A,
JPEG
Unitatea de schimb dintre OAIS si mediul
exterior sunt Pachetul de Informatii
(Information Package).
Tipuri de pachete
 Submission Information Package (SIP)
 Archival Information Package (AIP)
 Dissemination Information Package (DIP)
Un pachet de informatii este un container
conceptual de doua tipuri de informatii:
 Continutul (Content Information)
 Informatia descriptiva de arhivare (Preservation
Description Information (PDI).
• Long-term archive and catalogue provided by EUMETSAT Unified Archive
and Retrieval Facility (U-MARF) including on-line catalogue access and
user services.
Metadata for
Cataloguing
Products
User
Search
Browse
Order
Formatting &
media delivery
workshop PFD
MSG Ground Segment
MTP Ground Segment
EPS Ground Segment
Near line archive
on tape
• Technologies:
– Server: SUN Cluster environment and Solaris 10.
– GS communication via redundant Ethernet networks with
automatic failover.
– Internal Gigabit Ethernet network.
– RAID array: EMC2 (CX600 - 35 TBytes).
– SAN using Fibre Channel.
– Archive library and tapes:
• Sun StorageTek SL8500 (1448 slots enabled – 724 TByte
native, 1.5 PByte compressed).
• Primary media: Sun StorageTeek T10000 (12 drives).
• Secondary media: LTO-4 (4 drives) (LTO-2 4 drives).
– Archive HSM: SAM-FS (previously AMASS).
– Distribution media: DVD, DAT(DDS-4), DLT (DLT8000), LTO-2, -4.
– Compression: BZIP2,GZIP,PKZIP,UNIX Compress.
– Catalogue DBMS: Oracle 11.
– Internal communication between subsystems: Corba (TAO).
– Bespoke SW: C++, Java 5.
– Web Services, OGC standards
• Data access and interoperability:
– Data access through catalogue of metadata and catalogue web
portal user access layer:
• Catalogue search and browse information;
• Ordering and follow-up;
• User management functions.
– Implementation of international inter-operability access planned
until end of 2009: ISO metadata standards, ESA HMA / OGC
standards, WMO Information System standards.
• Reprocessing:
– Specific API and internal communication network with Mission
G/S allow optimal retrieval of products for reprocessing.
– Auxiliary data stored in U-MARF and retrieved for reprocessing.
– U-MARF catalogue products versioning scheme.
• Archives data exploitation aspects (based on operational
status from 2007):
– Performance figures & requirements:
• Ingestion throughputs and timelines from GS (re-)processing
facilities:
– Average daily ingestion rate: 186 Gbytes (EPS), 108 GBytes
(Meteosat).
• Retrieval performances (under full ingestion rate):
– over 1800 registered users for the on-line ordering (about 50
new users per month);
• Archive and catalogue capacity sizing:
– archive holding: about 375 TBytes, > 6,900,000 files;
– incremental growth: to day’s configuration 3.2 PBytes.
– Other salient operational aspects:
• Availability constraints:
– max 10 minutes downtime (fulfilled by concept of back-up
ingestion chain).
• providing 2 days autonomy of operations on bulk orders).
• Distinct OPEration and VALidation configurations.
Intrebari?
lucia.stefan@uclmail.net
Multumesc

Weitere ähnliche Inhalte

Ähnlich wie Introducere in arhivistica digitala

2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics ClassCourtney Mumma
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330glorykim
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330광영 김
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for PreservationStephen Gray
 
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...OpenAIRE
 
The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...ManjulaPatel
 
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...IFLAAcademicandResea
 
Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)Simeon Warner
 
Archiver at CS3 - Cloud Storage Synchronization and Sharing Services
Archiver at CS3 - Cloud Storage Synchronization and Sharing ServicesArchiver at CS3 - Cloud Storage Synchronization and Sharing Services
Archiver at CS3 - Cloud Storage Synchronization and Sharing ServicesArchiver
 
Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...
Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...
Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...Archiver
 
The Reference Model for an Open Archival Information System (OAIS)
The Reference Model for an Open Archival Information System (OAIS)The Reference Model for an Open Archival Information System (OAIS)
The Reference Model for an Open Archival Information System (OAIS)Michael Day
 
Archival Technologies
Archival TechnologiesArchival Technologies
Archival TechnologiesCliff Landis
 
Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...
Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...
Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...EDINA, University of Edinburgh
 
Repositories and digital preservation
Repositories and digital preservationRepositories and digital preservation
Repositories and digital preservationMichael Day
 
The Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservationThe Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservationSimeon Warner
 

Ähnlich wie Introducere in arhivistica digitala (20)

2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
 
Completepresentation
CompletepresentationCompletepresentation
Completepresentation
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for Preservation
 
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
 
The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...
 
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
IFLA ARL Webinar Series: Digital Preservation - Managing Publications and Dat...
 
Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)
 
Archiver at CS3 - Cloud Storage Synchronization and Sharing Services
Archiver at CS3 - Cloud Storage Synchronization and Sharing ServicesArchiver at CS3 - Cloud Storage Synchronization and Sharing Services
Archiver at CS3 - Cloud Storage Synchronization and Sharing Services
 
Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...
Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...
Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...
 
Personal Digital Archiving 2015 - NYU - Workshop
Personal Digital Archiving 2015 - NYU - WorkshopPersonal Digital Archiving 2015 - NYU - Workshop
Personal Digital Archiving 2015 - NYU - Workshop
 
Trm Vilnius Oais New
Trm Vilnius Oais NewTrm Vilnius Oais New
Trm Vilnius Oais New
 
The Reference Model for an Open Archival Information System (OAIS)
The Reference Model for an Open Archival Information System (OAIS)The Reference Model for an Open Archival Information System (OAIS)
The Reference Model for an Open Archival Information System (OAIS)
 
Towards a Common Approach for Access to Digital Archival Records in Europe. A...
Towards a Common Approach for Access to Digital Archival Records in Europe. A...Towards a Common Approach for Access to Digital Archival Records in Europe. A...
Towards a Common Approach for Access to Digital Archival Records in Europe. A...
 
Archival Technologies
Archival TechnologiesArchival Technologies
Archival Technologies
 
Caplan and York, 'What It Takes To Make It Last: E-Resources Preservation"
Caplan and York, 'What It Takes To Make It Last:  E-Resources Preservation"Caplan and York, 'What It Takes To Make It Last:  E-Resources Preservation"
Caplan and York, 'What It Takes To Make It Last: E-Resources Preservation"
 
Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...
Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...
Ensuring Continuing Access to Online Scholarly Resources Stewardship & Servic...
 
Repositories and digital preservation
Repositories and digital preservationRepositories and digital preservation
Repositories and digital preservation
 
The Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservationThe Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservation
 

Mehr von Lucia Stefan

Cum Se Implementeaza un ECM
Cum Se Implementeaza un ECMCum Se Implementeaza un ECM
Cum Se Implementeaza un ECMLucia Stefan
 
Moreq2 Romanian Framework
Moreq2 Romanian FrameworkMoreq2 Romanian Framework
Moreq2 Romanian FrameworkLucia Stefan
 
Records Management Models
Records Management ModelsRecords Management Models
Records Management ModelsLucia Stefan
 
Preservation Issues
Preservation IssuesPreservation Issues
Preservation IssuesLucia Stefan
 
Moreq2 Translation
Moreq2 TranslationMoreq2 Translation
Moreq2 TranslationLucia Stefan
 

Mehr von Lucia Stefan (6)

Cum Se Implementeaza un ECM
Cum Se Implementeaza un ECMCum Se Implementeaza un ECM
Cum Se Implementeaza un ECM
 
Moreq2 Romanian Framework
Moreq2 Romanian FrameworkMoreq2 Romanian Framework
Moreq2 Romanian Framework
 
Records Management Models
Records Management ModelsRecords Management Models
Records Management Models
 
Preservation Issues
Preservation IssuesPreservation Issues
Preservation Issues
 
What is Moreq2
What is Moreq2What is Moreq2
What is Moreq2
 
Moreq2 Translation
Moreq2 TranslationMoreq2 Translation
Moreq2 Translation
 

Introducere in arhivistica digitala

  • 2.  Definitia populara:  depozitare intr-un spatiu exterior  Scanare si depozitare filelor in afara sistemului curent  Arhive de tip zip  ..... Etc Se face confuzie intre custodie si arhivare
  • 3.  Arhivarea presupune analiza, selectia si organizarea documentelor electronica de arhivat, adica definirea:  Materialul electronic de arhivat  Procedeelor de arhivare si cautare  Accesului la materialul arhivat  Rolurilor participantilor la arhivare  Definirea standardelor applicate
  • 4. Prin arhivare, se urmareste ca informatia sa pastreze urmatoarele caracteristici (conform ISO 15489, partea 1):  Autenticitate  Credibilitate  Integritate  Utilizabilitate  Ref:
  • 5. Deteriorarea suportului:  Suportul electronic gen CD, DVD se mentine maximum 10 ani spre deosebire de piatra (milenii), pergament (1000 ani), hartie (200 ani) microfisa (100 ani) Deteriorarea Hardware Invechirea sistemului de operare si a softurilor Evolutia tehnologica Invechirea formatelor electronice Pierderea contextului
  • 6. S-au propus trei solutii: 1.Muzeu al vechilor sisteme: hardware, OS si software 2.Simularea (emulation) 3.Migrarea documentelor la intervale regulate de timp 1 nu este solutie realista 2 si 3 optiuni viabile Arhivistica traditionala: proces pasiv Arhivistica electronica: proces activ
  • 7. Arhivare pasiva Selectia materialui Transfer Arhivare activa Regasirea materialui Prezentarea materialui
  • 8.
  • 9. Ingest – migrarea catre alte sisteme - reconcilierea Administrarea suportului electronic (media) - administrarea copiilor si back-upuriloe - evitarea degradarii suportului si format obsolescence Administrarea formatului filei - Conversia la alte formate - Gestionarea copiilor in diverse formate ale aceluiasi document
  • 10. Mecanismele de autentificare Securitatea Audit si certificare: pentru arhive de incredere
  • 11. Identificarea materialului de arhiva Controlul extern (legislatie, regulamete, control managerial) Definirea produsului final (in ce format, media, etc) materialul va fi arhivat Resurse necesare (umane, infrastructura, tehnologie)
  • 12. evitarea degradarii suportului si format obsolescence Criterii de alegere a formatului media: -longevitate: optim 10 ani -capacitate: adaptata informatiei de arhivat -viabilitate: evitarea si detectarea erorilor -adoptie: cat este acceptat de industrie -cost -rezistenta la distrugere fizica
  • 13. Format nativ sau redare in alt format? (ex format nativ word sau redare in pdf/a) Criterii de alegere: - adoptie - platform independent - open stadard - transparenta - Metadata support In practica: formatele MS Office, XML, PDF/A, JPEG
  • 14. Unitatea de schimb dintre OAIS si mediul exterior sunt Pachetul de Informatii (Information Package). Tipuri de pachete  Submission Information Package (SIP)  Archival Information Package (AIP)  Dissemination Information Package (DIP) Un pachet de informatii este un container conceptual de doua tipuri de informatii:  Continutul (Content Information)  Informatia descriptiva de arhivare (Preservation Description Information (PDI).
  • 15.
  • 16. • Long-term archive and catalogue provided by EUMETSAT Unified Archive and Retrieval Facility (U-MARF) including on-line catalogue access and user services. Metadata for Cataloguing Products User Search Browse Order Formatting & media delivery workshop PFD MSG Ground Segment MTP Ground Segment EPS Ground Segment Near line archive on tape
  • 17. • Technologies: – Server: SUN Cluster environment and Solaris 10. – GS communication via redundant Ethernet networks with automatic failover. – Internal Gigabit Ethernet network. – RAID array: EMC2 (CX600 - 35 TBytes). – SAN using Fibre Channel. – Archive library and tapes: • Sun StorageTek SL8500 (1448 slots enabled – 724 TByte native, 1.5 PByte compressed). • Primary media: Sun StorageTeek T10000 (12 drives). • Secondary media: LTO-4 (4 drives) (LTO-2 4 drives). – Archive HSM: SAM-FS (previously AMASS). – Distribution media: DVD, DAT(DDS-4), DLT (DLT8000), LTO-2, -4. – Compression: BZIP2,GZIP,PKZIP,UNIX Compress. – Catalogue DBMS: Oracle 11. – Internal communication between subsystems: Corba (TAO). – Bespoke SW: C++, Java 5. – Web Services, OGC standards
  • 18. • Data access and interoperability: – Data access through catalogue of metadata and catalogue web portal user access layer: • Catalogue search and browse information; • Ordering and follow-up; • User management functions. – Implementation of international inter-operability access planned until end of 2009: ISO metadata standards, ESA HMA / OGC standards, WMO Information System standards. • Reprocessing: – Specific API and internal communication network with Mission G/S allow optimal retrieval of products for reprocessing. – Auxiliary data stored in U-MARF and retrieved for reprocessing. – U-MARF catalogue products versioning scheme.
  • 19. • Archives data exploitation aspects (based on operational status from 2007): – Performance figures & requirements: • Ingestion throughputs and timelines from GS (re-)processing facilities: – Average daily ingestion rate: 186 Gbytes (EPS), 108 GBytes (Meteosat). • Retrieval performances (under full ingestion rate): – over 1800 registered users for the on-line ordering (about 50 new users per month); • Archive and catalogue capacity sizing: – archive holding: about 375 TBytes, > 6,900,000 files; – incremental growth: to day’s configuration 3.2 PBytes. – Other salient operational aspects: • Availability constraints: – max 10 minutes downtime (fulfilled by concept of back-up ingestion chain). • providing 2 days autonomy of operations on bulk orders). • Distinct OPEration and VALidation configurations.

Hinweis der Redaktion

  1. Architecture drivers: Data driven behaviour SW architecture breakdown in subsystems communicating between subsystems through CORBA but only for invocation of a service from another subsystem (ie. CORBA not used for the transfer of data files, only the references to these files) ftp transfers for data exchanges (as high throughputs required)