SlideShare ist ein Scribd-Unternehmen logo
Migration off DSpace 1.8.2
Max Sullivan
Digital Research Services
Coordinator
max.sullivan@vuw.ac.nz
Stuart Yeates
Library Infrastructure Engineer
stuart.yeates@vuw.ac.nz
Overview
• Four instances of DSpace 1.8.2
• Both OS and application software
out of support
• All content exported as packages
containing mets.xml and files.
• Import to DSpace 6 via METS
package and app using SWORD.
Issue 1 – SWORD Metadata – Stuart
• SWORD transfers METS packages
• SWORD originated in EPrints
• → SWORD METS metadata defaults to EPDCX
– "EPrints Dublin Core X"
– Weird encoding of basic DC
• METS supports every metadata schema
• Determining which is used is hard
• →We stripped 95% of it out for debugging
Issue 2 – Excel Identifiers – Stuart
• Exporting from one DSpace into another
overwrites identifiers by default
• Identifiers needed to build httpd redirects
• Used DSpace CSV metadata export and Excel
'match - lookup' to reconstruct the identifiers
from other metadata.
• There are good tutorials on the web
Issue 2 – Excel Identifiers – Stuart
1. Load CSVs into separate sheets in Excel
2. Bring first identifier into blank third sheet
3. Build a compound key from other columns
4. Use compound key to find row in other sheet
5. Bring second identifier to third sheet
6. Use two identifiers to build redirects
7. Cut-n-paste redirects to httpd config files
Issue 3 – File Ownership Conflicts –
Max
• Migrated 10,000 items via command line
import script as the wrong Linux user.
• This caused intermittent errors when
attaching files to deposits.
chown –R tomcat:tomcat /path/to/assetstore
Issue 4 – Permissions – Max
• Double access policies.
– Default policy from collection
– Policy set in mets.xml
• Any ideas on how to avoid this?
Wrap up and questions
• Setting up redirects for public thesis handle URLs
• Upgrade DSpace in the future, we hope
?

Weitere ähnliche Inhalte

Was ist angesagt?

Directory services
Directory servicesDirectory services
Directory services
Christalin Nelson
 
Relational RDBMS : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS  : MySQL, PostgreSQL and SQL SERVERRelational RDBMS  : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS : MySQL, PostgreSQL and SQL SERVER
Dalila Chouaya
 
EDF2012 Simon Riggs - Open Data, Open Database: PostgreSQL
EDF2012  Simon Riggs - Open Data, Open Database: PostgreSQLEDF2012  Simon Riggs - Open Data, Open Database: PostgreSQL
EDF2012 Simon Riggs - Open Data, Open Database: PostgreSQL
European Data Forum
 
Database History From Codd to Brewer
Database History From Codd to BrewerDatabase History From Codd to Brewer
Database History From Codd to Brewer
OpenSource Connections
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
Dr-Dipali Meher
 
Ado Fundamentals
Ado FundamentalsAdo Fundamentals
Ado Fundamentals
asim78
 
Oracle Database Introduction
Oracle Database IntroductionOracle Database Introduction
Oracle Database Introduction
Chhom Karath
 
Directory services by SAJID
Directory services by SAJIDDirectory services by SAJID
Directory services by SAJID
Sajid khan
 
Rise of NewSQL
Rise of NewSQLRise of NewSQL
Rise of NewSQL
Sushant Choudhary
 
Oracle database introduction
Oracle database introductionOracle database introduction
Oracle database introduction
Mohammad Javad Beheshtian
 
Introduction to Oracle Database
Introduction to Oracle DatabaseIntroduction to Oracle Database
Introduction to Oracle Database
puja_dhar
 
Function
FunctionFunction
Function
rey501
 
SNAPL Network Verification
SNAPL Network VerificationSNAPL Network Verification
SNAPL Network Verification
aurojit
 
Data models in NoSQL
Data models in NoSQLData models in NoSQL
Data models in NoSQL
Dr-Dipali Meher
 
Design Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational DatabasesDesign Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational Databases
guestdfd1ec
 
Database Fundamental Concepts- Series 1 - Performance Analysis
Database Fundamental Concepts- Series 1 - Performance AnalysisDatabase Fundamental Concepts- Series 1 - Performance Analysis
Database Fundamental Concepts- Series 1 - Performance Analysis
DAGEOP LTD
 
Decentrailzed File Sharing
Decentrailzed File SharingDecentrailzed File Sharing
Decentrailzed File Sharing
Jayashree Saraf
 
NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages
NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages  NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages
NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages
DATAVERSITY
 
Sqlite
SqliteSqlite
Sqlite
Kumar
 
X.500 More Than a Global Directory
X.500 More Than a Global DirectoryX.500 More Than a Global Directory
X.500 More Than a Global Directory
lurdhu agnes
 

Was ist angesagt? (20)

Directory services
Directory servicesDirectory services
Directory services
 
Relational RDBMS : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS  : MySQL, PostgreSQL and SQL SERVERRelational RDBMS  : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS : MySQL, PostgreSQL and SQL SERVER
 
EDF2012 Simon Riggs - Open Data, Open Database: PostgreSQL
EDF2012  Simon Riggs - Open Data, Open Database: PostgreSQLEDF2012  Simon Riggs - Open Data, Open Database: PostgreSQL
EDF2012 Simon Riggs - Open Data, Open Database: PostgreSQL
 
Database History From Codd to Brewer
Database History From Codd to BrewerDatabase History From Codd to Brewer
Database History From Codd to Brewer
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Ado Fundamentals
Ado FundamentalsAdo Fundamentals
Ado Fundamentals
 
Oracle Database Introduction
Oracle Database IntroductionOracle Database Introduction
Oracle Database Introduction
 
Directory services by SAJID
Directory services by SAJIDDirectory services by SAJID
Directory services by SAJID
 
Rise of NewSQL
Rise of NewSQLRise of NewSQL
Rise of NewSQL
 
Oracle database introduction
Oracle database introductionOracle database introduction
Oracle database introduction
 
Introduction to Oracle Database
Introduction to Oracle DatabaseIntroduction to Oracle Database
Introduction to Oracle Database
 
Function
FunctionFunction
Function
 
SNAPL Network Verification
SNAPL Network VerificationSNAPL Network Verification
SNAPL Network Verification
 
Data models in NoSQL
Data models in NoSQLData models in NoSQL
Data models in NoSQL
 
Design Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational DatabasesDesign Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational Databases
 
Database Fundamental Concepts- Series 1 - Performance Analysis
Database Fundamental Concepts- Series 1 - Performance AnalysisDatabase Fundamental Concepts- Series 1 - Performance Analysis
Database Fundamental Concepts- Series 1 - Performance Analysis
 
Decentrailzed File Sharing
Decentrailzed File SharingDecentrailzed File Sharing
Decentrailzed File Sharing
 
NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages
NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages  NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages
NoSQL Now! Webinar Series: Innovations in NoSQL Query Languages
 
Sqlite
SqliteSqlite
Sqlite
 
X.500 More Than a Global Directory
X.500 More Than a Global DirectoryX.500 More Than a Global Directory
X.500 More Than a Global Directory
 

Ähnlich wie Migration off DSpace 1.8.2

D Maeda Bi Portfolio
D Maeda Bi PortfolioD Maeda Bi Portfolio
D Maeda Bi Portfolio
DMaeda
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
Bigstep
 
Jagadish-New
Jagadish-NewJagadish-New
Sql server etl framework
Sql server etl frameworkSql server etl framework
Sql server etl framework
nijs
 
How Teradata uses Stacki
How Teradata uses StackiHow Teradata uses Stacki
How Teradata uses Stacki
StackIQ
 
PL SQLDay Machine Learning- Hands on ML.NET.pptx
PL SQLDay Machine Learning- Hands on ML.NET.pptxPL SQLDay Machine Learning- Hands on ML.NET.pptx
PL SQLDay Machine Learning- Hands on ML.NET.pptx
Luis Beltran
 
Ssn0020 ssis 2012 for beginners
Ssn0020   ssis 2012 for beginnersSsn0020   ssis 2012 for beginners
Ssn0020 ssis 2012 for beginners
Antonios Chatzipavlis
 
Resume-SystemsDBA-Brian Wigton
Resume-SystemsDBA-Brian WigtonResume-SystemsDBA-Brian Wigton
Resume-SystemsDBA-Brian Wigton
Brian Wigton
 
C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...
C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...
C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...
DataStax Academy
 
Resume april updated
Resume april updatedResume april updated
Resume april updated
Sukanta Saha
 
Resume
ResumeResume
Resume
Sukanta Saha
 
Resume_sukanta_updated
Resume_sukanta_updatedResume_sukanta_updated
Resume_sukanta_updated
Sukanta Saha
 
Resume_APRIL_updated
Resume_APRIL_updatedResume_APRIL_updated
Resume_APRIL_updated
Sukanta Saha
 
Demystifying Containerization Principles for Data Scientists
Demystifying Containerization Principles for Data ScientistsDemystifying Containerization Principles for Data Scientists
Demystifying Containerization Principles for Data Scientists
Dr Ganesh Iyer
 
TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...
TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...
TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...
Trivadis
 
Monitoring microservices: Docker, Mesos and Kubernetes visibility at scale
Monitoring microservices: Docker, Mesos and Kubernetes visibility at scaleMonitoring microservices: Docker, Mesos and Kubernetes visibility at scale
Monitoring microservices: Docker, Mesos and Kubernetes visibility at scale
Alessandro Gallotta
 
Docker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup SlidesDocker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup Slides
Docker, Inc.
 
Yat (Calvin) Chow Resume - Oct 28 2016
Yat (Calvin) Chow Resume - Oct 28 2016Yat (Calvin) Chow Resume - Oct 28 2016
Yat (Calvin) Chow Resume - Oct 28 2016
Calvin Chow
 
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
{code} by Dell EMC
 
Steve Molzen Resume 2016
Steve Molzen Resume 2016Steve Molzen Resume 2016
Steve Molzen Resume 2016
Steven Molzen
 

Ähnlich wie Migration off DSpace 1.8.2 (20)

D Maeda Bi Portfolio
D Maeda Bi PortfolioD Maeda Bi Portfolio
D Maeda Bi Portfolio
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
 
Jagadish-New
Jagadish-NewJagadish-New
Jagadish-New
 
Sql server etl framework
Sql server etl frameworkSql server etl framework
Sql server etl framework
 
How Teradata uses Stacki
How Teradata uses StackiHow Teradata uses Stacki
How Teradata uses Stacki
 
PL SQLDay Machine Learning- Hands on ML.NET.pptx
PL SQLDay Machine Learning- Hands on ML.NET.pptxPL SQLDay Machine Learning- Hands on ML.NET.pptx
PL SQLDay Machine Learning- Hands on ML.NET.pptx
 
Ssn0020 ssis 2012 for beginners
Ssn0020   ssis 2012 for beginnersSsn0020   ssis 2012 for beginners
Ssn0020 ssis 2012 for beginners
 
Resume-SystemsDBA-Brian Wigton
Resume-SystemsDBA-Brian WigtonResume-SystemsDBA-Brian Wigton
Resume-SystemsDBA-Brian Wigton
 
C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...
C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...
C* Summit 2013: Searching for a Needle in a Big Data Haystack by Jason Ruther...
 
Resume april updated
Resume april updatedResume april updated
Resume april updated
 
Resume
ResumeResume
Resume
 
Resume_sukanta_updated
Resume_sukanta_updatedResume_sukanta_updated
Resume_sukanta_updated
 
Resume_APRIL_updated
Resume_APRIL_updatedResume_APRIL_updated
Resume_APRIL_updated
 
Demystifying Containerization Principles for Data Scientists
Demystifying Containerization Principles for Data ScientistsDemystifying Containerization Principles for Data Scientists
Demystifying Containerization Principles for Data Scientists
 
TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...
TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...
TechEvent 2019: Oracle to PostgreSQL - a Travel Guide from Practice; Roland S...
 
Monitoring microservices: Docker, Mesos and Kubernetes visibility at scale
Monitoring microservices: Docker, Mesos and Kubernetes visibility at scaleMonitoring microservices: Docker, Mesos and Kubernetes visibility at scale
Monitoring microservices: Docker, Mesos and Kubernetes visibility at scale
 
Docker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup SlidesDocker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup Slides
 
Yat (Calvin) Chow Resume - Oct 28 2016
Yat (Calvin) Chow Resume - Oct 28 2016Yat (Calvin) Chow Resume - Oct 28 2016
Yat (Calvin) Chow Resume - Oct 28 2016
 
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
 
Steve Molzen Resume 2016
Steve Molzen Resume 2016Steve Molzen Resume 2016
Steve Molzen Resume 2016
 

Mehr von stuartayeates

Conflict between the DOAJ acceptance criteria and the C4DISC principles
Conflict between the DOAJ acceptance criteria and the C4DISC principlesConflict between the DOAJ acceptance criteria and the C4DISC principles
Conflict between the DOAJ acceptance criteria and the C4DISC principles
stuartayeates
 
Working with Wikipedia, ResBaz Wellington 2017
Working with Wikipedia, ResBaz Wellington 2017Working with Wikipedia, ResBaz Wellington 2017
Working with Wikipedia, ResBaz Wellington 2017
stuartayeates
 
Working with wikipedia presentation for Rezbaz Wellington 2018
Working with wikipedia presentation for Rezbaz Wellington 2018Working with wikipedia presentation for Rezbaz Wellington 2018
Working with wikipedia presentation for Rezbaz Wellington 2018
stuartayeates
 
5 things to think about when starting wikipedia
5 things to think about when starting wikipedia5 things to think about when starting wikipedia
5 things to think about when starting wikipedia
stuartayeates
 
Doing wikipedia when you're not editing wikipedia
Doing wikipedia when you're not editing wikipediaDoing wikipedia when you're not editing wikipedia
Doing wikipedia when you're not editing wikipedia
stuartayeates
 
20 ways to mark up a sentence
20 ways to mark up a sentence20 ways to mark up a sentence
20 ways to mark up a sentence
stuartayeates
 
eBooks: Why they break ISBNs
eBooks: Why they break ISBNseBooks: Why they break ISBNs
eBooks: Why they break ISBNs
stuartayeates
 

Mehr von stuartayeates (7)

Conflict between the DOAJ acceptance criteria and the C4DISC principles
Conflict between the DOAJ acceptance criteria and the C4DISC principlesConflict between the DOAJ acceptance criteria and the C4DISC principles
Conflict between the DOAJ acceptance criteria and the C4DISC principles
 
Working with Wikipedia, ResBaz Wellington 2017
Working with Wikipedia, ResBaz Wellington 2017Working with Wikipedia, ResBaz Wellington 2017
Working with Wikipedia, ResBaz Wellington 2017
 
Working with wikipedia presentation for Rezbaz Wellington 2018
Working with wikipedia presentation for Rezbaz Wellington 2018Working with wikipedia presentation for Rezbaz Wellington 2018
Working with wikipedia presentation for Rezbaz Wellington 2018
 
5 things to think about when starting wikipedia
5 things to think about when starting wikipedia5 things to think about when starting wikipedia
5 things to think about when starting wikipedia
 
Doing wikipedia when you're not editing wikipedia
Doing wikipedia when you're not editing wikipediaDoing wikipedia when you're not editing wikipedia
Doing wikipedia when you're not editing wikipedia
 
20 ways to mark up a sentence
20 ways to mark up a sentence20 ways to mark up a sentence
20 ways to mark up a sentence
 
eBooks: Why they break ISBNs
eBooks: Why they break ISBNseBooks: Why they break ISBNs
eBooks: Why they break ISBNs
 

Kürzlich hochgeladen

leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
GDSC PJATK
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 

Kürzlich hochgeladen (20)

leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 

Migration off DSpace 1.8.2

  • 1. Migration off DSpace 1.8.2 Max Sullivan Digital Research Services Coordinator max.sullivan@vuw.ac.nz Stuart Yeates Library Infrastructure Engineer stuart.yeates@vuw.ac.nz
  • 2. Overview • Four instances of DSpace 1.8.2 • Both OS and application software out of support • All content exported as packages containing mets.xml and files. • Import to DSpace 6 via METS package and app using SWORD.
  • 3. Issue 1 – SWORD Metadata – Stuart • SWORD transfers METS packages • SWORD originated in EPrints • → SWORD METS metadata defaults to EPDCX – "EPrints Dublin Core X" – Weird encoding of basic DC • METS supports every metadata schema • Determining which is used is hard • →We stripped 95% of it out for debugging
  • 4. Issue 2 – Excel Identifiers – Stuart • Exporting from one DSpace into another overwrites identifiers by default • Identifiers needed to build httpd redirects • Used DSpace CSV metadata export and Excel 'match - lookup' to reconstruct the identifiers from other metadata. • There are good tutorials on the web
  • 5. Issue 2 – Excel Identifiers – Stuart 1. Load CSVs into separate sheets in Excel 2. Bring first identifier into blank third sheet 3. Build a compound key from other columns 4. Use compound key to find row in other sheet 5. Bring second identifier to third sheet 6. Use two identifiers to build redirects 7. Cut-n-paste redirects to httpd config files
  • 6. Issue 3 – File Ownership Conflicts – Max • Migrated 10,000 items via command line import script as the wrong Linux user. • This caused intermittent errors when attaching files to deposits. chown –R tomcat:tomcat /path/to/assetstore
  • 7. Issue 4 – Permissions – Max • Double access policies. – Default policy from collection – Policy set in mets.xml • Any ideas on how to avoid this?
  • 8. Wrap up and questions • Setting up redirects for public thesis handle URLs • Upgrade DSpace in the future, we hope ?