SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
Implementing
Durham E-Theses
Presented by Sebastian Palucha
#rfringe13
CC BY jitze http://www.flickr.com/photos/jitze1942/3521700792
∂
Durham E-Theses
 Initial project spring/summer
2009
 First deposit September 2009
 ~ 300 research theses per year
 Simple deposit, single PDF
 EThOS interoperability
 EPrints 3.1.3 (born 2009)
CC BY didbygraham
http://www.flickr.com/photos/didbygraham/5646920685/
∂
Registered: EThOS, Driver, OCL Digital Gateway (2010 spr.)
EThOS harvest in operation (2010 sum.)
Google Analytics stats (2010 dec.)
EThOS digitised theses loaded (2011 sum.)
Google Custom Search (aut. 2011)
Collaboration with The BL
to improve EThOS services
(aut. 2011 – spr. 2012)
EU/ICO Cookie Law support (2013 sum.)
local digitisation project,
10k (2012 spr2 – )
MySQL migrated to UTF-8 (2013 spring)
Creative Common Licences
introduced (2012 aut.)
CC BY AlishaV http://www.flickr.com/photos/alishav/3156574283
Key milestones
∂
Branding: uniform user experience
• Issues: browsers, branding
changes
• Durham University CMS CSS
• Eprints 3 CSS
∂
Simplistic single PDF deposit
• Details > Upload > Deposit
• LDAP integration + user field population
• Embargo implemented in first screen
CC BY Pink Sherbet Photography
http://www.flickr.com/photos/pinksherbet/236299644
∂
Cover pages
 Highly customized LaTeX code
 Issues with UTF-8 both LaTeX
and plugin
 Issues with dynamic if/else
∂
Google Analytics: full text
downloads
• Two steps:
1. PDF download link (core code)
2. special GA profile
• URL structure include
department codes
?DDD32
• Internal code modification
∂
EThOS interoperability
through OAI-PMH harvest
• Issues with out of the box plug-in, changes to XML schema needed
• uketdterms:qualificationlevel not defined in EPrints data model
• Embargo date not included. Plugin assumes embargo on an record
level, whereas EP on an document level!
• Added department names
• Occasional issues with UTF-8 encoding
∂
EThOS download WS
• Script for mass download https://github.com/paluchas/ethos-bl
groovy EthosDownloadClient.groovy -i 238830 –m download
∂
EThOS avoiding duplication
• We store EThOS persistent IDs
• We modified /cgi/oai2 script to conditionally exclude ethos records
• Modified record can be exposed to EThOS harvest in future
∂
UTF-8 issues
Unknown copy/paste issues
seen:
 OAI/PMH
 Cover Pages LaTeX
 Abstract pages
Solution:
 Code modification
 Whole MySQL database migration to
UTF-8, fortunately double encoding
CC BY familymwr http://www.flickr.com/photos/familymwr/5548057120//
∂
Creative Common Licences
 Approached by student:
specific query about
particular CC to be used
 A lot of redefinition is code
∂
CC outreach
∂
Better search, DRO integration
Google Custom Search with modified search results
∂
Retrospective digitisation project
• 10k paper theses being digitised by local company
• Mass upload with metadata in XML file and digitised material in PDF
files, web and archive version. A lot of metadata and quality issues
• Interesting samples of other materials:
big prints, DVDs, CDs, cassette tapes,
microfilms, small datasets and research software.
∂
EU/ICO Cookies Law
CC BY USAG-Humphreys
http://www.flickr.com/photos/
31687107@N07/6206906748
∂
Repository versus real life
• Users would like to deposit other than PDF files.
• Requested “Dark” storage
• Encrypted PDFs
• Take down requests, and Web cached content. How far should we liaise
with external world
• Some students are not aware about consequences of web deposits: 3rd
party copyright, sensitive data not embargoed etc.
• Disciplinary differences; not only humanities vs. sciences.
• External user requesting contact with author or supervisors
∂
Sustainability
• Operational:
virtualization, operating systems
support, database
• Customization:
Bespoken changes and technology
deficit
• Support:
hard to coordinate across the
University departments
CC BY Rennett Stowe
http://www.flickr.com/photos/tomsaint/4515448425
∂
Future plans
 Review process, be paper free, include pass list, extend workflow to exam
board
 Actively encourage students to use CC licences by demonstrate its benefit
 Encourage deposit of key data sets and explore data visualization
 Migrate to new repository framework
 Integration with Durham University RIS
 Google Analytics live stats, integration with IRUS-UK
CC BY Boston Public Library
http://www.flickr.com/photos/boston_public_library/8902381985/
∂
Repository of the future
CC by http://www.flickr.com/photos/keoni101/7069578953
CC BY Keoni Cabral http://www.flickr.com/photos/52193570@N04/7069578953

Weitere ähnliche Inhalte

Was ist angesagt?

Maurer Presentation - WARCnet Spring Meeting 2021
Maurer Presentation - WARCnet Spring Meeting 2021Maurer Presentation - WARCnet Spring Meeting 2021
Maurer Presentation - WARCnet Spring Meeting 2021WARCnet
 
E-Science Between Grid And Knowledge Management
E-Science Between Grid And Knowledge ManagementE-Science Between Grid And Knowledge Management
E-Science Between Grid And Knowledge ManagementHans-Christoph Hobohm
 
Technology showcase
Technology showcaseTechnology showcase
Technology showcasejoeahearn
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsJon Voss
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Antoine Isaac
 
Stephanie Taylor (UKOLN) – Metadata Forum
Stephanie Taylor (UKOLN) – Metadata ForumStephanie Taylor (UKOLN) – Metadata Forum
Stephanie Taylor (UKOLN) – Metadata ForumRepository Fringe
 
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspectiveGIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspectivePeter Löwe
 
What’s in a URL? Analysing COVID-19 web archive collections
What’s in a URL? Analysing COVID-19 web archive collectionsWhat’s in a URL? Analysing COVID-19 web archive collections
What’s in a URL? Analysing COVID-19 web archive collectionsWARCnet
 
Listening to the library
Listening to the libraryListening to the library
Listening to the libraryKatie Legere
 

Was ist angesagt? (14)

Maurer Presentation - WARCnet Spring Meeting 2021
Maurer Presentation - WARCnet Spring Meeting 2021Maurer Presentation - WARCnet Spring Meeting 2021
Maurer Presentation - WARCnet Spring Meeting 2021
 
ld4dh demo lecture
ld4dh demo lectureld4dh demo lecture
ld4dh demo lecture
 
E-Science Between Grid And Knowledge Management
E-Science Between Grid And Knowledge ManagementE-Science Between Grid And Knowledge Management
E-Science Between Grid And Knowledge Management
 
04 pisa final_event_111214_wp1_dg
04 pisa final_event_111214_wp1_dg04 pisa final_event_111214_wp1_dg
04 pisa final_event_111214_wp1_dg
 
Technology showcase
Technology showcaseTechnology showcase
Technology showcase
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & Museums
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013
 
Stephanie Taylor (UKOLN) – Metadata Forum
Stephanie Taylor (UKOLN) – Metadata ForumStephanie Taylor (UKOLN) – Metadata Forum
Stephanie Taylor (UKOLN) – Metadata Forum
 
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspectiveGIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
 
What’s in a URL? Analysing COVID-19 web archive collections
What’s in a URL? Analysing COVID-19 web archive collectionsWhat’s in a URL? Analysing COVID-19 web archive collections
What’s in a URL? Analysing COVID-19 web archive collections
 
08b final event_experimente
08b final event_experimente08b final event_experimente
08b final event_experimente
 
03 isaac dm2-e14-full
03 isaac dm2-e14-full03 isaac dm2-e14-full
03 isaac dm2-e14-full
 
Listening to the library
Listening to the libraryListening to the library
Listening to the library
 
C:\fakepath\18
C:\fakepath\18C:\fakepath\18
C:\fakepath\18
 

Ähnlich wie Implementing Durham E-Theses

Open Data analysis with EOSC-hub services
Open Data analysis with EOSC-hub servicesOpen Data analysis with EOSC-hub services
Open Data analysis with EOSC-hub servicesOpenAIRE
 
Software for data management and exploitation
Software for data management and exploitationSoftware for data management and exploitation
Software for data management and exploitationEOSC-hub project
 
Clipper dhra 2016
Clipper dhra 2016Clipper dhra 2016
Clipper dhra 2016John Casey
 
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations EDINA, University of Edinburgh
 
Open Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LODOpen Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LODAntoine Isaac
 
IPTC News Exchange Formats Working Party Autumn 2012
IPTC News Exchange Formats Working Party Autumn 2012IPTC News Exchange Formats Working Party Autumn 2012
IPTC News Exchange Formats Working Party Autumn 2012Stuart Myles
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
 
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShareCollaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShareEDINA, University of Edinburgh
 
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare University of Edinburgh
 
Accessibility, Automation and Metadata
Accessibility, Automation and MetadataAccessibility, Automation and Metadata
Accessibility, Automation and Metadatalisbk
 
Unified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AIUnified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AIAlluxio, Inc.
 
Approaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic dataApproaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic dataChris Freeland
 
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked DataDo the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked DataAdrian Stevenson
 
The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...EDINA, University of Edinburgh
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Artefactual Systems - AtoM
 
IPTC and the Semantic Web: Two Paths and Seven Lessons
IPTC and the Semantic Web: Two Paths and Seven LessonsIPTC and the Semantic Web: Two Paths and Seven Lessons
IPTC and the Semantic Web: Two Paths and Seven LessonsStuart Myles
 
Cloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hub
Cloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hubCloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hub
Cloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hubBjörn Backeberg
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 

Ähnlich wie Implementing Durham E-Theses (20)

Open Data analysis with EOSC-hub services
Open Data analysis with EOSC-hub servicesOpen Data analysis with EOSC-hub services
Open Data analysis with EOSC-hub services
 
Software for data management and exploitation
Software for data management and exploitationSoftware for data management and exploitation
Software for data management and exploitation
 
Clipper dhra 2016
Clipper dhra 2016Clipper dhra 2016
Clipper dhra 2016
 
Fedora Oxford Dec09
Fedora Oxford Dec09Fedora Oxford Dec09
Fedora Oxford Dec09
 
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
 
Open Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LODOpen Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LOD
 
IPTC News Exchange Formats Working Party Autumn 2012
IPTC News Exchange Formats Working Party Autumn 2012IPTC News Exchange Formats Working Party Autumn 2012
IPTC News Exchange Formats Working Party Autumn 2012
 
dotte.ppt
dotte.pptdotte.ppt
dotte.ppt
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShareCollaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
 
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
 
Accessibility, Automation and Metadata
Accessibility, Automation and MetadataAccessibility, Automation and Metadata
Accessibility, Automation and Metadata
 
Unified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AIUnified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AI
 
Approaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic dataApproaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic data
 
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked DataDo the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
 
The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
 
IPTC and the Semantic Web: Two Paths and Seven Lessons
IPTC and the Semantic Web: Two Paths and Seven LessonsIPTC and the Semantic Web: Two Paths and Seven Lessons
IPTC and the Semantic Web: Two Paths and Seven Lessons
 
Cloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hub
Cloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hubCloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hub
Cloud Computing Needs for Earth Observation Data Analysis: EGI and EOSC-hub
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 

Mehr von Repository Fringe

Unlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East LondonUnlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East LondonRepository Fringe
 
Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Repository Fringe
 
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheonOpen Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheonRepository Fringe
 
Repositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNichollRepositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNichollRepository Fringe
 
RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015Repository Fringe
 
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Repository Fringe
 
Jisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela DucaJisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela DucaRepository Fringe
 
IRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo AlcockIRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo AlcockRepository Fringe
 
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick EadieImpact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick EadieRepository Fringe
 
Open Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, ContentmineOpen Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, ContentmineRepository Fringe
 
SHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill HubbardSHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill HubbardRepository Fringe
 
REF compliance - what Jisc is doing
REF compliance - what Jisc is doingREF compliance - what Jisc is doing
REF compliance - what Jisc is doingRepository Fringe
 
Linking Software: citations, roles, references and more
Linking Software: citations, roles, references and moreLinking Software: citations, roles, references and more
Linking Software: citations, roles, references and moreRepository Fringe
 
Linking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel KotarskiLinking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel KotarskiRepository Fringe
 
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...Repository Fringe
 
Latest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of HullLatest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of HullRepository Fringe
 
ArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of EdinburghArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of EdinburghRepository Fringe
 

Mehr von Repository Fringe (20)

Unlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East LondonUnlocking Thesis Data - Stephen Grace, University of East London
Unlocking Thesis Data - Stephen Grace, University of East London
 
Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...
 
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheonOpen Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
 
Repositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNichollRepositories for OA, RDM and Beyond - Rory McNicholl
Repositories for OA, RDM and Beyond - Rory McNicholl
 
RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015RSpace - Rory Macneil at Repository Fringe 2015
RSpace - Rory Macneil at Repository Fringe 2015
 
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...
 
Jisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela DucaJisc on repositories unleashing data - Daniela Duca
Jisc on repositories unleashing data - Daniela Duca
 
IRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo AlcockIRUS-UK at Repository Fringe 2015 - Jo Alcock
IRUS-UK at Repository Fringe 2015 - Jo Alcock
 
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick EadieImpact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
 
Open Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, ContentmineOpen Data and Sharing Science - Graham Steel, Contentmine
Open Data and Sharing Science - Graham Steel, Contentmine
 
SHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill HubbardSHERPA Services breakout session - Bill Hubbard
SHERPA Services breakout session - Bill Hubbard
 
REF compliance - what Jisc is doing
REF compliance - what Jisc is doingREF compliance - what Jisc is doing
REF compliance - what Jisc is doing
 
RCUK - what Jisc is doing
RCUK - what Jisc is doingRCUK - what Jisc is doing
RCUK - what Jisc is doing
 
Linking Software: citations, roles, references and more
Linking Software: citations, roles, references and moreLinking Software: citations, roles, references and more
Linking Software: citations, roles, references and more
 
Jisc Publications Router
Jisc Publications RouterJisc Publications Router
Jisc Publications Router
 
Linking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel KotarskiLinking Research Outputs - Rachel Kotarski
Linking Research Outputs - Rachel Kotarski
 
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
 
Latest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of HullLatest developments in Hydra-land - Chris Awre, University of Hull
Latest developments in Hydra-land - Chris Awre, University of Hull
 
ArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of EdinburghArchivesSpace - Scott Renton, University of Edinburgh
ArchivesSpace - Scott Renton, University of Edinburgh
 

Kürzlich hochgeladen

How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxAnupam32727
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
ARTERIAL BLOOD GAS ANALYSIS........pptx
ARTERIAL BLOOD  GAS ANALYSIS........pptxARTERIAL BLOOD  GAS ANALYSIS........pptx
ARTERIAL BLOOD GAS ANALYSIS........pptxAneriPatwari
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17Celine George
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesVijayaLaxmi84
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6Vanessa Camilleri
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17Celine George
 

Kürzlich hochgeladen (20)

How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
ARTERIAL BLOOD GAS ANALYSIS........pptx
ARTERIAL BLOOD  GAS ANALYSIS........pptxARTERIAL BLOOD  GAS ANALYSIS........pptx
ARTERIAL BLOOD GAS ANALYSIS........pptx
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their uses
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17
 

Implementing Durham E-Theses

  • 1. Implementing Durham E-Theses Presented by Sebastian Palucha #rfringe13 CC BY jitze http://www.flickr.com/photos/jitze1942/3521700792
  • 2. ∂ Durham E-Theses  Initial project spring/summer 2009  First deposit September 2009  ~ 300 research theses per year  Simple deposit, single PDF  EThOS interoperability  EPrints 3.1.3 (born 2009) CC BY didbygraham http://www.flickr.com/photos/didbygraham/5646920685/
  • 3. ∂ Registered: EThOS, Driver, OCL Digital Gateway (2010 spr.) EThOS harvest in operation (2010 sum.) Google Analytics stats (2010 dec.) EThOS digitised theses loaded (2011 sum.) Google Custom Search (aut. 2011) Collaboration with The BL to improve EThOS services (aut. 2011 – spr. 2012) EU/ICO Cookie Law support (2013 sum.) local digitisation project, 10k (2012 spr2 – ) MySQL migrated to UTF-8 (2013 spring) Creative Common Licences introduced (2012 aut.) CC BY AlishaV http://www.flickr.com/photos/alishav/3156574283 Key milestones
  • 4. ∂ Branding: uniform user experience • Issues: browsers, branding changes • Durham University CMS CSS • Eprints 3 CSS
  • 5. ∂ Simplistic single PDF deposit • Details > Upload > Deposit • LDAP integration + user field population • Embargo implemented in first screen CC BY Pink Sherbet Photography http://www.flickr.com/photos/pinksherbet/236299644
  • 6. ∂ Cover pages  Highly customized LaTeX code  Issues with UTF-8 both LaTeX and plugin  Issues with dynamic if/else
  • 7. ∂ Google Analytics: full text downloads • Two steps: 1. PDF download link (core code) 2. special GA profile • URL structure include department codes ?DDD32 • Internal code modification
  • 8. ∂ EThOS interoperability through OAI-PMH harvest • Issues with out of the box plug-in, changes to XML schema needed • uketdterms:qualificationlevel not defined in EPrints data model • Embargo date not included. Plugin assumes embargo on an record level, whereas EP on an document level! • Added department names • Occasional issues with UTF-8 encoding
  • 9. ∂ EThOS download WS • Script for mass download https://github.com/paluchas/ethos-bl groovy EthosDownloadClient.groovy -i 238830 –m download
  • 10. ∂ EThOS avoiding duplication • We store EThOS persistent IDs • We modified /cgi/oai2 script to conditionally exclude ethos records • Modified record can be exposed to EThOS harvest in future
  • 11. ∂ UTF-8 issues Unknown copy/paste issues seen:  OAI/PMH  Cover Pages LaTeX  Abstract pages Solution:  Code modification  Whole MySQL database migration to UTF-8, fortunately double encoding CC BY familymwr http://www.flickr.com/photos/familymwr/5548057120//
  • 12. ∂ Creative Common Licences  Approached by student: specific query about particular CC to be used  A lot of redefinition is code
  • 14. ∂ Better search, DRO integration Google Custom Search with modified search results
  • 15. ∂ Retrospective digitisation project • 10k paper theses being digitised by local company • Mass upload with metadata in XML file and digitised material in PDF files, web and archive version. A lot of metadata and quality issues • Interesting samples of other materials: big prints, DVDs, CDs, cassette tapes, microfilms, small datasets and research software.
  • 16. ∂ EU/ICO Cookies Law CC BY USAG-Humphreys http://www.flickr.com/photos/ 31687107@N07/6206906748
  • 17. ∂ Repository versus real life • Users would like to deposit other than PDF files. • Requested “Dark” storage • Encrypted PDFs • Take down requests, and Web cached content. How far should we liaise with external world • Some students are not aware about consequences of web deposits: 3rd party copyright, sensitive data not embargoed etc. • Disciplinary differences; not only humanities vs. sciences. • External user requesting contact with author or supervisors
  • 18. ∂ Sustainability • Operational: virtualization, operating systems support, database • Customization: Bespoken changes and technology deficit • Support: hard to coordinate across the University departments CC BY Rennett Stowe http://www.flickr.com/photos/tomsaint/4515448425
  • 19. ∂ Future plans  Review process, be paper free, include pass list, extend workflow to exam board  Actively encourage students to use CC licences by demonstrate its benefit  Encourage deposit of key data sets and explore data visualization  Migrate to new repository framework  Integration with Durham University RIS  Google Analytics live stats, integration with IRUS-UK CC BY Boston Public Library http://www.flickr.com/photos/boston_public_library/8902381985/
  • 20. ∂ Repository of the future CC by http://www.flickr.com/photos/keoni101/7069578953 CC BY Keoni Cabral http://www.flickr.com/photos/52193570@N04/7069578953

Hinweis der Redaktion

  1. In the last decade a custom has become established among the tribes of higher education universities – namely to have a repository. At Durham we are very proud from our difference, in pursuit to follow other tribe members we decided to have three different repositories: Durham Research Online, Durham ETheses and Archive and Special Collection. This is the story how we implemented Durham E-Theses.
  2. We start our run with limited peripheral vision. We concentrate on in a short time scale to be ready for a first deposit with beginning of 2009 teaching year. We have a goal to have as simple as possible single PDF file deposit. And we had strong inclination to be complaint with EThOS project. We chose EPrints as having some experience with Durham Research Online. However, due to how this was implemented, DRO couldn't be used to handle theses. We opted for a separate EPrints installation. Durham E-Theses background Timeframe of the work Changes made to EPrints plugin Issues recognised in OAI-PMH metadata exchange Harvesting digitised data from Ethos Download Groovy script developed – code snippets and basic use example : Action: Should I develop GUI/GITHUB version En Mass load and metadata correction Changes to OAI-PMH to filter duplication Further digitised material upload - Comments about ETHOS harvesting procedures when metadata elements exchange (e.g. embargo introduced due to take down procedures)
  3. Over the time, however, we changed or corrected our initial expectation. The live brought to us new ideas and theirs implementation were challenging as running up a step hill. But certainly very enjoyable once achieved.
  4. One of the very first task was to branding our new repository. This might seem just merging two CSS style sheets, one from EPrints and one from University CMS. Unfortunately, it requires a lot of time trying to avoid some browser issues, most notably IE. Imagine our excitement when recently University modified its branding.
  5. We also scaled down EPritnst data model just to support theses type and removed unnecessary functionality. The goal was to have a very simple deposit process. Our first deposit screen “Details” collects thesis metada. User, year, department are obtained from user database. We redesign embargo functionality to underpin University 5 years model. Metadata: title, abstract, award, full text status - predefined from LDAP, year, department, non required, keywords, Moving embargo to first stage was with the huge coding expenses
  6. Eprints comes with highly customizable LaTeX based cover page. The initial template is rather uninteresting. We added to our cover pages information how to cite thesis as well as use policy. On some occasion we had character encoding problems so we need to modify the plugin and LaTeX template to avoid this issue.
  7. As everybody we love stats. We use Google Analytics for monitoring number of full text downloads. This was implemented in two stages. Core code was modified to produce link which would trigger GA counting when clicked. A special profile was set in GA. The PDF url has department information so it is easy to produce stats against individual department.
  8. It was our aspiration to be a member of EthOS project. We registered our service very early to be harvested by OAI-PMH protocol. We also modified out of the box UKETD-DC plugin to provide additional information most notably thesis embargo. Recently we participate in the EthOS workshop to share our knowledge and experience.
  9. Through the Britihs Library EthOS project more than 1000 thesis has been digitised on demand by users. We would like to upload those PDF file back to our service. We developed simple command line client which talks with EthOS WS API. Subsequently all files has been mass uploaded to Durham E-Theses.
  10. http://etheses.dur.ac.uk/cgi/oai2?verb=ListRecords&from=2012-07-05T22:16:45Z&metadataPrefix=uketd_dc http://etheses.dur.ac.uk/cgi/oai2?verb=ListRecords&from=2012-07-05T22:16:45Z&metadataPrefix=oai_dc – will display Epid:1530 Modification include: - Adding new fields into EPrints data model - eprinds_field.pl, workflow, phrases. - Adding conditional OAI matadata format filter ( metadataPrefix=uketd_dc ) in oai.pl configuration file - Adding metadata format filter support in /cgi/oai2 script In order to protectively avoid record duplication by EthOS reharvesting we extended editorial data model and added EthOS persisted ID and simple control whether the record should be excluded from EthOS /UKETD-DC harvesting process. We have modified OAI-PMH core plugin to filter EThOS records.
  11. One of our major technical concern was related that the fact that occasionally wrong character encoding could introduced in metadat. This look rather unpleasant and can spoil user experience. Recently we adopted a radical solution and we have converted whole internal database to be UTF-8 compliant and we have updated Eprints database connector.
  12. In our default implementation we assumed that copyright of the theses would retain within the user. This was unnecessary conservative approach which did not encouraged our students to explore CC licences. So we were very glad, once approached by students to implement specific CC with particular jurisdiction. We updated wording and CC version to support 3 jurisdiction England & Wales, world and USA.
  13. And than we realized, that once we have a work licensed under CC we need to clearly state this not only on the abstract page but also in automated harvest as well as cover pages.
  14. Durham E-Theses is a younger sister of Durham Research Online service. One of our later goal was to unify those two repositories by providing single search box for the external users. We use Google Custom Search with bespoken search result. Durham E-Theses results are presented under Theses tab.
  15. In the last year we started local project to digitised our stock of 10k paper theses. Some of those theses comes with supplementary material which is also beeing digitised. Similar to the BL we implemented robust take down policy on the author request.
  16. Currently we are in the process to implement the EU cookies law. It is rather murky area with not clear interpretation which cookies are essential thus required by service. With additional google services we fill like running in the night without light and clear course to follow.
  17. We also introduce permanent “dark” storage based on deletion state. We have not anticipate that users would like to deposit encrypted PDFs or multimedia files.