Xubmit:
A system for managing text
collections
Dazhi Jiao, Tamara Lopez, and Jenn Riley
Indiana University Digital Library Program
DLF 2006 Spring Forum: Austin, Texas
April 10-12, 2006
Overview
• IU Environment
• Case Studies
– EAD: IU Finding Aids
– TEI: The Chymistry of Isaac Newton
• Analysis
– Process
– Problems
• Xubmit
– Technology overview
– Demonstration
• Conclusions

Xubmit :: IU DLP :: Spring 2006
IU Environment
• Digital Library Program provides a central
home for library and faculty digital library
activities
– Consulting
– Content creation
– Metadata creation
– Delivery

• Enormous diversity of projects
• Currently undergoing a complete redesign
of our digital library software infrastructure
Case Study - EAD
IU Finding Aids
• Encoded by multiple repositories at IU
• Encoding done in <oXygen />
• XML template created and maintained by EAD
Working Group
• Completed files delivered to DLP for online
delivery
• Currently delivered via DLXS; soon to be
delivered via XTF
• Tested RLG’s EAD Report Card for improving
encoding workflow
Case Study - TEI
The Chymistry of Isaac Newton
A multi-stage, iterative process:
1. transcription
2. initial encoding
3. XSLT transformation
4. editing and proofing
5. possible modification of encoding practice(s) and/or XSLT
Repeat steps 3-5 until the editors are satisfied.

Analysis - Process
Creation
- encoding
- transcription
- one person
- asynchronous

Integration
- validation
- preservation
- multiple people
- asynchronous

Delivery
- transformation
- access
- multiple systems
- asynchronous

Analysis - Process
[Diagram: Creation, Integration, Delivery]
Analysis - Architecture
• Creation
– IU Archives
– Lilly
– Chymistry
• Xubmit
– Repository
– Schematron
– API
• Delivery
– XTF
– DLXS
– OAI
– WWW
Xubmit :: Implementation
• Submission Flow
– Spring Webflow

• Repository
– JRCS, Platform Independent

• XML Validation / Transformation
– Xerces, Saxon, XML Catalogs

• Web application
– Struts, JSP, JavaScript
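
As a rough illustration of the first check a submitted file might pass through, here is a minimal well-formedness gate. Xubmit itself is built on Xerces and Saxon (Java); this Python sketch is not Xubmit code, uses only the standard library, and checks well-formedness only (no DTD or schema validation). The function name `well_formedness_report` is hypothetical.

```python
import xml.etree.ElementTree as ET


def well_formedness_report(xml_text):
    """Return None if xml_text parses, else a message with line and column."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError as err:
        # ParseError.position is a (line, column) tuple
        line, column = err.position
        return f"not well-formed at line {line}, column {column}: {err}"
    return None


# A file with an unclosed <b> element is rejected with a location
print(well_formedness_report("<a><b></a>"))
# A well-formed file yields no report
print(well_formedness_report("<a><b/></a>"))
```

Returning a message rather than raising keeps the check easy to surface in a web form, where the encoder needs to see where the file broke.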
Xubmit :: Submission Flow
• Flow UML Diagram

Xubmit :: Schematron
Pros
– Simple vocabulary
– Allows for robust reporting
– Uses standards: XPath, XSLT
– Becoming an ISO standard

Cons
– Verbose
– Maintenance
– No support for variables
– Not an ISO standard yet

[Diagram: schema languages (DTD, XML Schema, DSD, Schematron, XCSL) arranged along two axes: grammar based / structure-oriented versus pattern based / constraints-oriented, and definition oriented versus usage oriented.]

Source: Adapted from Lee, D. & Chu, W. (2000), Jacinto, M. et al. (2002)
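
The pattern-based, constraints-oriented idea can be sketched outside Schematron itself. The Python below is not Xubmit code and not a Schematron implementation: `RULES` and `check_rules` are hypothetical names, the three checks on EAD elements are invented for illustration, and the tests are plain Python functions rather than XPath expressions.

```python
import xml.etree.ElementTree as ET

# Hypothetical rules, each a (context path, test, message) triple.
# Real Schematron writes these as <rule context="..."> and <assert test="...">.
RULES = [
    (".//eadheader", lambda e: e.find("eadid") is not None,
     "eadheader should contain an eadid"),
    (".//archdesc", lambda e: e.get("level") is not None,
     "archdesc should carry a level attribute"),
    (".//archdesc", lambda e: e.find("did/unittitle") is not None,
     "archdesc should have a did/unittitle"),
]


def check_rules(xml_text, rules=RULES):
    """Return one message per context node that fails a rule's test."""
    root = ET.fromstring(xml_text)
    failures = []
    for context, test, message in rules:
        for node in root.findall(context):
            if not test(node):
                failures.append(message)
    return failures


sample = """<ead>
  <eadheader/>
  <archdesc level="collection"><did/></archdesc>
</ead>"""

for failure in check_rules(sample):
    print(failure)
```

Each rule carries its own human-readable message, which is the "robust reporting" advantage listed above; the cost is that every constraint has to be spelled out, which matches the "verbose" and "maintenance" entries under Cons.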
Xubmit :: Demonstration

Resources and References
Resources
• JRCS: http://www.suigeneris.org/kb/display/jrcs/home
• Saxon: http://www.saxonica.com
• Schematron: http://www.schematron.com/
• Spring Webflow: http://opensource.atlassian.com/confluence/spring/display/WEBFLOW/Home
• Xerces: http://xerces.apache.org/xerces-j/
• XML Catalog: http://www.oasis-open.org/committees/download.php/14809/xml-catalogs.html

References
Baecker, R. M., Nastos, D., Posner, I. R. & Mawby, K. L. (1993). The user-centered iterative design of collaborative writing software. Proceedings of the SIGCHI conference on Human factors in computing systems. Amsterdam, The Netherlands, 1993. pp. 399-405.
Dillon, A. (1993). How collaborative is collaborative writing? An analysis of the production of two technical reports. In M. Sharples (Ed.), Computer Supported Collaborative Writing. London: Springer-Verlag, pp. 69-86. Retrieved April, 2006 from: http://www.ischool.utexas.edu/~adillon/BookChapters/collaborative%20writing.htm
Jacinto, M., Librelotto, G. R., Ramalho, J. C. L. & Henriques, P. R. (2002). Constraint specification languages: comparing XCSL, Schematron and XML-Schemas. XML Europe, 2002. Retrieved April, 2006 from: http://www.idealliance.org/papers/xmle02/dx_xmle02/papers/03-0302/03-03-02.pdf
Jelliffe, R. (2000). Getting Information into markup: the data model behind the schematron assertion language. A technical whitepaper for GeoTempo, Inc. October 19, 2000. Retrieved April, 2006 from: http://www.sinica.edu.tw/~ricko/schematron.PDF
Kim, H. & Eklundh, K. (2001). Reviewing Practices in Collaborative Writing. Computer Supported Cooperative Work, 10: 247-259.
Lee, D. & Chu, W. (2000). Comparative Analysis of Six XML Schema Languages. ACM SIGMOD Record 29(3), September 2000. Retrieved April, 2006 from: http://cobase-www.cs.ucla.edu/tech-docs/dongwon/sigmod-record-00.html

Xubmit :: IU DLP :: Spring 2006

Weitere ähnliche Inhalte

Ähnlich wie Managing text collections with Xubmit

IGeLU 2014
IGeLU 2014IGeLU 2014
IGeLU 2014jhkrug
 
Efficient & effective data management for research projects : ILRI's Data Ma...
Efficient & effective  data management for research projects : ILRI's Data Ma...Efficient & effective  data management for research projects : ILRI's Data Ma...
Efficient & effective data management for research projects : ILRI's Data Ma...CIARD Movement
 
European Southern Observatory: Implementing Day CQ5 at ESO
European Southern Observatory: Implementing Day CQ5 at ESOEuropean Southern Observatory: Implementing Day CQ5 at ESO
European Southern Observatory: Implementing Day CQ5 at ESODay Software
 
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...ASIS&T
 
Frances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of ChicagoFrances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of ChicagoKuali Days UK
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12ASIS&T
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesOCLC
 
Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...Lars Figenschou
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 
Southwickc lampert lodlam_training
Southwickc lampert lodlam_trainingSouthwickc lampert lodlam_training
Southwickc lampert lodlam_trainingssouthwick
 
uPortal Roadmap Community Call: uPortal 3.2 and Beyond
uPortal Roadmap Community Call: uPortal 3.2 and BeyonduPortal Roadmap Community Call: uPortal 3.2 and Beyond
uPortal Roadmap Community Call: uPortal 3.2 and BeyondJasig uPortal Project
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...datascienceiqss
 
Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunitiesAhmed AlSum
 
Alma, the Cloud & the Evolution of the Library Systems Department - Kevin Kidd
Alma, the Cloud & the Evolution of the Library Systems Department - Kevin KiddAlma, the Cloud & the Evolution of the Library Systems Department - Kevin Kidd
Alma, the Cloud & the Evolution of the Library Systems Department - Kevin KiddKevin Kidd
 
Scilab Challenge@NTU 2014/2015 Project Briefing
Scilab Challenge@NTU 2014/2015 Project BriefingScilab Challenge@NTU 2014/2015 Project Briefing
Scilab Challenge@NTU 2014/2015 Project BriefingTBSS Group
 
An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...Megan O'Donnell
 
Using Archivemedia to preserve research data
Using Archivemedia to preserve research dataUsing Archivemedia to preserve research data
Using Archivemedia to preserve research dataARDC
 
"Data in Context" IG sessions @ RDA 3rd Plenary
"Data in Context" IG sessions @  RDA 3rd Plenary"Data in Context" IG sessions @  RDA 3rd Plenary
"Data in Context" IG sessions @ RDA 3rd PlenaryBrigitte Jörg
 
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...Brigitte Jörg
 

Ähnlich wie Managing text collections with Xubmit (20)

IGeLU 2014
IGeLU 2014IGeLU 2014
IGeLU 2014
 
Efficient & effective data management for research projects : ILRI's Data Ma...
Efficient & effective  data management for research projects : ILRI's Data Ma...Efficient & effective  data management for research projects : ILRI's Data Ma...
Efficient & effective data management for research projects : ILRI's Data Ma...
 
European Southern Observatory: Implementing Day CQ5 at ESO
European Southern Observatory: Implementing Day CQ5 at ESOEuropean Southern Observatory: Implementing Day CQ5 at ESO
European Southern Observatory: Implementing Day CQ5 at ESO
 
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...
 
Frances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of ChicagoFrances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of Chicago
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter Libraries
 
Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Southwickc lampert lodlam_training
Southwickc lampert lodlam_trainingSouthwickc lampert lodlam_training
Southwickc lampert lodlam_training
 
uPortal Roadmap Community Call: uPortal 3.2 and Beyond
uPortal Roadmap Community Call: uPortal 3.2 and BeyonduPortal Roadmap Community Call: uPortal 3.2 and Beyond
uPortal Roadmap Community Call: uPortal 3.2 and Beyond
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
 
Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
 
Measuring impact
Measuring impactMeasuring impact
Measuring impact
 
Alma, the Cloud & the Evolution of the Library Systems Department - Kevin Kidd
Alma, the Cloud & the Evolution of the Library Systems Department - Kevin KiddAlma, the Cloud & the Evolution of the Library Systems Department - Kevin Kidd
Alma, the Cloud & the Evolution of the Library Systems Department - Kevin Kidd
 
Scilab Challenge@NTU 2014/2015 Project Briefing
Scilab Challenge@NTU 2014/2015 Project BriefingScilab Challenge@NTU 2014/2015 Project Briefing
Scilab Challenge@NTU 2014/2015 Project Briefing
 
An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...
 
Using Archivemedia to preserve research data
Using Archivemedia to preserve research dataUsing Archivemedia to preserve research data
Using Archivemedia to preserve research data
 
"Data in Context" IG sessions @ RDA 3rd Plenary
"Data in Context" IG sessions @  RDA 3rd Plenary"Data in Context" IG sessions @  RDA 3rd Plenary
"Data in Context" IG sessions @ RDA 3rd Plenary
 
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
 

Mehr von Jenn Riley

Understanding Metadata: Looking Forward
Understanding Metadata: Looking ForwardUnderstanding Metadata: Looking Forward
Understanding Metadata: Looking ForwardJenn Riley
 
The future of cataloguing? Future cataloguers!
The future of cataloguing? Future cataloguers!The future of cataloguing? Future cataloguers!
The future of cataloguing? Future cataloguers!Jenn Riley
 
Discovery elsewhere
Discovery elsewhereDiscovery elsewhere
Discovery elsewhereJenn Riley
 
Designing the Garden: Getting Grounded in Linked Data
Designing the Garden: Getting Grounded in Linked DataDesigning the Garden: Getting Grounded in Linked Data
Designing the Garden: Getting Grounded in Linked DataJenn Riley
 
Launching metaware.buzz
Launching metaware.buzzLaunching metaware.buzz
Launching metaware.buzzJenn Riley
 
Getting Comfortable with Metadata Reuse
Getting Comfortable with Metadata ReuseGetting Comfortable with Metadata Reuse
Getting Comfortable with Metadata ReuseJenn Riley
 
Handout for Digital Imaging of Photographs
Handout for Digital Imaging of PhotographsHandout for Digital Imaging of Photographs
Handout for Digital Imaging of PhotographsJenn Riley
 
Digital Imaging of Photographs
Digital Imaging of PhotographsDigital Imaging of Photographs
Digital Imaging of PhotographsJenn Riley
 
The Open Archives Initiative and the Sheet Music Consortium
The Open Archives Initiative and the Sheet Music ConsortiumThe Open Archives Initiative and the Sheet Music Consortium
The Open Archives Initiative and the Sheet Music ConsortiumJenn Riley
 
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...Jenn Riley
 
Handout for FRBR; or, How I learned to stop worrying and love the model
Handout for FRBR; or, How I learned to stop worrying and love the modelHandout for FRBR; or, How I learned to stop worrying and love the model
Handout for FRBR; or, How I learned to stop worrying and love the modelJenn Riley
 
Metadata for Brittle Books Page Turner
Metadata for Brittle Books Page TurnerMetadata for Brittle Books Page Turner
Metadata for Brittle Books Page TurnerJenn Riley
 
Digitizing and Delivering Audio and Video
Digitizing and Delivering Audio and VideoDigitizing and Delivering Audio and Video
Digitizing and Delivering Audio and VideoJenn Riley
 
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSHandout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSJenn Riley
 
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSAlphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSJenn Riley
 
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...Jenn Riley
 
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...Jenn Riley
 
Challenges in the Nursery: Linking a Finding Aid with Online Content
Challenges in the Nursery: Linking a Finding Aid with Online ContentChallenges in the Nursery: Linking a Finding Aid with Online Content
Challenges in the Nursery: Linking a Finding Aid with Online ContentJenn Riley
 
Making Interoperability Easier: Creating Shareable Metadata
Making Interoperability Easier: Creating Shareable MetadataMaking Interoperability Easier: Creating Shareable Metadata
Making Interoperability Easier: Creating Shareable MetadataJenn Riley
 

Mehr von Jenn Riley (20)

Understanding Metadata: Looking Forward
Understanding Metadata: Looking ForwardUnderstanding Metadata: Looking Forward
Understanding Metadata: Looking Forward
 
The future of cataloguing? Future cataloguers!
The future of cataloguing? Future cataloguers!The future of cataloguing? Future cataloguers!
The future of cataloguing? Future cataloguers!
 
Discovery elsewhere
Discovery elsewhereDiscovery elsewhere
Discovery elsewhere
 
Designing the Garden: Getting Grounded in Linked Data
Designing the Garden: Getting Grounded in Linked DataDesigning the Garden: Getting Grounded in Linked Data
Designing the Garden: Getting Grounded in Linked Data
 
Launching metaware.buzz
Launching metaware.buzzLaunching metaware.buzz
Launching metaware.buzz
 
Getting Comfortable with Metadata Reuse
Getting Comfortable with Metadata ReuseGetting Comfortable with Metadata Reuse
Getting Comfortable with Metadata Reuse
 
Handout for Digital Imaging of Photographs
Handout for Digital Imaging of PhotographsHandout for Digital Imaging of Photographs
Handout for Digital Imaging of Photographs
 
Digital Imaging of Photographs
Digital Imaging of PhotographsDigital Imaging of Photographs
Digital Imaging of Photographs
 
The Open Archives Initiative and the Sheet Music Consortium
The Open Archives Initiative and the Sheet Music ConsortiumThe Open Archives Initiative and the Sheet Music Consortium
The Open Archives Initiative and the Sheet Music Consortium
 
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
 
Handout for FRBR; or, How I learned to stop worrying and love the model
Handout for FRBR; or, How I learned to stop worrying and love the modelHandout for FRBR; or, How I learned to stop worrying and love the model
Handout for FRBR; or, How I learned to stop worrying and love the model
 
Metadata for Brittle Books Page Turner
Metadata for Brittle Books Page TurnerMetadata for Brittle Books Page Turner
Metadata for Brittle Books Page Turner
 
Digitizing and Delivering Audio and Video
Digitizing and Delivering Audio and VideoDigitizing and Delivering Audio and Video
Digitizing and Delivering Audio and Video
 
Variations2
Variations2Variations2
Variations2
 
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSHandout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
 
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODSAlphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
 
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
 
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
 
Challenges in the Nursery: Linking a Finding Aid with Online Content
Challenges in the Nursery: Linking a Finding Aid with Online ContentChallenges in the Nursery: Linking a Finding Aid with Online Content
Challenges in the Nursery: Linking a Finding Aid with Online Content
 
Making Interoperability Easier: Creating Shareable Metadata
Making Interoperability Easier: Creating Shareable MetadataMaking Interoperability Easier: Creating Shareable Metadata
Making Interoperability Easier: Creating Shareable Metadata
 

Kürzlich hochgeladen

Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Kürzlich hochgeladen (20)

Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...

Managing text collections with Xubmit

  • 1. Xubmit: A system for managing text collections
      Dazhi Jiao, Tamara Lopez, and Jenn Riley
      Indiana University Digital Library Program
      DLF 2006 Spring Forum: Austin, Texas, April 10-12, 2006
  • 2. Overview
      • IU Environment
      • Case Studies
          – EAD: IU Finding Aids
          – TEI: The Chymistry of Isaac Newton
      • Analysis
          – Process
          – Problems
      • Xubmit
          – Technology overview
          – Demonstration
      • Conclusions
  • 3. IU Environment
      • Digital Library Program provides a central home for library and faculty digital library activities
          – Consulting
          – Content creation
          – Metadata creation
          – Delivery
      • Enormous diversity of projects
      • Currently undergoing a complete redesign of our digital library software infrastructure
  • 4. Case Study - EAD: IU Finding Aids
      • Encoded by multiple repositories at IU
      • Encoding done in <oXygen />
      • XML template created and maintained by EAD Working Group
      • Completed files delivered to DLP for online delivery
      • Currently delivered via DLXS; soon to be delivered via XTF
      • Tested RLG’s EAD Report Card for improving encoding workflow
  • 5. Case Study - TEI: The Chymistry of Isaac Newton
      Multi-stage, iterative:
          1. Transcription
          2. Initial encoding
          3. XSLT transformation
          4. Editing and proofing
          5. Possible modification of encoding practice(s) and/or XSLT
      Repeat steps 3-5 until editors are satisfied.
  • 6. Analysis - Process
      • Creation
          – Encoding
          – Transcription
          – One person
          – Asynchronous
      • Integration
          – Validation
          – Preservation
          – Multiple people
          – Asynchronous
      • Delivery
          – Transformation
          – Access
          – Multiple systems
          – Asynchronous
  • 7. Analysis - Process
      [Diagram: Creation, Integration, Delivery]
  • 8. Analysis - Architecture
      • Creation: IU Archives, Lilly, Chymistry
      • Xubmit: Repository, Schematron, API
      • Delivery: XTF, DLXS, OAI, WWW
  • 9. Xubmit :: Implementation
      • Submission flow
          – Spring Webflow
      • Repository
          – JRCS, platform independent
      • XML validation / transformation
          – Xerces, Saxon, XML Catalogs
      • Web application
          – Struts, JSP, JavaScript
  • 10. Xubmit :: Submission Flow
      • Flow UML diagram
  • 11. Xubmit :: Schematron
      • Pros
          – Simple vocabulary
          – Allows for robust reporting
          – Uses standards: XPath, XSLT
          – Becoming an ISO standard
      • Cons
          – Verbose
          – Maintenance
          – No support for variables
          – Not an ISO standard yet
      • [Diagram comparing schema languages. Labels: Definition oriented, Usage oriented; Grammar based, Structure-oriented; Pattern based, Constraints-oriented. Languages shown: DTD, XML Schema, DSD, Schematron, XCSL.]
      Source: Adapted from Lee, D. & Chu, W. (2000), Jacinto, M. et al. (2002)
  • 12. Xubmit :: Demonstration
  • 13. Resources and References
      Resources
          • JRCS: http://www.suigeneris.org/kb/display/jrcs/home
          • Saxon: http://www.saxonica.com
          • Schematron: http://www.schematron.com/
          • Spring Webflow: http://opensource.atlassian.com/confluence/spring/display/WEBFLOW/Home
          • Xerces: http://xerces.apache.org/xerces-j/
          • XML Catalog: http://www.oasis-open.org/committees/download.php/14809/xml-catalogs.html
      References
          • Baecker, R. M., Nastos, D., Posner, I. R. & Mawby, K. L. (1993). The user-centered iterative design of collaborative writing software. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Amsterdam, The Netherlands, 1993. pp. 399-405.
          • Dillon, A. (1993). How collaborative is collaborative writing? An analysis of the production of two technical reports. In M. Sharples (Ed.), Computer Supported Collaborative Writing. London: Springer-Verlag, pp. 69-86. Retrieved April 2006 from: http://www.ischool.utexas.edu/~adillon/BookChapters/collaborative%20writing.htm
          • Jacinto, M., Librelotto, G. R., Ramalho, J. C. L. & Henriques, P. R. (2002). Constraint specification languages: comparing XCSL, Schematron and XML-Schemas. XML Europe 2002. Retrieved April 2006 from: http://www.idealliance.org/papers/xmle02/dx_xmle02/papers/03-0302/03-03-02.pdf
          • Jelliffe, R. (2000). Getting information into markup: the data model behind the Schematron assertion language. A technical whitepaper for GeoTempo, Inc., October 19, 2000. Retrieved April 2006 from: http://www.sinica.edu.tw/~ricko/schematron.PDF
          • Kim, H. & Eklundh, K. (2001). Reviewing practices in collaborative writing. Computer Supported Cooperative Work, 10: 247-259.
          • Lee, D. & Chu, W. (2000). Comparative analysis of six XML schema languages. ACM SIGMOD Record 29(3), September 2000. Retrieved April 2006 from: http://cobase-www.cs.ucla.edu/tech-docs/dongwon/sigmod-record-00.html
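
The validation step named on the Implementation and Schematron slides can be illustrated with a small sketch. Schematron rules pick context nodes with a path, assert a condition on each, and report a readable message when the assertion fails. The Python below imitates that pattern using the limited XPath subset in the standard library; it is not Schematron, and it is not Xubmit's Java implementation. The `Rule` type, the `check` function, and the EAD-like sample rule are all hypothetical, chosen only for illustration.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class Rule:
    """A hypothetical Schematron-like rule (not real Schematron syntax)."""
    context: str   # path selecting the elements the rule applies to
    test: str      # path that must match something under each context element
    message: str   # report text returned when the assertion fails


def check(xml_text, rules):
    """Return a list of failure messages; an empty list means the file passed."""
    # Stage 1: well-formedness. An ill-formed submission is rejected outright.
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as err:
        return [f"not well-formed: {err}"]

    # Stage 2: pattern-based assertions, one message per failing context node.
    failures = []
    for rule in rules:
        for node in root.findall(rule.context):
            if node.find(rule.test) is None:
                failures.append(rule.message)
    return failures


# Assumed example rule: every <c01> component needs a <unittitle> in its <did>.
RULES = [Rule(".//c01", "did/unittitle", "c01 is missing did/unittitle")]

SAMPLE = """<ead><archdesc><dsc>
  <c01><did><unittitle>Correspondence</unittitle></did></c01>
  <c01><did/></c01>
</dsc></archdesc></ead>"""

if __name__ == "__main__":
    for problem in check(SAMPLE, RULES):
        print(problem)
```

Returning messages rather than raising on the first problem reflects the "robust reporting" point on the Schematron slide: an encoder sees every failed assertion in one pass. Real Schematron tests are full XPath expressions evaluated through XSLT, which this sketch does not attempt.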
