SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Downloaden Sie, um offline zu lesen
Los Angeles | London | New Delhi
Singapore | Washington DC
May 28, 2014
SAGE Online Content Repository
Six Years with RSuite
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Keith Lawrenz
Senior Business Analyst & Content
Systems Supervisor,
Publishing Technologies,
SAGE Publications
• 8 years with SAGE
• 23 years with Kinko’s
• Electrical and Computer Engineer
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
SAGE Publications
● Independent, global, scholarly publisher
● Books, journals, reference, primary sources
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Back to 2007…
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Platforms…
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Content management…
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
PDF Conversion
to XML
Load to Staging Editorial Review Edits
~ 45 days
2.2 deliveries
~ $10 / article
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
Ingest print-ready
article PDFS
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
Deliver to encoding
vendor
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
Ingest xml encoded
issue
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
Deliver to hosting
platform
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
Support editorial
approval process
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
Track go-live on
hosting platform
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
XML
Conversion
Vendor(Jouve)
OnlineContent
Editor
Content
Recepients
SOCR
Journal
Production
Unit of Content is
a Journal Issue
Start
FTP (UK) or
NFS (US)
Zip Final
Print-Ready
PDFs
Ingest
Unencoded
Issue
Store in
Repository
Deliver
Unencoded
Issue
FTP
Create
SAGEMeta
XML
Nomalize
PDF Files
Zip Issue
FTP Store in
Repository
Deliver HW
Issue
Ingest
Encoded
Issue
FTP –
HighWire
Express
Process
Issue for
Hosting
Quality
Check
Issue
Changes? Edit Articles
Approve
Issue
Deliver Full
Issue
Deliver
PubMed
Abstract
XML
FTP and/or
NFS sites
Online Preview Issue Online
Issue
Online?
End
Deliver
XML Issue
End
Yes
Yes
OK to Host?
Yes
Deliver to additional
recepients
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Learnings
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Learnings
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
2011 we released SOCR-2
Ingest Store Deliver
Enforce quality
Normalize content
Enforce quality
Version control
Transform (PDF, XML, & Images)
Package
Deliver (and track)
Analytics
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Capabilities
Store
Transform
DeliverEnrich
Ingest
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
SOCR-Journals
Ingest
• Issues, OnlineFirst, Continuous Publication,
Launch & Archive
• Over 200 quality checks for a current article
Deliver
• 47 delivery recipients
• 13 delivery formats in use
• Batch delivery for launch content
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
-
200,000
400,000
600,000
800,000
1,000,000
1,200,000
1,400,000
Articles
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Load to Staging Editorial Review Edits
45 days down to ~ -5 days
2.2 down to 1.2 deliveries
$10 / article down to ~ $1
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
SOCR-Books
Ingest
• Books - TEI
• Implemented RelaxNG schema validation
• Over 100 quality checks applied
Deliver
• Platform content
• Discoverability and content licensees
• Interactive DOI Registration
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
0
1000
2000
3000
4000
5000
6000
Books
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Learnings
X
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Learnings
People
Process
Software
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
SOCR-3 – SOCR As A Service
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Learnings
2014 RSuite User Conference Los Angeles | London | New Delhi
Singapore | Washington DC
Questions?
Keith Lawrenz
Sr. Business Analyst & Content Systems Supervisor
Publishing Technologies
SAGE Publications
keith.lawrenz@sagepub.com
twitter: @keithlawrenz

Weitere ähnliche Inhalte

Mehr von SAGE Publishing

Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...
Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...
Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...SAGE Publishing
 
Survey Tips for Librarians
Survey Tips for LibrariansSurvey Tips for Librarians
Survey Tips for LibrariansSAGE Publishing
 
5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...
5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...
5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...SAGE Publishing
 
Battling bannings: Authors discuss intellectual freedom and the freedom to read
Battling bannings: Authors discuss intellectual freedom and the freedom to readBattling bannings: Authors discuss intellectual freedom and the freedom to read
Battling bannings: Authors discuss intellectual freedom and the freedom to readSAGE Publishing
 
2016 Charleston Photo Contest Winners
2016 Charleston Photo Contest Winners2016 Charleston Photo Contest Winners
2016 Charleston Photo Contest WinnersSAGE Publishing
 
From Publication to the Public Expanding your research beyond academia
From Publication to the Public Expanding your research beyond academiaFrom Publication to the Public Expanding your research beyond academia
From Publication to the Public Expanding your research beyond academiaSAGE Publishing
 
Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...
Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...
Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...SAGE Publishing
 
Search, Serendipity & the Researcher Experience
Search, Serendipity & the Researcher ExperienceSearch, Serendipity & the Researcher Experience
Search, Serendipity & the Researcher ExperienceSAGE Publishing
 
Libraries and Local Businesses: Best practices for supporting your entreprene...
Libraries and Local Businesses: Best practices for supporting your entreprene...Libraries and Local Businesses: Best practices for supporting your entreprene...
Libraries and Local Businesses: Best practices for supporting your entreprene...SAGE Publishing
 
Washington, D.C. and Social and Behavioral Science: The Picture for 2016
Washington, D.C. and Social and Behavioral Science: The Picture for 2016 Washington, D.C. and Social and Behavioral Science: The Picture for 2016
Washington, D.C. and Social and Behavioral Science: The Picture for 2016 SAGE Publishing
 
Teaching Educational Research Methods: Making it Real & Relevant for Students
Teaching Educational Research Methods: Making it Real & Relevant for StudentsTeaching Educational Research Methods: Making it Real & Relevant for Students
Teaching Educational Research Methods: Making it Real & Relevant for StudentsSAGE Publishing
 
Finding Common Ground: Bringing Methods and Analysis into Context
Finding Common Ground: Bringing Methods and Analysis into ContextFinding Common Ground: Bringing Methods and Analysis into Context
Finding Common Ground: Bringing Methods and Analysis into ContextSAGE Publishing
 
Charleston Photo Contest Winners
Charleston Photo Contest WinnersCharleston Photo Contest Winners
Charleston Photo Contest WinnersSAGE Publishing
 
How to Protect the Freedom to Read in Your Library
How to Protect the Freedom to Read in Your LibraryHow to Protect the Freedom to Read in Your Library
How to Protect the Freedom to Read in Your LibrarySAGE Publishing
 
Data for the Non-Data Librarian
Data for the Non-Data LibrarianData for the Non-Data Librarian
Data for the Non-Data LibrarianSAGE Publishing
 
2015 Banned Books Photo Contest Entrees
2015 Banned Books Photo Contest Entrees2015 Banned Books Photo Contest Entrees
2015 Banned Books Photo Contest EntreesSAGE Publishing
 
Student and faculty engagement with streaming video: Beyond the hype
Student and faculty engagement with streaming video: Beyond the hypeStudent and faculty engagement with streaming video: Beyond the hype
Student and faculty engagement with streaming video: Beyond the hypeSAGE Publishing
 
Successful Qualitative Research: Don't get too comfortable!
Successful Qualitative Research: Don't get too comfortable!Successful Qualitative Research: Don't get too comfortable!
Successful Qualitative Research: Don't get too comfortable!SAGE Publishing
 

Mehr von SAGE Publishing (20)

Little Green Facts
Little Green FactsLittle Green Facts
Little Green Facts
 
Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...
Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...
Teaching Statistics to People Who (Think They) Hate Statistics: Tips for Over...
 
Survey Tips for Librarians
Survey Tips for LibrariansSurvey Tips for Librarians
Survey Tips for Librarians
 
5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...
5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...
5 Tips for Teaching Introduction to Mass Communication: Engaging Students Liv...
 
Battling bannings: Authors discuss intellectual freedom and the freedom to read
Battling bannings: Authors discuss intellectual freedom and the freedom to readBattling bannings: Authors discuss intellectual freedom and the freedom to read
Battling bannings: Authors discuss intellectual freedom and the freedom to read
 
2016 Charleston Photo Contest Winners
2016 Charleston Photo Contest Winners2016 Charleston Photo Contest Winners
2016 Charleston Photo Contest Winners
 
From Publication to the Public Expanding your research beyond academia
From Publication to the Public Expanding your research beyond academiaFrom Publication to the Public Expanding your research beyond academia
From Publication to the Public Expanding your research beyond academia
 
Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...
Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...
Researching Researchers: Developing Evidence-Based Strategy for Improved Disc...
 
Search, Serendipity & the Researcher Experience
Search, Serendipity & the Researcher ExperienceSearch, Serendipity & the Researcher Experience
Search, Serendipity & the Researcher Experience
 
Libraries and Local Businesses: Best practices for supporting your entreprene...
Libraries and Local Businesses: Best practices for supporting your entreprene...Libraries and Local Businesses: Best practices for supporting your entreprene...
Libraries and Local Businesses: Best practices for supporting your entreprene...
 
Washington, D.C. and Social and Behavioral Science: The Picture for 2016
Washington, D.C. and Social and Behavioral Science: The Picture for 2016 Washington, D.C. and Social and Behavioral Science: The Picture for 2016
Washington, D.C. and Social and Behavioral Science: The Picture for 2016
 
Teaching Educational Research Methods: Making it Real & Relevant for Students
Teaching Educational Research Methods: Making it Real & Relevant for StudentsTeaching Educational Research Methods: Making it Real & Relevant for Students
Teaching Educational Research Methods: Making it Real & Relevant for Students
 
Finding Common Ground: Bringing Methods and Analysis into Context
Finding Common Ground: Bringing Methods and Analysis into ContextFinding Common Ground: Bringing Methods and Analysis into Context
Finding Common Ground: Bringing Methods and Analysis into Context
 
Charleston Photo Contest Winners
Charleston Photo Contest WinnersCharleston Photo Contest Winners
Charleston Photo Contest Winners
 
How to Protect the Freedom to Read in Your Library
How to Protect the Freedom to Read in Your LibraryHow to Protect the Freedom to Read in Your Library
How to Protect the Freedom to Read in Your Library
 
Data for the Non-Data Librarian
Data for the Non-Data LibrarianData for the Non-Data Librarian
Data for the Non-Data Librarian
 
RX for Discovery
RX for DiscoveryRX for Discovery
RX for Discovery
 
2015 Banned Books Photo Contest Entrees
2015 Banned Books Photo Contest Entrees2015 Banned Books Photo Contest Entrees
2015 Banned Books Photo Contest Entrees
 
Student and faculty engagement with streaming video: Beyond the hype
Student and faculty engagement with streaming video: Beyond the hypeStudent and faculty engagement with streaming video: Beyond the hype
Student and faculty engagement with streaming video: Beyond the hype
 
Successful Qualitative Research: Don't get too comfortable!
Successful Qualitative Research: Don't get too comfortable!Successful Qualitative Research: Don't get too comfortable!
Successful Qualitative Research: Don't get too comfortable!
 

Kürzlich hochgeladen

Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are successPratikSingh115843
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...ThinkInnovation
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...ThinkInnovation
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 

Kürzlich hochgeladen (16)

Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are success
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 

SAGE Online Content Repository: Six Years with RSuite

  • 1. Los Angeles | London | New Delhi Singapore | Washington DC May 28, 2014 SAGE Online Content Repository Six Years with RSuite
  • 2. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Keith Lawrenz Senior Business Analyst & Content Systems Supervisor, Publishing Technologies, SAGE Publications • 8 years with SAGE • 23 years with Kinko’s • Electrical and Computer Engineer
  • 3. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC SAGE Publications ● Independent, global, scholarly publisher ● Books, journals, reference, primary sources
  • 4. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Back to 2007…
  • 5. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Platforms…
  • 6. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC
  • 7. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Content management…
  • 8. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC PDF Conversion to XML Load to Staging Editorial Review Edits ~ 45 days 2.2 deliveries ~ $10 / article
  • 9. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC
  • 10. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes
  • 11. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes Ingest print-ready article PDFS
  • 12. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes Deliver to encoding vendor
  • 13. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes Ingest xml encoded issue
  • 14. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes Deliver to hosting platform
  • 15. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes Support editorial approval process
  • 16. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes Track go-live on hosting platform
  • 17. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC XML Conversion Vendor(Jouve) OnlineContent Editor Content Recepients SOCR Journal Production Unit of Content is a Journal Issue Start FTP (UK) or NFS (US) Zip Final Print-Ready PDFs Ingest Unencoded Issue Store in Repository Deliver Unencoded Issue FTP Create SAGEMeta XML Nomalize PDF Files Zip Issue FTP Store in Repository Deliver HW Issue Ingest Encoded Issue FTP – HighWire Express Process Issue for Hosting Quality Check Issue Changes? Edit Articles Approve Issue Deliver Full Issue Deliver PubMed Abstract XML FTP and/or NFS sites Online Preview Issue Online Issue Online? End Deliver XML Issue End Yes Yes OK to Host? Yes Deliver to additional recepients
  • 18. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Learnings
  • 19. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Learnings
  • 20. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC 2011 we released SOCR-2 Ingest Store Deliver Enforce quality Normalize content Enforce quality Version control Transform (PDF, XML, & Images) Package Deliver (and track) Analytics
  • 21. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Capabilities Store Transform DeliverEnrich Ingest
  • 22. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC SOCR-Journals Ingest • Issues, OnlineFirst, Continuous Publication, Launch & Archive • Over 200 quality checks for a current article Deliver • 47 delivery recipients • 13 delivery formats in use • Batch delivery for launch content
  • 23. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC - 200,000 400,000 600,000 800,000 1,000,000 1,200,000 1,400,000 Articles
  • 24. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Load to Staging Editorial Review Edits 45 days down to ~ -5 days 2.2 down to 1.2 deliveries $10 / article down to ~ $1
  • 25. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC
  • 26. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC SOCR-Books Ingest • Books - TEI • Implemented RelaxNG schema validation • Over 100 quality checks applied Deliver • Platform content • Discoverability and content licensees • Interactive DOI Registration
  • 27. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC 0 1000 2000 3000 4000 5000 6000 Books
  • 28. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC
  • 29. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Learnings X
  • 30. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Learnings People Process Software
  • 31. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC SOCR-3 – SOCR As A Service
  • 32. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Learnings
  • 33. 2014 RSuite User Conference Los Angeles | London | New Delhi Singapore | Washington DC Questions? Keith Lawrenz Sr. Business Analyst & Content Systems Supervisor Publishing Technologies SAGE Publications keith.lawrenz@sagepub.com twitter: @keithlawrenz

Hinweis der Redaktion

  1. Ok, so just some quick facts about SAGE, as a foundation for my talk. Sage is a nearly 50-year old global, independent scholarly publisher with editorial offices in the Los Angeles, Washington DC, London, New Delhi and Singapore. We publish around 800 new print and digital books annually – primarily textbooks and a journal publishing program with over 900 titles. We have a number of imprints and subsidiaries {CLK #1} such as Corwin, CQ Press, Adam Matthew and most recently MD Conference Express. {CLK#2} We publish online through an ever-growing list of content platforms.
  2. Thinking back to 2007… The Sopranos ended The final Harry Potter book was published President Bush was facing Barack Obama for the presidency And Barry Bonds beat Hank Aarons home run record (but that didn’t quite work out…)
  3. In 2007… SAGE published around 300 journals OnlineFirst was supported for 2-3 journals Back file fill to Volume 1, Issue 1 was the big idea Publishing Technologies was focused on refreshing the SJO platform to support the transition of our “collections” product line from CSA
  4. SAGE also introduced our first online reference platform, SAGE eReference, with 45 titles
  5. Our content management practices were pretty manual Maybe 10,000 zip files Each containing an issue made up of XML and PDF “pairs” Stored on disks This presented many challenges: Sales and marketing wanted a count of articles to support pricing decisions. A simple ad hoc content delivery took days of effort and consistent, reliable scheduled deliveries were at best hit and miss There was certainly no thought of reconciling our internal archive with our hosted content The content inside the zips stayed inside the zips – with no thought of mining trends in our journal content stream
  6. The online journal workflow took the PDF from the typesetter, sending it out for conversion in India – 10 days after it was released to the printer Those XML and PDF zips were returned from India maybe 10 days later. The content was loaded to HighWire and went through an editorial review process Sometimes we made the edits in-house and other times we sent the content back to India. This was what we call header-footer encoding. For most journals, the online product included title, authors, abstracts and encoded references. A select group of journals were hosted in full html, or full-text. On average – {click – click – click} 45 days from the print distribution 2.2 deliveries to HighWire – so every issue went through multiple editorial views Hard conversion costs were roughly $10/article
  7. In late 2008 we released the first generation SOCR after18 months of development
  8. The initial implementation was solely focused on the online journal issue production process. - Basically the system did one thing – pretty well…
  9. This initial solution was delivered through two ingestion workflows, one status checking workflow and four delivery workflows plus about 3-4 additional worflows to support the full-text processing. The solution was barely performant for the current content production. The journals business wanted to grow – fast – and we had collected around 40 – 50,000 additional zips that represented back content. They needed all of the content in a secure accessible platform. Oh yeah, by the way, the online books business was building - adding titles to SAGE eReference - experimenting with eBooks - thinking about new online book opportunities.
  10. Don’t try to boil the ocean – we did too much and it took too long Minimum viable product
  11. Content architecture matters – the system was slow… because we did not optimize our metadata against our queries MarkLogic and RSuite were “young”
  12. On ingestion - we implemented a common journals workflow – transitioning to NLM - we implemented Schematron for many of the checks that were previously implemented as custom workflow steps On edit we also implemented the Schematron checking – ensuring quality On delivery we implemented custom delivery formats using a configurable delivery specification. We benchmarked system performance using a representative set of 100 journal issues and we achieved a over 400% increase in performance.
  13. A virtuous cycle
  14. The online is up – on average – 5 days prior to the print release. This is a 50 day improvement in making content accessible to the reader. We have reduced average deliveries to HighWire from 2.2 to 1.2 and negotiated a low six figure savings in hard production costs. In addition, the SOCR automation of launch content loading is expected to save SAGE another low-six figures in 2014. Finally, the standardization on NLM, and training our Typesetters to deliver XML directly to SOCR has reduced hard cost per article from around $10 to $1. All of these benefits are not just due to SOCR – but they are enabled by SOCR. The extensible Schematron error checking is the largest single factor
  15. The business continues to move faster – today we have - 1.2 million articles – issues, onlineFirst, continuous publication in 2 varieties to support new Open access publishing models SAGE Research Methods – launched with around 300 titles including books, journals and videos growing to over 600 tiles plus extensive “born digital” content SAGE Knowledge – launched with 2,500 titles and has been adding
  16. We released SOCR books in early 2012. Our TEI encoded book content required us to implement RelaxNG schema validation into Rsuite. Using the same model as we have for journals allowed us to develop this capability in around 3 months concurrently with other projects Most recently, Sal implemented automated DOI registration workflows that communicate directly with the CrossRef REST service to transmit and confirm DOI registrations 14,000 DOIs were automatically deposited in around 10 days when we turned this on.
  17. Just over 5000 titles ingested since April of last year. The hardest thing with ingesting the book content is that the Schematron rules are catching errors we missed when loading the content directly. With 5000 titles we can easily ingest and reload the content on our host platform, thereby improving the content quality delivered to our subscribers.
  18. Probably the most exciting recent enhancement is the implementation of what we call HubXML for encoding book content. Implemented in weeks, not months, we can ingest this new content type and deliver to either TEI for our SAGE Knowledge or Research Methods, or go to ePub (2 and 3) for our eBook distribution through Core Source. SOCR is now nimble and adaptable and for possibly the first time we are almost ahead of the business. The systems was in place to accept HubXML book number 5. We are ready to ramp this capability to all new books by year end. Once again saving near 6 figures in conversion costs to either ePub or TEI.
  19. Early adoption is very hard And adopting others best practices is a much better idea
  20. Great software is not enough. Having the right people on the team is key to success. John and Sal, With SAGE take the lead in working with Elver and Daniel, RSI development resources in Columbia. The critical skill sets are building CMSs, Marklogic Xquery and JAVA. We have a small but very effective team building and maintaining SOCR But another key portion of our recent success happened around a year ago. We focused on solid development methodology. - Build documentation using Confluence - We brought our job ticketing solution in-house with JIRA - We also brought our code repository in house and spent countless hours reconciling our code that had been left behind by many previous developers - Possibly most importantly we built automated code deployment process with Jenkins 2 years ago we avoided deployments, because deploying new code too often broke something, and without a solid code repository, documentation, and repeatable deployment process finding the cause was often a struggle. The remaining piece here that we need to work on is automated system testing. Automated end to end system testing processes has so far eluded us. But we know we want it. The current challenge is to complete a major overhaul and upgrade.
  21. We call the next generation SOCR-3 Upgrading to RSuite 4 – we have a prototype instance running today with test content. The plan is to deliver a fully operational system by year-end. A REST validation service that will deliver all necessary validations checks through Schematron rule sets and related services to deliver Schema, PDF and image validations. One service call will be used for any content type. And this service will be accessible to internal workflows and upstream production systems. The packaging service takes advantage of the power of MarkLogic to deliver content from Rsuite. Image transform service will give us RESTful access to Apogo PDF enhancer to reformat PDFs and ImageMajik to perform image manipulations. Ingestion, Dispatch, and metadata will be thin wrappers around existing RSuite functionality. But this will position us for future enhancements and scaling. Event logging and Reporting will initially be focused on handling transactional logging and reporting.
  22. A CMS is not a project. It must be a log term program that must integrates with most everything you do. To be successful, you need a knowledgeable, dedicated team that can be focused on making your CMS effective.