SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
The Aviva Stadium Dublin, 21-22 November 2013
• TERMINALFOUR Site Manager comes with a number of
tools to approach automated migration.
• Even with automated migration, some manual
migration will be required for content not handled by
the import tools.

• TERMINALFOUR endeavour to migrate at a minimum of
80% of the existing content.
• In some cases, custom tools are required to handle
specific site and content structures.
t44u.2013

2
Manual

Auto

• The amount of content to be migrated <1500
• Is content well structured & marked-up correctly?
• Complexity of the original web site
• Is it coming from another CMS (e.g. Red Dot) ?
• Is the HTML source consistent?
• Can we access an XML extract

• Structured Data: Use Web Objects / Data Objects
/Content Syncer / Web Services integration features
Integration • Live code in pages
• Combination of three options above?
t44u.2013

3
1. Access Web Site
6. Manual updates

5. Test / QA

2. Analyse the Data

4. Run the
Migration

t44u.2013

3. Configure the
Migration Tool

4
STEP 1 :
Copy of the website

STEP 2 :

STEP 3 :

Analysis of the
website

Configure the
Migration Tool

The content in
HTML, XML, or a
Database dump.

How to determine
the Site Structure?

Media files including
images, linked
documents, videos,
flash movies etc.

Is there multi-lingual
content to be
migrated?

Map the elements
from the existing
pages layouts to the
new content
templates

What is the
structure of the
content (pages)?

t44u.2013

5
STEP 4 :
Running the
Migration

Defined XML format &
Defined Structure for
Content

STEP 5 :

STEP 6 :

Test/QA

Manual Update

A link checker is run on
the published website
to determine if there
are any broken links or
missing content.

some manual updates
required in order to fix
issues

Site Structure
Assets.

Two types of manual
review are required

t44u.2013

6
1. How much data is to be migrated i.e. pages, sites by TERMINALFOUR?
2. Will the content be exported from the existing CMS in order to migrate it or will
the published site be used? i.e. what format will the website be provided in.
3. Is the website structure the same or different in the new system? If different a
mapping will be required between the old and the new structure.
4. Is there a one to one mapping from the old page layouts to the new page
layouts?
5. Is there multi-lingual content to be migrated?

6. Will there be content that is not currently in the existing site to be migrated? If
so, what format will this be in?
7. Is there mirrored content (same source content appearing in multiple locations
on the site) within the website that needs to be handled during the migration?
This includes portions of pages.
8. Are the pages well structured with markers to identify different components of
the page?
t44u.2013

7
–
–
–
–
–
–
–
–
–
–
–
–
–
t44u.2013

Serena Collage (University of St. Thomas, University of Liverpool)
InterWoven TeamSites (Southern States Coop)
Documentum (Missouri State technical College)
Open Text RedDot (University of the Arts London)
Vignette (OECD)
Microsoft CMS (UNAIDS)
BroadVision (Aer Lingus)
HannonHill Cascade Server
Percussion (NUIG)
SunGard LuminisCMS (University of Huddersfield)
Active Networks IronPoint CMS (University of Fraser Valley, LMU)
DreamWeaver HTML pages (VCU)
Squiz (RMIT Australia, University of Stirling)
8
Case Study: Weitz & Luxenberg
• A targeted list of page ID’s within a data source to be
migrated in bulk from existing custom CMS.
• Create Hierarchy Builder to build parent and child structure
within Site Manager from data source (MS EXCEL).
• HTML code being migrated to be “cleaned” using specified
non-required HTML tags.

• Link Resolver to recurse imported HTML code and check for
links that can be resolved – continual checking.
• Automatic Static/Regex URL Redirect
t44u.2013

9
Case Study: W&L - Database to Database Migration Proposal
Client Produce table of
WebPageID, Level, Section
Name, ContentHTML,
OriginalURL, TemplateID,
MetaDescrip, MetaKeywords

1

2

Table is imported into the
Content Syncer using predefined fields

Content Syncer

W&L
DB

3
Content is imported
into Site Manager
using template /
column mapping

6

5
New section/page information
is used to resolve the links
using a new function to match
previous ID’s with new SM ID’s

Link Resolver

SM
DB

HTML
Cleaner
The HTML is cleaned in
the SM DB without
resolved links

4
Case Study: W&L – Data Source

Data Source can be Excel,
SQL or MySQL
Fields need to follow
exact naming convention
t44u.2013

11
Case Study: W&L – External Content Syncer Handler

Setup Connection
to Data Source

t44u.2013

12
Case Study: W&L – Content Syncer

Ensure Site Creator
Plugin set & test you
can query Database

t44u.2013

13
Case Study: W&L – Content Hierarchy Built & Imported
• Example ‘t44u’
shows section
created and
hierarchy &
content created

t44u.2013

14
Case Study: W&L – HTML Cleaner

• Current interface available now within Site Manager
Specify Section to clean and upload properties file
t44u.2013

15
Case Study: W&L – HTML Cleaner - Options
• Remove tags only:
Parse the HTML using Jsoup, extract the content
enclosed by them and write it out, minus the tags that
are to be removed.
• Remove attributes only:
Only defined attribute in the tag to be removed. The tag
itself will remain within the content.

• Remove tags and content:
Parse the html, find the relevant tag and pull it and the
enclosed content out of the file.

t44u.2013

16
Case Study: W&L – HTML Cleaner – Properties File

• Sample Properties file to keep listed tags
and remove everything else

t44u.2013

17
Case Study: W&L – URL Redirect

Original URL captured
from import & used to
create Static or Regex
URL Rewrite

t44u.2013

18
Case Study: W&L – URL Redirect

Example ISS Static URL
Rewrite Mapping
t44u.2013

19
Case Study: W&L – URL Redirect – The Future ‘V8’

Beta Screen Grab for
V8 – URL Redirect

t44u.2013

20

Weitere ähnliche Inhalte

Was ist angesagt?

Data base connectivity and flex grid in vb
Data base connectivity and flex grid in vbData base connectivity and flex grid in vb
Data base connectivity and flex grid in vb
Amandeep Kaur
 
Introducing asp
Introducing aspIntroducing asp
Introducing asp
aspnet123
 
Web based database application design using vb.net and sql server
Web based database application design using vb.net and sql serverWeb based database application design using vb.net and sql server
Web based database application design using vb.net and sql server
Ammara Arooj
 
Rails
RailsRails
Rails
SHC
 

Was ist angesagt? (20)

Sqllite
SqlliteSqllite
Sqllite
 
Data base connectivity and flex grid in vb
Data base connectivity and flex grid in vbData base connectivity and flex grid in vb
Data base connectivity and flex grid in vb
 
Introducing asp
Introducing aspIntroducing asp
Introducing asp
 
Web based database application design using vb.net and sql server
Web based database application design using vb.net and sql serverWeb based database application design using vb.net and sql server
Web based database application design using vb.net and sql server
 
Web Application SG
Web Application SGWeb Application SG
Web Application SG
 
Enterprise Integration Patterns - Spring way
Enterprise Integration Patterns - Spring wayEnterprise Integration Patterns - Spring way
Enterprise Integration Patterns - Spring way
 
Html
HtmlHtml
Html
 
Controls
ControlsControls
Controls
 
forms builder
forms builderforms builder
forms builder
 
Bn1038 demo pega
Bn1038 demo  pegaBn1038 demo  pega
Bn1038 demo pega
 
A View about ASP .NET and their objectives
A View about ASP .NET and their objectivesA View about ASP .NET and their objectives
A View about ASP .NET and their objectives
 
Lect06 tomcat1
Lect06 tomcat1Lect06 tomcat1
Lect06 tomcat1
 
Rails
RailsRails
Rails
 
Asp.net
Asp.netAsp.net
Asp.net
 
Asp
AspAsp
Asp
 
Oracle 10g Forms Lesson 9
Oracle 10g Forms Lesson 9Oracle 10g Forms Lesson 9
Oracle 10g Forms Lesson 9
 
Oracle Forms Creation
Oracle Forms CreationOracle Forms Creation
Oracle Forms Creation
 
Asp.net By Durgesh Singh
Asp.net By Durgesh SinghAsp.net By Durgesh Singh
Asp.net By Durgesh Singh
 
Forms 6i guide
Forms 6i guideForms 6i guide
Forms 6i guide
 
Oracle 10g Forms Lesson 8
Oracle 10g Forms Lesson 8Oracle 10g Forms Lesson 8
Oracle 10g Forms Lesson 8
 

Andere mochten auch

Andere mochten auch (20)

Selecting a Digital Agency for Web Redesign - Top Tips
Selecting a Digital Agency for Web Redesign - Top Tips Selecting a Digital Agency for Web Redesign - Top Tips
Selecting a Digital Agency for Web Redesign - Top Tips
 
Client Support Update: TERMINALFOUR tforum 2013
Client Support Update: TERMINALFOUR tforum 2013Client Support Update: TERMINALFOUR tforum 2013
Client Support Update: TERMINALFOUR tforum 2013
 
TERMINALFOUR t44u 2012 - Publish and transfer
TERMINALFOUR t44u 2012 - Publish and transferTERMINALFOUR t44u 2012 - Publish and transfer
TERMINALFOUR t44u 2012 - Publish and transfer
 
T44u 2015, imperial college
T44u 2015, imperial collegeT44u 2015, imperial college
T44u 2015, imperial college
 
PSEWEB 2013 - Make it responsive - TERMINALFOUR
PSEWEB 2013 - Make it responsive - TERMINALFOURPSEWEB 2013 - Make it responsive - TERMINALFOUR
PSEWEB 2013 - Make it responsive - TERMINALFOUR
 
University of Portsmouth Library: A practical approach to Responsive Design
University of Portsmouth Library: A practical approach to Responsive Design University of Portsmouth Library: A practical approach to Responsive Design
University of Portsmouth Library: A practical approach to Responsive Design
 
Personalised Portals: The Path to Student Engagement: EVMS- t44u 2014
Personalised Portals: The Path to Student Engagement: EVMS- t44u 2014 Personalised Portals: The Path to Student Engagement: EVMS- t44u 2014
Personalised Portals: The Path to Student Engagement: EVMS- t44u 2014
 
T44u 2015, improving campaign performance
T44u 2015, improving campaign performanceT44u 2015, improving campaign performance
T44u 2015, improving campaign performance
 
Transforming your team & your tools to own the web: Connecticut College
Transforming your team & your tools to own the web: Connecticut College Transforming your team & your tools to own the web: Connecticut College
Transforming your team & your tools to own the web: Connecticut College
 
Social media: Connecting their networks with your website-mStoner: t44u
Social media: Connecting their networks with your website-mStoner: t44uSocial media: Connecting their networks with your website-mStoner: t44u
Social media: Connecting their networks with your website-mStoner: t44u
 
Newsletter integration: TERMINALFOUR t44u 2013
Newsletter integration: TERMINALFOUR t44u 2013Newsletter integration: TERMINALFOUR t44u 2013
Newsletter integration: TERMINALFOUR t44u 2013
 
Building a research portal: Vidatum & NUIG - t44u - TERMINALFOUR
Building a research portal: Vidatum & NUIG - t44u - TERMINALFOURBuilding a research portal: Vidatum & NUIG - t44u - TERMINALFOUR
Building a research portal: Vidatum & NUIG - t44u - TERMINALFOUR
 
PHP Calendar: TERMINALFOUR t44u 2013
PHP Calendar: TERMINALFOUR t44u 2013PHP Calendar: TERMINALFOUR t44u 2013
PHP Calendar: TERMINALFOUR t44u 2013
 
Reimagining SUNY.EDU: TERMINALFOUR tForum 2014
Reimagining SUNY.EDU: TERMINALFOUR tForum 2014Reimagining SUNY.EDU: TERMINALFOUR tForum 2014
Reimagining SUNY.EDU: TERMINALFOUR tForum 2014
 
Muhlenberg College-Social media strategy: TERMINALFOUR tforum 2013
Muhlenberg College-Social media strategy: TERMINALFOUR tforum 2013Muhlenberg College-Social media strategy: TERMINALFOUR tforum 2013
Muhlenberg College-Social media strategy: TERMINALFOUR tforum 2013
 
PHP Access Control: TERMINALFOUR t44u
PHP Access Control: TERMINALFOUR t44uPHP Access Control: TERMINALFOUR t44u
PHP Access Control: TERMINALFOUR t44u
 
Gedling Borough Council: TERMINALFOUR t44u 2013
Gedling Borough Council: TERMINALFOUR t44u 2013 Gedling Borough Council: TERMINALFOUR t44u 2013
Gedling Borough Council: TERMINALFOUR t44u 2013
 
The fundamentals of digital engagement: TERMINALFOUR t44u 2013
The fundamentals of digital engagement: TERMINALFOUR t44u 2013The fundamentals of digital engagement: TERMINALFOUR t44u 2013
The fundamentals of digital engagement: TERMINALFOUR t44u 2013
 
OECD policies & TERMINALFOUR as a policy for innovation:TERMINALFOUR t44u 2013
OECD policies & TERMINALFOUR as a policy for innovation:TERMINALFOUR t44u 2013OECD policies & TERMINALFOUR as a policy for innovation:TERMINALFOUR t44u 2013
OECD policies & TERMINALFOUR as a policy for innovation:TERMINALFOUR t44u 2013
 
Making a case for CWO in higher ed: TERMINALFOUR tForum 2014
Making a case for CWO in higher ed: TERMINALFOUR tForum 2014Making a case for CWO in higher ed: TERMINALFOUR tForum 2014
Making a case for CWO in higher ed: TERMINALFOUR tForum 2014
 

Ähnlich wie Content migration Part 1: TERMINALFOUR t44u 2013

Migrating Very Large Site Collections (SPSDC)
Migrating Very Large Site Collections (SPSDC)Migrating Very Large Site Collections (SPSDC)
Migrating Very Large Site Collections (SPSDC)
kiwiboris
 
2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair Oldfield2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair Oldfield
Emeldi Group
 
Rouky Martin Resume
Rouky Martin ResumeRouky Martin Resume
Rouky Martin Resume
roukym
 
2012.05, Liferay and Emeldi Road Show, Alistair Oldfield
2012.05, Liferay and Emeldi Road Show, Alistair Oldfield2012.05, Liferay and Emeldi Road Show, Alistair Oldfield
2012.05, Liferay and Emeldi Road Show, Alistair Oldfield
Emeldi Group
 
Opinioz_intern
Opinioz_internOpinioz_intern
Opinioz_intern
Sai Ganesh
 
Adopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuiteAdopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuite
AnswerModules
 

Ähnlich wie Content migration Part 1: TERMINALFOUR t44u 2013 (20)

How to Optimize Your Drupal Site with Structured Content
How to Optimize Your Drupal Site with Structured ContentHow to Optimize Your Drupal Site with Structured Content
How to Optimize Your Drupal Site with Structured Content
 
Raybiztech Content Management Approach
Raybiztech Content Management ApproachRaybiztech Content Management Approach
Raybiztech Content Management Approach
 
Sps redmond 2014 deck
Sps redmond 2014 deckSps redmond 2014 deck
Sps redmond 2014 deck
 
Migrating Very Large Site Collections (SPSDC)
Migrating Very Large Site Collections (SPSDC)Migrating Very Large Site Collections (SPSDC)
Migrating Very Large Site Collections (SPSDC)
 
Drupalcampchicago2010.rachel.datamigration.
Drupalcampchicago2010.rachel.datamigration.Drupalcampchicago2010.rachel.datamigration.
Drupalcampchicago2010.rachel.datamigration.
 
SPSNYC17 - The Wall: Overcoming SharePoint’s Site Collection Boundary
SPSNYC17 - The Wall: Overcoming SharePoint’s Site Collection BoundarySPSNYC17 - The Wall: Overcoming SharePoint’s Site Collection Boundary
SPSNYC17 - The Wall: Overcoming SharePoint’s Site Collection Boundary
 
Planning Your Migration to SharePoint Online #SPBiz60
Planning Your Migration to SharePoint Online #SPBiz60Planning Your Migration to SharePoint Online #SPBiz60
Planning Your Migration to SharePoint Online #SPBiz60
 
SPSNE17 - The Wall: Overcoming SharePoint’s Site Collection Boundary
SPSNE17 - The Wall: Overcoming SharePoint’s Site Collection BoundarySPSNE17 - The Wall: Overcoming SharePoint’s Site Collection Boundary
SPSNE17 - The Wall: Overcoming SharePoint’s Site Collection Boundary
 
T44u 2015, content migration
T44u 2015, content migrationT44u 2015, content migration
T44u 2015, content migration
 
2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair Oldfield2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair Oldfield
 
Rouky Martin Resume
Rouky Martin ResumeRouky Martin Resume
Rouky Martin Resume
 
2012.05, Liferay and Emeldi Road Show, Alistair Oldfield
2012.05, Liferay and Emeldi Road Show, Alistair Oldfield2012.05, Liferay and Emeldi Road Show, Alistair Oldfield
2012.05, Liferay and Emeldi Road Show, Alistair Oldfield
 
Resume
ResumeResume
Resume
 
Opinioz_intern
Opinioz_internOpinioz_intern
Opinioz_intern
 
Migrating very large site collections
Migrating very large site collectionsMigrating very large site collections
Migrating very large site collections
 
Spring data presentation
Spring data presentationSpring data presentation
Spring data presentation
 
SPSNYC14 - Must Love Term Sets: The New and Improved Managed Metadata Service...
SPSNYC14 - Must Love Term Sets: The New and Improved Managed Metadata Service...SPSNYC14 - Must Love Term Sets: The New and Improved Managed Metadata Service...
SPSNYC14 - Must Love Term Sets: The New and Improved Managed Metadata Service...
 
WebDev Crash Course
WebDev Crash CourseWebDev Crash Course
WebDev Crash Course
 
Adopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuiteAdopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuite
 
DFW SPUG FastTrack migration service for SharePoint
DFW SPUG FastTrack migration service for SharePointDFW SPUG FastTrack migration service for SharePoint
DFW SPUG FastTrack migration service for SharePoint
 

Mehr von Terminalfour

Mehr von Terminalfour (16)

Let's get personal... Website personalisation and content targeting technique...
Let's get personal... Website personalisation and content targeting technique...Let's get personal... Website personalisation and content targeting technique...
Let's get personal... Website personalisation and content targeting technique...
 
Imperial College London: Creating and managing a flexible site for 1,000 editors
Imperial College London: Creating and managing a flexible site for 1,000 editorsImperial College London: Creating and managing a flexible site for 1,000 editors
Imperial College London: Creating and managing a flexible site for 1,000 editors
 
Roundhouse Digital: Responsive Sites- Not just looking good on a mobile
Roundhouse Digital: Responsive Sites- Not just looking good on a mobileRoundhouse Digital: Responsive Sites- Not just looking good on a mobile
Roundhouse Digital: Responsive Sites- Not just looking good on a mobile
 
T44u 2015, learning and development
T44u 2015, learning and developmentT44u 2015, learning and development
T44u 2015, learning and development
 
T44u 2015, web development best practice
T44u 2015, web development best practiceT44u 2015, web development best practice
T44u 2015, web development best practice
 
T44u 2015, upgrading to 8
T44u 2015, upgrading to 8T44u 2015, upgrading to 8
T44u 2015, upgrading to 8
 
T44u 2015, marketing analytics data driven decision making
T44u 2015, marketing analytics   data driven decision makingT44u 2015, marketing analytics   data driven decision making
T44u 2015, marketing analytics data driven decision making
 
T44u 2015, sample data
T44u 2015, sample dataT44u 2015, sample data
T44u 2015, sample data
 
University of Liverpool: TERMINALFOUR & App Development- Making the Most of y...
University of Liverpool: TERMINALFOUR & App Development- Making the Most of y...University of Liverpool: TERMINALFOUR & App Development- Making the Most of y...
University of Liverpool: TERMINALFOUR & App Development- Making the Most of y...
 
Collaborative Working: University of Sunderland & Roundhouse Digital
Collaborative Working: University of Sunderland & Roundhouse Digital Collaborative Working: University of Sunderland & Roundhouse Digital
Collaborative Working: University of Sunderland & Roundhouse Digital
 
Responsive Retrofit-EVMS: tForum TERMINALFOUR 2014
Responsive Retrofit-EVMS: tForum TERMINALFOUR 2014Responsive Retrofit-EVMS: tForum TERMINALFOUR 2014
Responsive Retrofit-EVMS: tForum TERMINALFOUR 2014
 
The external content syncer: TERMINALFOUR t44u 2013
The external content syncer: TERMINALFOUR t44u 2013The external content syncer: TERMINALFOUR t44u 2013
The external content syncer: TERMINALFOUR t44u 2013
 
Preview and transfer enhancements: TERMINALFOUR t44u 2013
Preview and transfer enhancements: TERMINALFOUR t44u 2013Preview and transfer enhancements: TERMINALFOUR t44u 2013
Preview and transfer enhancements: TERMINALFOUR t44u 2013
 
Approaches to higher education course search: TERMINALFOUR t44u 2013
Approaches to higher education course search: TERMINALFOUR t44u 2013Approaches to higher education course search: TERMINALFOUR t44u 2013
Approaches to higher education course search: TERMINALFOUR t44u 2013
 
Content migration part 2: TERMINALFOUR t44u 2013
Content migration part 2: TERMINALFOUR t44u 2013Content migration part 2: TERMINALFOUR t44u 2013
Content migration part 2: TERMINALFOUR t44u 2013
 
Professional Services Update: TERMINALFOUR tforum 2013
Professional Services Update: TERMINALFOUR tforum 2013Professional Services Update: TERMINALFOUR tforum 2013
Professional Services Update: TERMINALFOUR tforum 2013
 

KĂźrzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

KĂźrzlich hochgeladen (20)

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

Content migration Part 1: TERMINALFOUR t44u 2013

  • 1. The Aviva Stadium Dublin, 21-22 November 2013
  • 2. • TERMINALFOUR Site Manager comes with a number of tools to approach automated migration. • Even with automated migration, some manual migration will be required for content not handled by the import tools. • TERMINALFOUR endeavour to migrate at a minimum of 80% of the existing content. • In some cases, custom tools are required to handle specific site and content structures. t44u.2013 2
  • 3. Manual Auto • The amount of content to be migrated <1500 • Is content well structured & marked-up correctly? • Complexity of the original web site • Is it coming from another CMS (e.g. Red Dot) ? • Is the HTML source consistent? • Can we access an XML extract • Structured Data: Use Web Objects / Data Objects /Content Syncer / Web Services integration features Integration • Live code in pages • Combination of three options above? t44u.2013 3
  • 4. 1. Access Web Site 6. Manual updates 5. Test / QA 2. Analyse the Data 4. Run the Migration t44u.2013 3. Configure the Migration Tool 4
  • 5. STEP 1 : Copy of the website STEP 2 : STEP 3 : Analysis of the website Configure the Migration Tool The content in HTML, XML, or a Database dump. How to determine the Site Structure? Media files including images, linked documents, videos, flash movies etc. Is there multi-lingual content to be migrated? Map the elements from the existing pages layouts to the new content templates What is the structure of the content (pages)? t44u.2013 5
  • 6. STEP 4 : Running the Migration Defined XML format & Defined Structure for Content STEP 5 : STEP 6 : Test/QA Manual Update A link checker is run on the published website to determine if there are any broken links or missing content. some manual updates required in order to fix issues Site Structure Assets. Two types of manual review are required t44u.2013 6
  • 7. 1. How much data is to be migrated i.e. pages, sites by TERMINALFOUR? 2. Will the content be exported from the existing CMS in order to migrate it or will the published site be used? i.e. what format will the website be provided in. 3. Is the website structure the same or different in the new system? If different a mapping will be required between the old and the new structure. 4. Is there a one to one mapping from the old page layouts to the new page layouts? 5. Is there multi-lingual content to be migrated? 6. Will there be content that is not currently in the existing site to be migrated? If so, what format will this be in? 7. Is there mirrored content (same source content appearing in multiple locations on the site) within the website that needs to be handled during the migration? This includes portions of pages. 8. Are the pages well structured with markers to identify different components of the page? t44u.2013 7
  • 8. – – – – – – – – – – – – – t44u.2013 Serena Collage (University of St. Thomas, University of Liverpool) InterWoven TeamSites (Southern States Coop) Documentum (Missouri State technical College) Open Text RedDot (University of the Arts London) Vignette (OECD) Microsoft CMS (UNAIDS) BroadVision (Aer Lingus) HannonHill Cascade Server Percussion (NUIG) SunGard LuminisCMS (University of Huddersfield) Active Networks IronPoint CMS (University of Fraser Valley, LMU) DreamWeaver HTML pages (VCU) Squiz (RMIT Australia, University of Stirling) 8
  • 9. Case Study: Weitz & Luxenberg • A targeted list of page ID’s within a data source to be migrated in bulk from existing custom CMS. • Create Hierarchy Builder to build parent and child structure within Site Manager from data source (MS EXCEL). • HTML code being migrated to be “cleaned” using specified non-required HTML tags. • Link Resolver to recurse imported HTML code and check for links that can be resolved – continual checking. • Automatic Static/Regex URL Redirect t44u.2013 9
  • 10. Case Study: W&L - Database to Database Migration Proposal Client Produce table of WebPageID, Level, Section Name, ContentHTML, OriginalURL, TemplateID, MetaDescrip, MetaKeywords 1 2 Table is imported into the Content Syncer using predefined fields Content Syncer W&L DB 3 Content is imported into Site Manager using template / column mapping 6 5 New section/page information is used to resolve the links using a new function to match previous ID’s with new SM ID’s Link Resolver SM DB HTML Cleaner The HTML is cleaned in the SM DB without resolved links 4
  • 11. Case Study: W&L – Data Source Data Source can be Excel, SQL or MySQL Fields need to follow exact naming convention t44u.2013 11
  • 12. Case Study: W&L – External Content Syncer Handler Setup Connection to Data Source t44u.2013 12
  • 13. Case Study: W&L – Content Syncer Ensure Site Creator Plugin set & test you can query Database t44u.2013 13
  • 14. Case Study: W&L – Content Hierarchy Built & Imported • Example ‘t44u’ shows section created and hierarchy & content created t44u.2013 14
  • 15. Case Study: W&L – HTML Cleaner • Current interface available now within Site Manager Specify Section to clean and upload properties file t44u.2013 15
  • 16. Case Study: W&L – HTML Cleaner - Options • Remove tags only: Parse the HTML using Jsoup, extract the content enclosed by them and write it out, minus the tags that are to be removed. • Remove attributes only: Only defined attribute in the tag to be removed. The tag itself will remain within the content. • Remove tags and content: Parse the html, find the relevant tag and pull it and the enclosed content out of the file. t44u.2013 16
  • 17. Case Study: W&L – HTML Cleaner – Properties File • Sample Properties file to keep listed tags and remove everything else t44u.2013 17
  • 18. Case Study: W&L – URL Redirect Original URL captured from import & used to create Static or Regex URL Rewrite t44u.2013 18
  • 19. Case Study: W&L – URL Redirect Example ISS Static URL Rewrite Mapping t44u.2013 19
  • 20. Case Study: W&L – URL Redirect – The Future ‘V8’ Beta Screen Grab for V8 – URL Redirect t44u.2013 20