SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Where Will All the Data Go?


        Stephen D. Poe, EDP, CSM, CSPO
                Nautilus Solutions
                                  9 June 2011

Slide 1
Copyright © 2011 Stephen D. Poe
Where Will All the Data Go?
                     Our Agenda
     •   The Problem
     •   Solutions
     •   Technical Questions
     •   Planning Issues




Slide 2
Copyright © 2011 Stephen D. Poe
The Problem




Slide 3
Copyright © 2011 Stephen D. Poe
How Big is the Problem?
     • Overall
          – In 2003, UC Berkley estimated 5 exabytes of new
            data stored on digital drives
               • 1 petabyte = 1,000 terabytes
               • 1 exabyte = 1,000 petabytes
          – In 2008, IDC estimated 281 exabytes of digital
            information was created and replicated globally
               • That’s 45GB for each person on earth
     • Specific examples
          – Internet traffic in March 2010 was estimated at 21
            exabytes
          – Email storage now commonly 25GB per user
          – Individual statement (AFP) used to average
            perhaps 10-15KB per statement
               • Now several MB per statement
                    – Color, more graphics
               • What happens when your online statement includes
                 personalized audio and video?

Slide 4
Copyright © 2011 Stephen D. Poe
How Big is the Problem?
                                  • The number of files is
                                    growing even faster
                                    – Average file size is shrinking
                                       • No longer just large print files
                                       • Emails, IM log, single tweet, QR
                                         request
                                    – Example: storing 1 TB
                                       • 1,000 1GB production files
                                       • 1,000,000,000 1KB email files


Slide 5
Copyright © 2011 Stephen D. Poe
Multi-Channel World
    • You archive your customer correspondence
         – Bills, statements, notices
              • Only 24% of US bank account holders have gone paperless
              • 37% say they will never go paperless (Forrester)




                         • How about new multi-channel messages?
                              – Email, instant messages, mobile, video, voice, Tweets,
                                and blog posts
                                  • Instant messages, Twitter posts and blog posts are not
                                    archived in 80% of the organizations using them.
                         • All may be discoverable
                              – They may need to be stored

Slide 6
Copyright © 2011 Stephen D. Poe
Solutions




Slide 7
Copyright © 2011 Stephen D. Poe
Framing the Archive Issue

        • Our archives must meet:
             – All legal and regulatory requirements
                  • to hold all required electronic documents
                  • for the mandated length(s) of time
                       – in a cost effective manner
                       – with a defensible plan to manage them
                  • Insuring that, when required, we can
                    reproduce the ‘original’
                       – enough to satisfy a judge


Slide 8
Copyright © 2011 Stephen D. Poe
Archival System Components

     • Storage Format(s)
          – Multiple, and growing
     • Archival system
          – Hardware
          – Software
          – Network
     • Retrieval/display software
          – Network
     • Process and procedures
Slide 9
Copyright © 2011 Stephen D. Poe
Archive Drivers




                                  Source: AIIM ECM state of the Industry study 2010




Slide 10
Copyright © 2011 Stephen D. Poe
Archive Projects
                                  Completed
                                  enterprise,
                                                     No plans, 5%
                                     12%
                                                                In next 12
                                                               months, 16%

                    Implementing
                     enterprise,
                        28%
                                                               Departmental
                                                                  , 24%

                                         Across
                                      departments,
                                          15%

                                                Source: AIIM ECM state of the Industry study 2010
Slide 11
Copyright © 2011 Stephen D. Poe
Technical Questions




Slide 12
Copyright © 2011 Stephen D. Poe
What Do You Have Now?

     • ECM, WCM, MCM, repositories of record,
       archives
          – How many silos in-house all ready?
          – Who owns which data?
     • Where should we keep it all?
          – Single repository for all data, all formats?
          – Separate repositories specialized for each?


Slide 13
Copyright © 2011 Stephen D. Poe
Example - Storing Emails




Slide 14
Copyright © 2011 Stephen D. Poe
Storage & Admin & Overhead, Oh My!

    • Storage may be cheap
         – Management and ’-ilities aren’t
    • Metrics to think about
         – $/terabtye continues to fall
              • Perhaps $2000-$3000/TB for near-Pentabyte systems
         – Petabytes/IT Storage Administrator
              • Burdened labor overhead of perhaps $100K per admin
         – And overhead
              • Rent, electricity, cooling, security
                   – What ‘Green Footprint’?


Slide 15
Copyright © 2011 Stephen D. Poe
The Cloud
• Remember ASPs?
     – Review pros and cons
• In-house vs. outsourced
     – Where outsourced?
• Regulatory environment
     – Will this data ever cross a trans-national
       boundary?
• Recent Amazon.com outage
     –   4 days down – 98.9% annual up time
     –   What are the SLAs?
     –   What are the penalties?
     –   But could you do better in-house?
• Corporate level of risk
     – To allow corporate data to be held off-site
     – But is it any safer in-house?

Slide 16
Copyright © 2011 Stephen D. Poe
Legal

     • Compliance with rules and regulations
          – Especially with evolving regulations
          – Joint legal/IT taskforce to keep up with changes?
     • International considerations
          – EU privacy rules considerably tighter
          – Conflicts
               • Limited or no sharing of data across borders
               • US discovery laws vs. EU privacy directives

Slide 17
Copyright © 2011 Stephen D. Poe
Preserving Your Data

     • How long do you need to archive
          – Legal and regulatory requirements
               • 7 years – 100 years
     • Average lifespan of a format & reader software
          – Perhaps 2-3 major OS upgrades
     • Look at PDF/A for possible format
          – ISO standard for very long term archive & retrieval
          – Good for some (but not all) documents



Slide 18
Copyright © 2011 Stephen D. Poe
Finding Your Data

     • Key indices
          – Good enough in the past
               • For legacy applications on older data
     • Structured taxonomies
          – If you develop the taxonomies before designing the archive
     • The New Search
          – Full text search is a goal
          – What does that mean against several Pentabytes of data?
     • Metadata
          – Exceptionally valuable
          – Usually exceptionally expensive, especially to retrofit

Slide 19
Copyright © 2011 Stephen D. Poe
Planning Issues




Slide 20
Copyright © 2011 Stephen D. Poe
The ‘-ilities

     • The ‘-ilities
          – Usability, reliability, maintainability, scalability,
            availability, extensibility, security, portability
               • Difference between a system and a success
          – Requires long term commitment to people,
            process, and standards
          – Set standards, define metrics, monitor and fix
            issues as they arise


Slide 21
Copyright © 2011 Stephen D. Poe
Archive Planning

     • Detailed knowledge of what is to be archived
          –   Current & future production processes
          –   Legacy data and documents
          –   Current multi-channel and social media
          –   Future data and documents?
     • Detailed knowledge of how it will be used
          – By whom
          – On what platform(s)?
          – For what purposes
Slide 22
Copyright © 2011 Stephen D. Poe
Archive Planning

     • Archive system design
          – Implementation
          – Maintenance & upgrades
          – Be flexible – things will change
     • Corporate processes and procedures
          – Satisfy the –’ilities
          – Continue to meet the business goals
          – Plan for regular review and transitions

Slide 23
Copyright © 2011 Stephen D. Poe
Archive Planning – A Checklist
     • Develop the business plan
          – Business goals, business case, costs, funding, project management
     • Technology review
          – Time estimates, requirement gathering, analyze, plan, get consensus
     • Develop policies and process
          – Define processes, people, standards, tools, technologies, metrics
     • Develop Project Plan
          – A PM is a good idea
     • Gap Analysis and build underlying foundation
          – Environment, platforms, skill sets, enterprise architecture
     • Develop plan details
          – Implement, test, modify
     • Maintenance
Slide 24
Copyright © 2011 Stephen D. Poe
For More Information
                       Stephen D. Poe, EDP
                         Nautilus Solutions
                         +1.214.532.0443
                    sdpoe@nautilussolutions.com
Slide 25
Copyright © 2011 Stephen D. Poe                   25

Weitere ähnliche Inhalte

Ähnlich wie Data Archive Considerations for Customer Communication Management

Introduction to Digital Preservation
Introduction to Digital PreservationIntroduction to Digital Preservation
Introduction to Digital PreservationBill LeFurgy
 
Anthony Joseph
Anthony JosephAnthony Joseph
Anthony JosephEduserv
 
Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...
Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...
Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...Project Controls Expo
 
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...Vladimir Bacvanski, PhD
 
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...DATAVERSITY
 
Perspectives on digitization of music
Perspectives on digitization of musicPerspectives on digitization of music
Perspectives on digitization of musicOle Bisbjerg
 
Hadoop as a Data Hub
Hadoop as a Data HubHadoop as a Data Hub
Hadoop as a Data HubDianna Doan
 
Spreadmart To Data Mart BISIG Presentation
Spreadmart To Data Mart BISIG PresentationSpreadmart To Data Mart BISIG Presentation
Spreadmart To Data Mart BISIG PresentationDan English
 
Big Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesBig Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesRob Winters
 
Searching patents – a brief introduction
Searching patents – a brief introductionSearching patents – a brief introduction
Searching patents – a brief introductionBjörn Jürgens
 
GWAVACon - Files Matters (English)
GWAVACon - Files Matters (English)GWAVACon - Files Matters (English)
GWAVACon - Files Matters (English)GWAVA
 
Enterprise Content Management 101 for Financial Services
Enterprise Content Management 101 for Financial ServicesEnterprise Content Management 101 for Financial Services
Enterprise Content Management 101 for Financial ServicesAlfresco Software
 

Ähnlich wie Data Archive Considerations for Customer Communication Management (20)

Data Mining & Engineering
Data Mining & EngineeringData Mining & Engineering
Data Mining & Engineering
 
Software sustainability - Patrick Aerts
Software sustainability - Patrick AertsSoftware sustainability - Patrick Aerts
Software sustainability - Patrick Aerts
 
Introduction to Digital Preservation
Introduction to Digital PreservationIntroduction to Digital Preservation
Introduction to Digital Preservation
 
Anthony Joseph
Anthony JosephAnthony Joseph
Anthony Joseph
 
Andrew waugh
Andrew waughAndrew waugh
Andrew waugh
 
Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...
Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...
Project Controls Expo 18th Nov 2014 - Introduction and key note presentation ...
 
Big Data a big deal?
Big Data a big deal?Big Data a big deal?
Big Data a big deal?
 
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
 
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
 
Perspectives on digitization of music
Perspectives on digitization of musicPerspectives on digitization of music
Perspectives on digitization of music
 
Hadoop as a Data Hub
Hadoop as a Data HubHadoop as a Data Hub
Hadoop as a Data Hub
 
Spreadmart To Data Mart BISIG Presentation
Spreadmart To Data Mart BISIG PresentationSpreadmart To Data Mart BISIG Presentation
Spreadmart To Data Mart BISIG Presentation
 
Big Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesBig Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil Games
 
Andrew Waugh presentation
Andrew Waugh   presentationAndrew Waugh   presentation
Andrew Waugh presentation
 
Searching patents – a brief introduction
Searching patents – a brief introductionSearching patents – a brief introduction
Searching patents – a brief introduction
 
GWAVACon - Files Matters (English)
GWAVACon - Files Matters (English)GWAVACon - Files Matters (English)
GWAVACon - Files Matters (English)
 
Big data intro.pptx
Big data intro.pptxBig data intro.pptx
Big data intro.pptx
 
Enterprise Content Management 101 for Financial Services
Enterprise Content Management 101 for Financial ServicesEnterprise Content Management 101 for Financial Services
Enterprise Content Management 101 for Financial Services
 
Big Data
Big DataBig Data
Big Data
 
Proact story on Archiving
Proact story on ArchivingProact story on Archiving
Proact story on Archiving
 

Mehr von Stephen D. Poe, SPC4, CSM, CSPO, PMC, EDP

Mehr von Stephen D. Poe, SPC4, CSM, CSPO, PMC, EDP (7)

Steampunk roots and branches stephen poe teslacon 2014
Steampunk roots and branches stephen poe teslacon 2014Steampunk roots and branches stephen poe teslacon 2014
Steampunk roots and branches stephen poe teslacon 2014
 
To Cloud or Not to Cloud for Transaction Document Production
To Cloud or Not to Cloud for Transaction Document ProductionTo Cloud or Not to Cloud for Transaction Document Production
To Cloud or Not to Cloud for Transaction Document Production
 
If You Can't Measure It, You Can't Fix It: Metrics as a Component of Customer...
If You Can't Measure It, You Can't Fix It: Metrics as a Component of Customer...If You Can't Measure It, You Can't Fix It: Metrics as a Component of Customer...
If You Can't Measure It, You Can't Fix It: Metrics as a Component of Customer...
 
This Is Not Your Fathers ADF
This Is Not Your Fathers ADFThis Is Not Your Fathers ADF
This Is Not Your Fathers ADF
 
Electronic Envelope - 2009 Document Strategy Forum
Electronic Envelope - 2009 Document Strategy ForumElectronic Envelope - 2009 Document Strategy Forum
Electronic Envelope - 2009 Document Strategy Forum
 
PDF/A: An Introduction to the PDF ISO Standard for Long Term Document Archive
PDF/A: An Introduction to the PDF ISO Standard for Long Term Document ArchivePDF/A: An Introduction to the PDF ISO Standard for Long Term Document Archive
PDF/A: An Introduction to the PDF ISO Standard for Long Term Document Archive
 
Document Reengineering Introduction
Document Reengineering IntroductionDocument Reengineering Introduction
Document Reengineering Introduction
 

Kürzlich hochgeladen

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Kürzlich hochgeladen (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

Data Archive Considerations for Customer Communication Management

  • 1. Where Will All the Data Go? Stephen D. Poe, EDP, CSM, CSPO Nautilus Solutions 9 June 2011 Slide 1 Copyright © 2011 Stephen D. Poe
  • 2. Where Will All the Data Go? Our Agenda • The Problem • Solutions • Technical Questions • Planning Issues Slide 2 Copyright © 2011 Stephen D. Poe
  • 3. The Problem Slide 3 Copyright © 2011 Stephen D. Poe
  • 4. How Big is the Problem? • Overall – In 2003, UC Berkley estimated 5 exabytes of new data stored on digital drives • 1 petabyte = 1,000 terabytes • 1 exabyte = 1,000 petabytes – In 2008, IDC estimated 281 exabytes of digital information was created and replicated globally • That’s 45GB for each person on earth • Specific examples – Internet traffic in March 2010 was estimated at 21 exabytes – Email storage now commonly 25GB per user – Individual statement (AFP) used to average perhaps 10-15KB per statement • Now several MB per statement – Color, more graphics • What happens when your online statement includes personalized audio and video? Slide 4 Copyright © 2011 Stephen D. Poe
  • 5. How Big is the Problem? • The number of files is growing even faster – Average file size is shrinking • No longer just large print files • Emails, IM log, single tweet, QR request – Example: storing 1 TB • 1,000 1GB production files • 1,000,000,000 1KB email files Slide 5 Copyright © 2011 Stephen D. Poe
  • 6. Multi-Channel World • You archive your customer correspondence – Bills, statements, notices • Only 24% of US bank account holders have gone paperless • 37% say they will never go paperless (Forrester) • How about new multi-channel messages? – Email, instant messages, mobile, video, voice, Tweets, and blog posts • Instant messages, Twitter posts and blog posts are not archived in 80% of the organizations using them. • All may be discoverable – They may need to be stored Slide 6 Copyright © 2011 Stephen D. Poe
  • 7. Solutions Slide 7 Copyright © 2011 Stephen D. Poe
  • 8. Framing the Archive Issue • Our archives must meet: – All legal and regulatory requirements • to hold all required electronic documents • for the mandated length(s) of time – in a cost effective manner – with a defensible plan to manage them • Insuring that, when required, we can reproduce the ‘original’ – enough to satisfy a judge Slide 8 Copyright © 2011 Stephen D. Poe
  • 9. Archival System Components • Storage Format(s) – Multiple, and growing • Archival system – Hardware – Software – Network • Retrieval/display software – Network • Process and procedures Slide 9 Copyright © 2011 Stephen D. Poe
  • 10. Archive Drivers Source: AIIM ECM state of the Industry study 2010 Slide 10 Copyright © 2011 Stephen D. Poe
  • 11. Archive Projects Completed enterprise, No plans, 5% 12% In next 12 months, 16% Implementing enterprise, 28% Departmental , 24% Across departments, 15% Source: AIIM ECM state of the Industry study 2010 Slide 11 Copyright © 2011 Stephen D. Poe
  • 12. Technical Questions Slide 12 Copyright © 2011 Stephen D. Poe
  • 13. What Do You Have Now? • ECM, WCM, MCM, repositories of record, archives – How many silos in-house all ready? – Who owns which data? • Where should we keep it all? – Single repository for all data, all formats? – Separate repositories specialized for each? Slide 13 Copyright © 2011 Stephen D. Poe
  • 14. Example - Storing Emails Slide 14 Copyright © 2011 Stephen D. Poe
  • 15. Storage & Admin & Overhead, Oh My! • Storage may be cheap – Management and ’-ilities aren’t • Metrics to think about – $/terabtye continues to fall • Perhaps $2000-$3000/TB for near-Pentabyte systems – Petabytes/IT Storage Administrator • Burdened labor overhead of perhaps $100K per admin – And overhead • Rent, electricity, cooling, security – What ‘Green Footprint’? Slide 15 Copyright © 2011 Stephen D. Poe
  • 16. The Cloud • Remember ASPs? – Review pros and cons • In-house vs. outsourced – Where outsourced? • Regulatory environment – Will this data ever cross a trans-national boundary? • Recent Amazon.com outage – 4 days down – 98.9% annual up time – What are the SLAs? – What are the penalties? – But could you do better in-house? • Corporate level of risk – To allow corporate data to be held off-site – But is it any safer in-house? Slide 16 Copyright © 2011 Stephen D. Poe
  • 17. Legal • Compliance with rules and regulations – Especially with evolving regulations – Joint legal/IT taskforce to keep up with changes? • International considerations – EU privacy rules considerably tighter – Conflicts • Limited or no sharing of data across borders • US discovery laws vs. EU privacy directives Slide 17 Copyright © 2011 Stephen D. Poe
  • 18. Preserving Your Data • How long do you need to archive – Legal and regulatory requirements • 7 years – 100 years • Average lifespan of a format & reader software – Perhaps 2-3 major OS upgrades • Look at PDF/A for possible format – ISO standard for very long term archive & retrieval – Good for some (but not all) documents Slide 18 Copyright © 2011 Stephen D. Poe
  • 19. Finding Your Data • Key indices – Good enough in the past • For legacy applications on older data • Structured taxonomies – If you develop the taxonomies before designing the archive • The New Search – Full text search is a goal – What does that mean against several Pentabytes of data? • Metadata – Exceptionally valuable – Usually exceptionally expensive, especially to retrofit Slide 19 Copyright © 2011 Stephen D. Poe
  • 20. Planning Issues Slide 20 Copyright © 2011 Stephen D. Poe
  • 21. The ‘-ilities • The ‘-ilities – Usability, reliability, maintainability, scalability, availability, extensibility, security, portability • Difference between a system and a success – Requires long term commitment to people, process, and standards – Set standards, define metrics, monitor and fix issues as they arise Slide 21 Copyright © 2011 Stephen D. Poe
  • 22. Archive Planning • Detailed knowledge of what is to be archived – Current & future production processes – Legacy data and documents – Current multi-channel and social media – Future data and documents? • Detailed knowledge of how it will be used – By whom – On what platform(s)? – For what purposes Slide 22 Copyright © 2011 Stephen D. Poe
  • 23. Archive Planning • Archive system design – Implementation – Maintenance & upgrades – Be flexible – things will change • Corporate processes and procedures – Satisfy the –’ilities – Continue to meet the business goals – Plan for regular review and transitions Slide 23 Copyright © 2011 Stephen D. Poe
  • 24. Archive Planning – A Checklist • Develop the business plan – Business goals, business case, costs, funding, project management • Technology review – Time estimates, requirement gathering, analyze, plan, get consensus • Develop policies and process – Define processes, people, standards, tools, technologies, metrics • Develop Project Plan – A PM is a good idea • Gap Analysis and build underlying foundation – Environment, platforms, skill sets, enterprise architecture • Develop plan details – Implement, test, modify • Maintenance Slide 24 Copyright © 2011 Stephen D. Poe
  • 25. For More Information Stephen D. Poe, EDP Nautilus Solutions +1.214.532.0443 sdpoe@nautilussolutions.com Slide 25 Copyright © 2011 Stephen D. Poe 25