SlideShare ist ein Scribd-Unternehmen logo
1 von 44
Downloaden Sie, um offline zu lesen
Determining Requirements for
    Managing Unstructured Data
    Christine Connors
    TriviumRLG LLC
    Information Management Consulting




    March 22, 2012


Thursday, March 22, 12
Overview

    ✤    Triggers

    ✤    Techniques

    ✤    Input

    ✤    Output

    ✤    Scale

    ✤    Mapping requirements to capabilities


Thursday, March 22, 12
Triggers

    ✤    “Didn’t we already do that?”

    ✤    “I found it once. It’s in there somewhere.”

    ✤    “Who knows how to do this?”

    ✤    “We maintain how many document management systems?!?”

    ✤    “Why can’t we use this content to do ... ?”

    ✤    “Which customer wanted that feature?”


Thursday, March 22, 12
As true today...

    ✤    “The search engine is poor to inadequate. I needed to find an appropriations data
         sheet and was returned 366 entries, none which had anything to do with
         appropriations. I spend far too much time looking through the search results for this
         engine to be effective. If I could find this document on the INTERNET I would do
         so, but this is an internal document that is successfully hidden somewhere in the
         archives with the Ark of the Covenant.”
                          Unidentified search and browse survey participant, June, 2003



    ✤    “Who gets more hits: www.amazon.com or
         www.thequaintbookstoredownthestreet.com? Listen up people: Our intranet is a
         wasteland of information. We need to unify - we need to standardize. Information is
         power - but only if it is on my desktop, not hidden away in some server waiting for
         a lucky adventurer to uncover it like some lost continent.”
                 Another unidentified search and browse survey participant, June, 2003

Thursday, March 22, 12
Wonderful
    objects with no
    metadata (context)
    A secret garden


    “Secret Garden” by wonderlane | Flickr |
    CC Attribution 2.0 Generic




Thursday, March 22, 12
Objects with
    can’t-be-bothered
    metadata
    A maze


    “Longleat Maze” by odolphie | Flickr |
    CC Attribution 2.0 Generic




Thursday, March 22, 12
Lots of
    unmarked
    repositories

    Silos


    “Silo” by Plano Light | Flickr | CC
    Attribution 2.0 Generic




Thursday, March 22, 12
Techniques




Thursday, March 22, 12
Sometimes, it’s obvious
    ✤    Environmental scan

          ✤    Do we really need 40 document management systems?

          ➡    We need to reduce the number of systems

                ➡    Improve the finability of the objects contained

    ✤    Budget analysis

          ✤    Projections indicate un-supportable costs of maintaining servers

                ✤    Costs are going down, but not as fast as our rate of acquisition

          ➡    We need to archive or compress the data, intelligently

Thursday, March 22, 12
Here there be dragons...




Thursday, March 22, 12
Standard
    Techniques

    ✤    Surveys

    ✤    Focus groups

    ✤    Observation

    ✤    SWOT

    ✤    Capabilities analysis



Thursday, March 22, 12
User Personas
                ✤    Craft fictional characters
                     based on your key user
                     groups

                         ✤   These archetypes will
                             represent the users of your
                             new system or process

                ✤    Give them attributes and
                     stories

                ✤    Figure out what you need to
                     solve their problems



Thursday, March 22, 12
Knowledge
    Audits

    ✤    Identify what types of
         information are critical for the
         organization

          ✤    Dashboards?

    ✤    Note gaps

    ✤    Note overlaps - redundancies,
         duplication and collaborate



Thursday, March 22, 12
Business
    Process Map
                ✤    Document the steps in
                     standard business
                     processes

                ✤    Identify where
                     unstructured data is used
                     and created

                ✤    Identify critical inputs/
                     outputs

                ✤    Identify breaks and blocks
                     in the system                  Photo by ottonassar | CC Attribution-Share Alike
                                                  http://www.fotopedia.com/items/flickr-3417427945




Thursday, March 22, 12
Social Tagging
    Analysis

                ✤    Analyze the metadata and
                     folksonomy - the organic
                     hierarchies and social tags
                     that have been created ad
                     hoc in the systems

                ✤    Are there synonymous or
                     near-synonymous terms?

                ✤    Are there trends by date or
                     location?


Thursday, March 22, 12
Survey Stakeholders

    ✤    What problem(s) are you solving?

          ✤    What are the pain points in the digital asset management strategy?
               Discovery, re-use, IP management?

    ✤    What are the benefits?

          ✤    New products, increased customer and/or employee satisfaction?

    ✤    Are there restrictions on how it gets done?


Thursday, March 22, 12
Typical Project Structure

    ✤    Analysis of needs & wants

    ✤    Define requirements

    ✤    Commit

    ✤    Resourcing

    ✤    Develop and Deploy

    ✤    Define & Publish Maintenance Processes and Governance Rules


Thursday, March 22, 12
Improve Efficiencies
    Reduce Costs




Thursday, March 22, 12
Input - Lay of the Land

    ✤    Data discovery in an 80k employee multi-national

          ✤    85% of the data “unstructured”

          ✤    90% had no metadata

                ✤    most of that was “bad” metadata

          ✤    13% exact duplicate

          ✤    True age of object hard to determine due to web scripting, server
               migrations, shared access

Thursday, March 22, 12
Input

    ✤    Qualify searches by

          ✤    function, organization, and business

          ✤    date

          ✤    document type (especially web pages)

          ✤    category (tags)

    ✤    Provide sorting of results by date, document type

    ✤    Do not change URLs of pages (users bookmarked)

Thursday, March 22, 12
Improved Efficiencies

    ✤    Delphi Group:

          ✤     Business professionals spend more than 2 hours per day searching for
               information

          ✤     Half of that time – 1 hour per day is wasted by failure to find what
               they seek

          ✤    The single factor most attributed to the large amount of time wasted
               was

          ✤     data changes (location 35%) and

          ✤     bad tools (ineffective search and lack of labeling 28%)

Thursday, March 22, 12
Output

    ✤    Objects must have metadata

          ✤    Title, Author, Subject

    ✤    Repositories should be created for organization/business/function

    ✤    Objects must be stored in one location to reduce duplicates

    ✤    Objects need to be shared to many locations

    ✤    Search & browse UI tools must provide filters for the index created

    ✤    File naming conventions need to be created and enforced

Thursday, March 22, 12
Improved Efficiencies
      Dollars Returned to the Business for Growth (1 hour per year per general employee plus 1 hour per month)




                         $4,000,000




                         $3,000,000




                         $2,000,000




                         $1,000,000




                                $0
                                      1.2k   2k   10.4k   12.3k   4k   11.6k   11.9k   8.3k   13.9k




Thursday, March 22, 12
Reduce Storage Costs
                                                                             Data growth assuming 60% annual growth rate
                                                  $90                                                                                                       3000
                                                                           T1 Only
                                                                           General tiered move

                                                                           Unintelligent Move
                                                                           Policy based Move


                                                  $68                                                                                                        2250
                                                                                                                                                            2228
                         Millions (Annual Cost)




                                                  $45                                                                                                        1500
                                                                                                                                                            $44.3
                                                                                                                                   1393


                                                                                                                                   $27.7
                                                                                                           870
                                                  $23                                                                                                          750
                                                                                   544                     $17.3
                                                          $12.2                    $10.8
                                                          340

                                                   $0                                                                                                            0
                                                     Year 1                  Year 2                  Year 3                   Year 4                  Year 5
                                                        Relative of starting point, growth curves represent storage acquisition cost increases over time.


Thursday, March 22, 12
Identify Opportunities




Thursday, March 22, 12
Input
    ✤    Curate the content for me

    ✤    Allow me to reuse content easily

          ✤    a part, not the whole

          ✤    in a new package

          ✤    without copying/pasting

          ✤    with citations

    ✤    Allow me to annotate content

    ✤    Allow me to refine content based on my needs

Thursday, March 22, 12
Content Re-use and Re-purposing

    ✤    Skills: people do not learn at the same pace nor neatly align to ‘grade’
         levels

    ✤    Product catalog: name and image as a tile on a sale page as well as in a
         detailed product description

    ✤    A taxonomy focused on a subject from introductory to mastery levels of
         understanding can be used to tag content fragments

    ✤    Combined with a taxonomy of skill levels, the content can be aggregated
         into packages consistently addressing the right audience in the right order

    ✤    These fragments can be re-used in a variety of products: multiple skill
         levels, multiple assessments, multiple delivery channels

Thursday, March 22, 12
Output
    ✤    CRM content must be indexed and categorized

    ✤    Objects must have metadata

          ✤    Title, Author, Subject, Skill Level, Process Step

    ✤    Objects need to be shared to many locations

    ✤    Objects must be usable in multiple systems and platforms

    ✤    File naming conventions need to be created and enforced

    ✤    Source data/citations must be available

    ✤    Objects must be written in a re-usable, neutral voice

Thursday, March 22, 12
Define Requirements
                ✤    Functional Requirements

                ✤    User Requirements

                ✤    Administrative Requirements

                         ✤   Authentication/Authorization/Security

                         ✤   Metrics

                         ✤   Documentation requirements

                ✤    Technical Requirements

                         ✤   Back End

                         ✤   Front End

                         ✤   Platform

                         ✤   Interoperability

Thursday, March 22, 12
Authentication, Authorization and
    Security
    ✤    Consider the content collections that will be part of the program.

          ✤    Do you anticipate any of it having restrictions?

                ✤    If so, then what are those restrictions?

          ✤    How will authorized users authenticate and gain access?

                ✤    Will you restrict access by entity type?

                ✤    By rules-based classification?

                ✤    By system access and control policies?

Thursday, March 22, 12
Back End

    ✤    How will you architect the back end to scale effectively?

    ✤    Will it be easily repeated on additional clusters?

    ✤    What OS and software will it need to run?

    ✤    Will it fail over?

    ✤    Can it scale to handle the number of users, documents and entities
         predicted for the anticipated life of the hardware?


Thursday, March 22, 12
Front End
    ✤    How will users interact with the system?

          ✤    Create - Read - Update - Delete as permissioned

          ✤    Search, browse, publish, integrate, migrate and import to and from other
               systems.

    ✤    What tools are needed to support these actions?

    ✤    Should select users be able to perform administrative tasks via a client or browser
         interface?

          ✤    How about the ability to generate reports?

    ✤    What operating system(s) does this interface need to function on?

          ✤    Mobile? Offline?

Thursday, March 22, 12
Interoperability

    ✤    How are you going to package and publish the data?

          ✤    File servers?

                ✤    Cloud?

          ✤    XML? Office suites? Analytics packages? Other tools?

    ✤     What other applications need to use the data created by one of the
         above?

          ✤    DMS/DAM/CMS/CRM

Thursday, March 22, 12
Metadata Management

    ✤    What kinds of information is important to manage - what metadata
         elements?

          ✤    Title, Author, Subject, Process, Skill, Dates, Business, Function...

    ✤    Will you need a taxonomy?

          ✤    Enforce some control on the description of attributes

    ✤    Do you need an external tool or is there a module within your CMS,
         DMS or portal solution that will suffice?


Thursday, March 22, 12
Resourcing


    ✤    Build vs. buy

    ✤    Human resources - staff or contractors needed

    ✤    Technology needs

          ✤    Hardware? Software? Network? Costs?




Thursday, March 22, 12
Define & Publish Processes and
    Rules
    ✤    Maintenance processes

          ✤    Schedule for review and updates

          ✤    Rules for additions, changes, deletions

          ✤    Implementation and publishing process

    ✤    Governance rules

          ✤    Editor? Committee? User input?

          ✤    Standards compliance?

Thursday, March 22, 12
Scale




Thursday, March 22, 12
Scale



    ✤    According to the 2011 Digital Universe study by IDC/Sponsored by
         EMC, by 2020 the world will generate 50x the amount of information
         we have now, on 75x the number of containers, and increase IT
         support for those systems only by a factor of 1.5.




Thursday, March 22, 12
Scale Using Tools


    ✤    Compression technologies

    ✤    Metadata management

    ✤    Indexing, NLP, Search

    ✤    Business rule generation and application

    ✤    Virtualization



Thursday, March 22, 12
Scale Using Processes

    ✤    Standards

    ✤    Metadata governance

          ✤    Schema

          ✤    Taxonomy

                ✤    Subject Matter Experts

          ✤    Editorial Boards

    ✤    Product development

Thursday, March 22, 12
Mapping requirements to
    technologies




Thursday, March 22, 12
What’s available?



    ✤    Latest technologies

    ✤    Information management frameworks

    ✤    Business process best practices




Thursday, March 22, 12
Questions?




Thursday, March 22, 12
Thank you for attending!
    Christine Connors
    TriviumRLG.com
    christine@triviumrlg.com
    http://www.slideshare.net/triviumrlg




Thursday, March 22, 12

Weitere ähnliche Inhalte

Andere mochten auch

Social Media for Assisted Living: Best Friend or Worst Enemy?
Social Media for Assisted Living: Best Friend or Worst Enemy?Social Media for Assisted Living: Best Friend or Worst Enemy?
Social Media for Assisted Living: Best Friend or Worst Enemy?Laura Click
 
How to get published as a PhD student
How to get published as a PhD studentHow to get published as a PhD student
How to get published as a PhD studentDeakinlibraryresearch
 
Bringing Networks to Life Using Visualization for User Engagement
Bringing Networks to Life Using Visualization for User EngagementBringing Networks to Life Using Visualization for User Engagement
Bringing Networks to Life Using Visualization for User EngagementCambridge Intelligence
 
Top Engineering Colleges In Delhi, NCR
Top Engineering Colleges In Delhi, NCRTop Engineering Colleges In Delhi, NCR
Top Engineering Colleges In Delhi, NCRDronacharya
 
ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...
ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...
ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...Daniel Bryant
 
3820 kh2 launch leaflet press v2
3820 kh2 launch leaflet press v23820 kh2 launch leaflet press v2
3820 kh2 launch leaflet press v2Victor Mitov
 
JEE (Mains) and MH-CET Coaching Classes Nagpur
JEE (Mains) and MH-CET Coaching Classes NagpurJEE (Mains) and MH-CET Coaching Classes Nagpur
JEE (Mains) and MH-CET Coaching Classes NagpurSomalwarAcadamy
 
93-42-eec相关标准
93-42-eec相关标准93-42-eec相关标准
93-42-eec相关标准Jacky Gee
 
Personality & the Brain: A new perspective on the INTP
Personality & the Brain: A new perspective on the INTPPersonality & the Brain: A new perspective on the INTP
Personality & the Brain: A new perspective on the INTPAnne Dranitsaris, Ph.D.
 
LEÇON 363 – Cet instant saint, je voudrais Te le donner.
LEÇON 363 – Cet instant saint, je voudrais Te le donner.LEÇON 363 – Cet instant saint, je voudrais Te le donner.
LEÇON 363 – Cet instant saint, je voudrais Te le donner.Pierrot Caron
 

Andere mochten auch (13)

Social Media for Assisted Living: Best Friend or Worst Enemy?
Social Media for Assisted Living: Best Friend or Worst Enemy?Social Media for Assisted Living: Best Friend or Worst Enemy?
Social Media for Assisted Living: Best Friend or Worst Enemy?
 
How to get published as a PhD student
How to get published as a PhD studentHow to get published as a PhD student
How to get published as a PhD student
 
Presentation for AUF
Presentation for AUFPresentation for AUF
Presentation for AUF
 
loca
localoca
loca
 
Bringing Networks to Life Using Visualization for User Engagement
Bringing Networks to Life Using Visualization for User EngagementBringing Networks to Life Using Visualization for User Engagement
Bringing Networks to Life Using Visualization for User Engagement
 
Top Engineering Colleges In Delhi, NCR
Top Engineering Colleges In Delhi, NCRTop Engineering Colleges In Delhi, NCR
Top Engineering Colleges In Delhi, NCR
 
ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...
ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...
ACCU16 "Let's Not Repeat the Mistakes of SOA: 'Micro' Services, Macro Organis...
 
3820 kh2 launch leaflet press v2
3820 kh2 launch leaflet press v23820 kh2 launch leaflet press v2
3820 kh2 launch leaflet press v2
 
JEE (Mains) and MH-CET Coaching Classes Nagpur
JEE (Mains) and MH-CET Coaching Classes NagpurJEE (Mains) and MH-CET Coaching Classes Nagpur
JEE (Mains) and MH-CET Coaching Classes Nagpur
 
989 781-00-draehte boegen
989 781-00-draehte boegen989 781-00-draehte boegen
989 781-00-draehte boegen
 
93-42-eec相关标准
93-42-eec相关标准93-42-eec相关标准
93-42-eec相关标准
 
Personality & the Brain: A new perspective on the INTP
Personality & the Brain: A new perspective on the INTPPersonality & the Brain: A new perspective on the INTP
Personality & the Brain: A new perspective on the INTP
 
LEÇON 363 – Cet instant saint, je voudrais Te le donner.
LEÇON 363 – Cet instant saint, je voudrais Te le donner.LEÇON 363 – Cet instant saint, je voudrais Te le donner.
LEÇON 363 – Cet instant saint, je voudrais Te le donner.
 

Ähnlich wie Requirements for Managing Unstructured Data

Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...
Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...
Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...Patrick Van Renterghem
 
Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...
Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...
Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...Fitzgerald Analytics, Inc.
 
A Morning with MongoDB Barcelona: Use Cases and Roadmap
A Morning with MongoDB Barcelona: Use Cases and RoadmapA Morning with MongoDB Barcelona: Use Cases and Roadmap
A Morning with MongoDB Barcelona: Use Cases and RoadmapMongoDB
 
Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...BoaB Team
 
Designing Tag Navigation
Designing Tag NavigationDesigning Tag Navigation
Designing Tag Navigationadunne
 
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012Gigaom
 
Designing Tag Navigation
Designing Tag NavigationDesigning Tag Navigation
Designing Tag Navigationadunne
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMEGigaom
 
Linked Data Warehouses: A new breed of Business Intelligence
Linked Data Warehouses: A new breed of Business IntelligenceLinked Data Warehouses: A new breed of Business Intelligence
Linked Data Warehouses: A new breed of Business Intelligence3 Round Stones
 
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data MeetupCrowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data MeetupEdward Curry
 
[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...
[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...
[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...Scott Abel
 
Analyzing Your Deliverables: Developing the Optimal Documentation Library
Analyzing Your Deliverables: Developing the Optimal Documentation LibraryAnalyzing Your Deliverables: Developing the Optimal Documentation Library
Analyzing Your Deliverables: Developing the Optimal Documentation LibraryScott Abel
 
Visual Analysis of Massive Web Session Data
Visual Analysis of Massive Web Session DataVisual Analysis of Massive Web Session Data
Visual Analysis of Massive Web Session Databigdataviz_bay
 
Data wharehousing and OLAP
Data wharehousing and OLAPData wharehousing and OLAP
Data wharehousing and OLAPAsma CHERIF
 
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...DATAVERSITY
 
JOSA Data Science Bootcamp Overview
JOSA Data Science Bootcamp OverviewJOSA Data Science Bootcamp Overview
JOSA Data Science Bootcamp OverviewMahmoud Jalajel
 
Applications for Social Networking Strategies in an Agency Context
Applications for Social Networking Strategies in an Agency ContextApplications for Social Networking Strategies in an Agency Context
Applications for Social Networking Strategies in an Agency ContextJohn Brisbin
 
1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdfAyele40
 
Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019mark madsen
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective ApproachIRJET Journal
 

Ähnlich wie Requirements for Managing Unstructured Data (20)

Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...
Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...
Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...
 
Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...
Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...
Data to Dollars™ - Practical Analytics in the Big Data Era Jaime Fitzgerald A...
 
A Morning with MongoDB Barcelona: Use Cases and Roadmap
A Morning with MongoDB Barcelona: Use Cases and RoadmapA Morning with MongoDB Barcelona: Use Cases and Roadmap
A Morning with MongoDB Barcelona: Use Cases and Roadmap
 
Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...
 
Designing Tag Navigation
Designing Tag NavigationDesigning Tag Navigation
Designing Tag Navigation
 
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
 
Designing Tag Navigation
Designing Tag NavigationDesigning Tag Navigation
Designing Tag Navigation
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
 
Linked Data Warehouses: A new breed of Business Intelligence
Linked Data Warehouses: A new breed of Business IntelligenceLinked Data Warehouses: A new breed of Business Intelligence
Linked Data Warehouses: A new breed of Business Intelligence
 
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data MeetupCrowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
 
[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...
[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...
[Workshop] Analyzing Your Deliverables: Developing the Optimal Documentation ...
 
Analyzing Your Deliverables: Developing the Optimal Documentation Library
Analyzing Your Deliverables: Developing the Optimal Documentation LibraryAnalyzing Your Deliverables: Developing the Optimal Documentation Library
Analyzing Your Deliverables: Developing the Optimal Documentation Library
 
Visual Analysis of Massive Web Session Data
Visual Analysis of Massive Web Session DataVisual Analysis of Massive Web Session Data
Visual Analysis of Massive Web Session Data
 
Data wharehousing and OLAP
Data wharehousing and OLAPData wharehousing and OLAP
Data wharehousing and OLAP
 
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
 
JOSA Data Science Bootcamp Overview
JOSA Data Science Bootcamp OverviewJOSA Data Science Bootcamp Overview
JOSA Data Science Bootcamp Overview
 
Applications for Social Networking Strategies in an Agency Context
Applications for Social Networking Strategies in an Agency ContextApplications for Social Networking Strategies in an Agency Context
Applications for Social Networking Strategies in an Agency Context
 
1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf
 
Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
 

Mehr von DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

Mehr von DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Kürzlich hochgeladen

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 

Kürzlich hochgeladen (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Requirements for Managing Unstructured Data

  • 1. Determining Requirements for Managing Unstructured Data Christine Connors TriviumRLG LLC Information Management Consulting March 22, 2012 Thursday, March 22, 12
  • 2. Overview ✤ Triggers ✤ Techniques ✤ Input ✤ Output ✤ Scale ✤ Mapping requirements to capabilities Thursday, March 22, 12
  • 3. Triggers ✤ “Didn’t we already do that?” ✤ “I found it once. It’s in there somewhere.” ✤ “Who knows how to do this?” ✤ “We maintain how many document management systems?!?” ✤ “Why can’t we use this content to do ... ?” ✤ “Which customer wanted that feature?” Thursday, March 22, 12
  • 4. As true today... ✤ “The search engine is poor to inadequate. I needed to find an appropriations data sheet and was returned 366 entries, none which had anything to do with appropriations. I spend far too much time looking through the search results for this engine to be effective. If I could find this document on the INTERNET I would do so, but this is an internal document that is successfully hidden somewhere in the archives with the Ark of the Covenant.” Unidentified search and browse survey participant, June, 2003 ✤ “Who gets more hits: www.amazon.com or www.thequaintbookstoredownthestreet.com? Listen up people: Our intranet is a wasteland of information. We need to unify - we need to standardize. Information is power - but only if it is on my desktop, not hidden away in some server waiting for a lucky adventurer to uncover it like some lost continent.” Another unidentified search and browse survey participant, June, 2003 Thursday, March 22, 12
  • 5. Wonderful objects with no metadata (context) A secret garden “Secret Garden” by wonderlane | Flickr | CC Attribution 2.0 Generic Thursday, March 22, 12
  • 6. Objects with can’t-be-bothered metadata A maze “Longleat Maze” by odolphie | Flickr | CC Attribution 2.0 Generic Thursday, March 22, 12
  • 7. Lots of unmarked repositories Silos “Silo” by Plano Light | Flickr | CC Attribution 2.0 Generic Thursday, March 22, 12
  • 9. Sometimes, it’s obvious ✤ Environmental scan ✤ Do we really need 40 document management systems? ➡ We need to reduce the number of systems ➡ Improve the finability of the objects contained ✤ Budget analysis ✤ Projections indicate un-supportable costs of maintaining servers ✤ Costs are going down, but not as fast as our rate of acquisition ➡ We need to archive or compress the data, intelligently Thursday, March 22, 12
  • 10. Here there be dragons... Thursday, March 22, 12
  • 11. Standard Techniques ✤ Surveys ✤ Focus groups ✤ Observation ✤ SWOT ✤ Capabilities analysis Thursday, March 22, 12
  • 12. User Personas ✤ Craft fictional characters based on your key user groups ✤ These archetypes will represent the users of your new system or process ✤ Give them attributes and stories ✤ Figure out what you need to solve their problems Thursday, March 22, 12
  • 13. Knowledge Audits ✤ Identify what types of information are critical for the organization ✤ Dashboards? ✤ Note gaps ✤ Note overlaps - redundancies, duplication and collaborate Thursday, March 22, 12
  • 14. Business Process Map ✤ Document the steps in standard business processes ✤ Identify where unstructured data is used and created ✤ Identify critical inputs/ outputs ✤ Identify breaks and blocks in the system Photo by ottonassar | CC Attribution-Share Alike http://www.fotopedia.com/items/flickr-3417427945 Thursday, March 22, 12
  • 15. Social Tagging Analysis ✤ Analyze the metadata and folksonomy - the organic hierarchies and social tags that have been created ad hoc in the systems ✤ Are there synonymous or near-synonymous terms? ✤ Are there trends by date or location? Thursday, March 22, 12
  • 16. Survey Stakeholders ✤ What problem(s) are you solving? ✤ What are the pain points in the digital asset management strategy? Discovery, re-use, IP management? ✤ What are the benefits? ✤ New products, increased customer and/or employee satisfaction? ✤ Are there restrictions on how it gets done? Thursday, March 22, 12
  • 17. Typical Project Structure ✤ Analysis of needs & wants ✤ Define requirements ✤ Commit ✤ Resourcing ✤ Develop and Deploy ✤ Define & Publish Maintenance Processes and Governance Rules Thursday, March 22, 12
  • 18. Improve Efficiencies Reduce Costs Thursday, March 22, 12
  • 19. Input - Lay of the Land ✤ Data discovery in an 80k employee multi-national ✤ 85% of the data “unstructured” ✤ 90% had no metadata ✤ most of that was “bad” metadata ✤ 13% exact duplicate ✤ True age of object hard to determine due to web scripting, server migrations, shared access Thursday, March 22, 12
  • 20. Input ✤ Qualify searches by ✤ function, organization, and business ✤ date ✤ document type (especially web pages) ✤ category (tags) ✤ Provide sorting of results by date, document type ✤ Do not change URLs of pages (users bookmarked) Thursday, March 22, 12
  • 21. Improved Efficiencies ✤ Delphi Group: ✤ Business professionals spend more than 2 hours per day searching for information ✤ Half of that time – 1 hour per day is wasted by failure to find what they seek ✤ The single factor most attributed to the large amount of time wasted was ✤ data changes (location 35%) and ✤ bad tools (ineffective search and lack of labeling 28%) Thursday, March 22, 12
  • 22. Output ✤ Objects must have metadata ✤ Title, Author, Subject ✤ Repositories should be created for organization/business/function ✤ Objects must be stored in one location to reduce duplicates ✤ Objects need to be shared to many locations ✤ Search & browse UI tools must provide filters for the index created ✤ File naming conventions need to be created and enforced Thursday, March 22, 12
  • 23. Improved Efficiencies Dollars Returned to the Business for Growth (1 hour per year per general employee plus 1 hour per month) $4,000,000 $3,000,000 $2,000,000 $1,000,000 $0 1.2k 2k 10.4k 12.3k 4k 11.6k 11.9k 8.3k 13.9k Thursday, March 22, 12
  • 24. Reduce Storage Costs Data growth assuming 60% annual growth rate $90 3000 T1 Only General tiered move Unintelligent Move Policy based Move $68 2250 2228 Millions (Annual Cost) $45 1500 $44.3 1393 $27.7 870 $23 750 544 $17.3 $12.2 $10.8 340 $0 0 Year 1 Year 2 Year 3 Year 4 Year 5 Relative of starting point, growth curves represent storage acquisition cost increases over time. Thursday, March 22, 12
  • 26. Input ✤ Curate the content for me ✤ Allow me to reuse content easily ✤ a part, not the whole ✤ in a new package ✤ without copying/pasting ✤ with citations ✤ Allow me to annotate content ✤ Allow me to refine content based on my needs Thursday, March 22, 12
  • 27. Content Re-use and Re-purposing ✤ Skills: people do not learn at the same pace nor neatly align to ‘grade’ levels ✤ Product catalog: name and image as a tile on a sale page as well as in a detailed product description ✤ A taxonomy focused on a subject from introductory to mastery levels of understanding can be used to tag content fragments ✤ Combined with a taxonomy of skill levels, the content can be aggregated into packages consistently addressing the right audience in the right order ✤ These fragments can be re-used in a variety of products: multiple skill levels, multiple assessments, multiple delivery channels Thursday, March 22, 12
  • 28. Output ✤ CRM content must be indexed and categorized ✤ Objects must have metadata ✤ Title, Author, Subject, Skill Level, Process Step ✤ Objects need to be shared to many locations ✤ Objects must be usable in multiple systems and platforms ✤ File naming conventions need to be created and enforced ✤ Source data/citations must be available ✤ Objects must be written in a re-usable, neutral voice Thursday, March 22, 12
  • 29. Define Requirements ✤ Functional Requirements ✤ User Requirements ✤ Administrative Requirements ✤ Authentication/Authorization/Security ✤ Metrics ✤ Documentation requirements ✤ Technical Requirements ✤ Back End ✤ Front End ✤ Platform ✤ Interoperability Thursday, March 22, 12
  • 30. Authentication, Authorization and Security ✤ Consider the content collections that will be part of the program. ✤ Do you anticipate any of it having restrictions? ✤ If so, then what are those restrictions? ✤ How will authorized users authenticate and gain access? ✤ Will you restrict access by entity type? ✤ By rules-based classification? ✤ By system access and control policies? Thursday, March 22, 12
  • 31. Back End ✤ How will you architect the back end to scale effectively? ✤ Will it be easily repeated on additional clusters? ✤ What OS and software will it need to run? ✤ Will it fail over? ✤ Can it scale to handle the number of users, documents and entities predicted for the anticipated life of the hardware? Thursday, March 22, 12
  • 32. Front End ✤ How will users interact with the system? ✤ Create - Read - Update - Delete as permissioned ✤ Search, browse, publish, integrate, migrate and import to and from other systems. ✤ What tools are needed to support these actions? ✤ Should select users be able to perform administrative tasks via a client or browser interface? ✤ How about the ability to generate reports? ✤ What operating system(s) does this interface need to function on? ✤ Mobile? Offline? Thursday, March 22, 12
  • 33. Interoperability ✤ How are you going to package and publish the data? ✤ File servers? ✤ Cloud? ✤ XML? Office suites? Analytics packages? Other tools? ✤ What other applications need to use the data created by one of the above? ✤ DMS/DAM/CMS/CRM Thursday, March 22, 12
  • 34. Metadata Management ✤ What kinds of information is important to manage - what metadata elements? ✤ Title, Author, Subject, Process, Skill, Dates, Business, Function... ✤ Will you need a taxonomy? ✤ Enforce some control on the description of attributes ✤ Do you need an external tool or is there a module within your CMS, DMS or portal solution that will suffice? Thursday, March 22, 12
  • 35. Resourcing ✤ Build vs. buy ✤ Human resources - staff or contractors needed ✤ Technology needs ✤ Hardware? Software? Network? Costs? Thursday, March 22, 12
  • 36. Define & Publish Processes and Rules ✤ Maintenance processes ✤ Schedule for review and updates ✤ Rules for additions, changes, deletions ✤ Implementation and publishing process ✤ Governance rules ✤ Editor? Committee? User input? ✤ Standards compliance? Thursday, March 22, 12
  • 38. Scale ✤ According to the 2011 Digital Universe study by IDC/Sponsored by EMC, by 2020 the world will generate 50x the amount of information we have now, on 75x the number of containers, and increase IT support for those systems only by a factor of 1.5. Thursday, March 22, 12
  • 39. Scale Using Tools ✤ Compression technologies ✤ Metadata management ✤ Indexing, NLP, Search ✤ Business rule generation and application ✤ Virtualization Thursday, March 22, 12
  • 40. Scale Using Processes ✤ Standards ✤ Metadata governance ✤ Schema ✤ Taxonomy ✤ Subject Matter Experts ✤ Editorial Boards ✤ Product development Thursday, March 22, 12
  • 41. Mapping requirements to technologies Thursday, March 22, 12
  • 42. What’s available? ✤ Latest technologies ✤ Information management frameworks ✤ Business process best practices Thursday, March 22, 12
  • 44. Thank you for attending! Christine Connors TriviumRLG.com christine@triviumrlg.com http://www.slideshare.net/triviumrlg Thursday, March 22, 12