SlideShare a Scribd company logo
1 of 66
at the 2012 National Library / ExLibris Meeting

Director, Portfolio Strategy
MSR Connections
•   (and me)
•
•
•
•
•
Some Background
Worldwide Presence

                                                       MSR India




                                                                   MSR New England
                                              Redmond



• Redmond, Washington          Sept 1991
• Cambridge, United Kingdom    July 1997                           MSR Cambridge, UK
• Beijing, China               Nov 1998
• Silicon Valley, California   July 2001
                                           MSR Asia (Beijing)
• Bangalore, India             Jan 2005
• Cambridge, Massachusetts     July 2008
• New York City, NY            May 2012                         Silicon Valley, California
•

•
•


    http://research.microsoft.com/
|
Outreach. Collaboration. Innovation.
•




•


•

•

             http://research.microsoft.com/collaboration/
Engagement and Collaboration Focus

Core Computer   Natural User      Earth,      Education &    Health &
   Science       Interface       Energy &      Scholarly     Wellbeing
                               Environment   Communication
What is the
challenge before us?
Data Tidal Wave
…Thus far we seem to be worse off than before—for we can
enormously extend the record; yet even in its present bulk we
can hardly consult it. This is a much larger matter than merely
the extraction of data for the purposes of scientific research; it
involves the entire process by which man profits by his
inheritance of acquired knowledge. The prime action of use is
selection, and here we are halting indeed. There may be
millions of fine thoughts, and the account of the experience on
which they are based, all encased within stone walls of
acceptable architectural form; but if the scholar can get at
only one a week by diligent search, his syntheses are not likely
to keep up with the current scene…


                      As We May Think
                      by Vannevar Bush
                    The Atlantic, July 1945
             http://www.theatlantic.com/doc/194507/bush
According to study called How Much Information by
the University of California at San Diego,

“…consumption totaled 3.6 zettabytes and 10,845
trillion words, corresponding to 100,500 words and
34 gigabytes for an average person on an average
day. A zettabyte is 10 to the 21st power bytes, a
million million gigabytes. These estimates are from
an analysis of more than 20 different sources of
information, from very old (newspapers and books)
to very new (portable computer games, satellite
radio, and Internet video)."

[Note: Information at work is not included!]
“It’s not information overload.
        It’s filter failure.”
          Clay Shirky
       at Web 2.0 Expo 2008
http://en.wikipedia.org/wiki/Big_data
•      information technology                    [1][2][3]              data sets

              [4]                              [5]

•                                               [update]

    petabytes       exabytes             [9]

                                                                       meteorology
    genomics [10] connectomics                                        [11]
                                                     [12]

    Internet search finance             business informatics
•
                               remote sensing
                      radio-frequency identification
                         [13][14]

•
                                                               [15]             [update]

                     quintillion                                             [16]
So, what about libraries?
Present
              The Future: an Explosion of Data
Experiments     Simulations      Archives   Literature        Instruments




                                            The Challenge:
                                            Enable Discovery
                              Petabytes       Deliver the capability to mine,
                                              search and analyze this
                                              data in near real-time.
What resources are available?
DataUp:
Data Curation Add-in for Microsoft Excel




•
• California Digital Library’s Curation Center
    •
    •         DataONE
•
• Figshare is the first online repository for
    storing and sharing all of your preliminary
    findings in the form of individual figures,
    datasets, media or filesets. Post preprint
    figures on Figshare to claim priority and
    receive feedback on your findings prior to       http://figshare.com/
    formal publication.
•   Figshare allows researchers to publish all
    of their research outputs in seconds in an
    easily citable, sharable and discoverable
    manner. All file formats can be published,
    including videos and datasets that are
    often demoted to the supplemental
    materials section in current publishing
    models.
•   Figshare uses Creative Commons
    licensing to allow frictionless sharing of
    research data while allowing users to
    maintain their ownership. Figshare gives      http://www.digital-science.com/
    users unlimited public space and 1GB of
    private storage space for free.
•

•

•
    •                                   http://datacite.org/
    •


    •
                                        http://databib.org/
•                                       Registry of research data
    •                                    repositories (hosted by
    •                        [Digital       Purdue University)
        Object Identifier]
    •
    •
 Developed by the Institute
 of Quantitative Social
 Science (IQSS) at Harvard
 University




                               http://thedata.org/
DataFlow



                       http://www.dataflow.ox.ac.u
                       k/
           DataStage

DataBank

•
•
Preservation
http://duracloud.org/
WorldWideScience.org is a global science gateway connecting you to national and
international scientific databases and portals. WorldWideScience.org accelerates
scientific discovery and progress by providing one-stop searching of global science
sources. The WorldWideScience Alliance, a multilateral partnership, consists of
participating member countries and provides the governance structure for
WorldWideScience.org.
WorldWideScience.org was developed and is maintained by the Office of Scientific and
Technical Information (OSTI), an element of the Office of Science within the U.S.
Department of Energy. Please contact webmaster@worldwidescience.org if you
represent a national or international science database or portal and would like your
source searched by WorldWideScience.org.
 In 3+ years since launch, the site has grown to 65+
 countries, 400+ million pages – 96.5% of which is *not*
 available via commercial search engines – and can now be
 translated into multiple world languages (on demand).
Who does the work?
“Future Career Opportunities and
Educational Requirements for Digital Curation”




 1.

 2.



 3.

 4.


       http://sites.nationalacademies.org/PGA/brdi/PGA_069853/
Symposium on Digital Curation




                        The Future Workforce
                                     Steven Miller

                                         IBM




                                                     © 2012 IBM Corporation

32
Symposium on Digital Curation




                                     © 2012 IBM Corporation

33
Symposium on Digital Curation




                                     © 2012 IBM Corporation

34
Symposium on Digital Curation




                                     © 2012 IBM Corporation

35
Symposium on Digital Curation




                                     © 2012 IBM Corporation

36
Symposium on Digital Curation


     Enterprise Governance Architect

      Define & manage the quality, consistency, usability, security, & availability of information

      Ensure compliance with local, state, federal, and international regulations

      Define and protect key organizational information assets

      Define and manage processes to ensure data quality and remediate data errors

      Define and manage appropriate levels of security at many levels

      Define processes to protect against security issues such as identity and data theft

      Define processes to ensure appropriate testing occurs before implementing


                                                                                        © 2012 IBM Corporation

37
Symposium on Digital Curation


     Enterprise Architect – Data Governance
      Perform baseline logical reviews on key system, content, data, and process assets.

      Create and maintain a comprehensive governance architecture for the Enterprise Conceptual
       Information Model, Content Assets, and Data Assets.

      Ensure governed assets adhere to architectural principles and “Golden Rules”

      Work with Domain Architects as key interaction point for communication, evangelism, governance
       and feedback into central architecture

      Work with Business / Product Strategy in order to stay up to date with business / product direction
       in order to anticipate long-lead-time technology needs.

      Work with peers within other enterprise information management pillars to develop and maintain
       business strategy, policies, standards and guidelines pertaining to global enterprise information

      Ensure information model, information assets, governance architecture and program are aligned to
       the business goals across the company and the various business units


                                                                                               © 2012 IBM Corporation

38
Symposium on Digital Curation


     Data Curator & Analyst
      Develop and maintain tools/codes for day-to-day extraction, curation and management of
       phenotypic, genomic, breeding process, and logistical data
     
       Be instrumental for extracting and providing clean data to statistical analysis team, IT team,
       breeders and managements as per request
     
       Develop matrix to measure and track the quality improvements of phenotypic and genomic data
     
       Proactively increase awareness of value of the data quality among breeders and researchers across
       the company
     
       Work closely with corporate IT groups and statistical teams to identify and implement methods to
       automate tracking of breeding pipeline and increase quality of pipeline data in order to reduce time
       in structuring, characterizing and cleaning data.
     
       Create and present summary statistics and reports to researchers and management




                                                                                               © 2012 IBM Corporation

39
Symposium on Digital Curation


     Senior Data Steward
      Support, build, and sustain relationships with analysts, leads, supervisors and managers within the
       TFS Business organization on designated projects

       Lead all activities (planning, analysis, testing & reconciliation) in support of the delivery of small,
       medium, and large data and reporting projects and initiatives

       Serve as liaison between Business Intelligence and TFS Business units to support requests for data
       and analysis

       Analyze, monitor, profile and administer the metadata, quality and reconciliation of data within
       assigned areas on designated projects

       Prepare and execute detailed data assessments and corrective action plans

       Co-develop and execute the process of training business users on how to fully leverage and use TFS'
       business intelligence tools/reports/applications

       Serve as a subject matter expert on TFS data for specific subject areas


                                                                                                      © 2012 IBM Corporation

40
"Data Services for the Sciences: A Needs Assessment”
            Study by Brian Westra (University of Oregon, July 2010)

            1.      Data storage and backup
            2.      Making this data findable by others
            3.      Connecting data acquisition to data storage
            4.      Allowing or controlling access to this data by others
            5.      Documenting and tracking updates to the asset
            6.      Data analysis/manipulation
            7.      Finding and accessing related data from others
            8.      Connecting data storage to data analysis
            9.      Linking this data to publications or other assets
            10.     Insuring data is secure/trustworthy
            11.     Other

Westra, B. "Data Services for the Sciences: A Needs Assessment“ (30-July-2010) Ariadne Issue 64 [URL:
http://www.ariadne.ac.uk/issue64/westra/]
How do we
prepare for this
 new world of
    work?
Workforce Demand
and Career Opportunities
    in University and
   Research Libraries

NAS   Symposium   on   Digital   Curation


           Anne R. Kenney
             July 19, 2012
7 NEW ROLES FOR LIBRARIANS*
 1.   Acquisitions and Rights Advisors

 2.   Instructional Partners in Learning Spaces

 3.   Observers/anthropologists of Information Users and
      Producers

 4.   Systems Builders

 5.   Content Producers and Disseminators

 6.   Organizational Designers

 7.   Collaborative Network Creators and Participants



                                 Walters and Skinner, New Roles for New Times:
                                 Digital Curation for Preservation, ARL, Mar 2001
RATINGS OF IMPORTANCE AND FREQUENCY OF
ESCIENCE INTERNSHIP TASKS




              From Youngseek Kim, et al, “Education for eScience Professionals”,
              IJDC 6:1 (2011) http://www.ijdc.net/index.php/ijdc/article/view/168
MOST SIGNIFICANT SKILLS GAPS IN
SUPPORTING EVOLVING RESEARCHERS’
INFORMATION NEEDS

    1. Ability to advise on preserving research outputs

    2. Knowledge to advise on data management and curation,
       including ingest, discovery, access, dissemination,
       preservation, and portability

    3. Knowledge to support researchers in complying with the
       various mandates of funders, including open access
       requirements

    4. Knowledge to advise on potential data manipulation tools
       used in the discipline/subject




                                          Mary Auckland, “Re-skilling for Research,” RLUK, January 2012
© Information School / University of Sheffield 2012
MOST SIGNIFICANT SKILLS GAPS
(CONTINUED)


 5. Knowledge to advise on data mining

 6. Knowledge to advocate, and advise on, the use of metadata

 7. Ability to advise on the preservation of project records, e.g.
    correspondence

 8. Knowledge of sources of research funding to assist
    researchers to identify potential funders

 9. Skills to develop metadata schema, and advise on
    discipline/subject standards and practices, for individual
    research projects



                   Mary Auckland, “Re-skilling for Research,” RLUK, January 2012
REQUISITE EXPERTISE FOR DIGITAL
HUMANITIES AND SOCIAL SCIENCES

Requisite Expertise
Domain/subject expertise
Analytical expertise
Data expertise
Project management expertise




   Williford and Henry, “One Culture: Computationally Intensive Research in the Humanities
   and Social Sciences,” CLIR, 2012
39 schools worldwide and growing
                                                   http://www.ischools.org/
The iSchools organization was founded in 2005 by a collective of Information Schools dedicated to advancing the
information field in the 21st Century. These schools, colleges, and departments have been newly created or are evolving
from programs formerly focused on specific tracks such as information technology, library science, informatics, and
information science. While each individual iSchoolhas its own strengths and specializations, together they share a
fundamental interest in the relationships between information, people, and technology.
Digital Curation
as a Core Competency
Symposium on Digital Curation in the Era of Big Data:
Career Opportunities & Educational Requirements

July 19, 2012




Dean Elizabeth D. Liddy
iSchool, Syracuse University
5 Stages of the Data Life Cycle

        Data Archiving / Preservation

Data Presentation / Visualization

             Data Analytics

  Data Management

Data Collection
Three Additional Vital Competencies




                Data Archiving / Preservation


       Data Presentation / Visualization

                  Data Analytics


        Data Management

     Data Collection
Preserving Access to Our Digital Future:
               Building an International Digital Curation Curriculum




DigCCurr Matrix




                 Competencies for Curators
        http://www.ils.unc.edu/digccurr/digccurr-matrix.html/
http://digital-scholarship.org/dcrg/dcrg.htm




      Available under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
Evolved Infrastructure




Evolved Tools & Resources




Evolved Librarians
Servers are the
 new shelves.
The Opportunity Before Us
• Seek out and initiate data projects
   – Cross-domain partnerships
   – Enhance broad availability

• Pursue value-added services
   –   Data storage and backup services
   –   Enhancing data mark-up and findability
   –   Securing/controlling access to data
   –   Maintaining provenance
   –   Developing analytical and visualization tools
   –   Seeking related data/research
   –   Hosting and linking data to publications/assets
   –   Ensuring that data is preserved for the long-term

• Grow your people
   – Invest in training your existing staff
   – Change the technical profile of who you hire
   – Support the evolution of how we educate the field
                                                           60
– Helen Harkness


http://www.amazon.com/Career-Chase-Creative-Control-Chaotic/dp/0891060987
"If you don't like change, you're
going to like irrelevance even less.“
                —General Eric Shinseki
                Retired United States Army four-star general,
                   currently US Secretary of Veterans Affairs
Paintings by
 Xiaoze Xie
Xiaoze Xie immigrated from the People’s Republic
of China in 1992, where he was born and studied
art and architecture. He has MFA degrees from
Beijing and North Texas University, and taught at
Bucknell University before assuming his current
post at Stanford. His works are in the collections of
the Museum of Fine Arts, Houston, the Scottsdale
Museum of Contemporary Art and distinguished
private collections.
Xie’s oil paintings bring together serene qualities of
traditional still-life painting and photography. From
the long tradition of still-life painting he employs a
rich and selected palette to represent the books,
which take on a nearly symbolic role.

  Webpage: http://art.stanford.edu/profile/Xiaoze+Xie/
              Email: xzxie@stanford.edu
“We’d now like to open the floor
to shorter speeches disguised as questions.”


                                         Published in The New Yorker 10/18/2010
                                                                by Steve Macone
Thank you!




                    Lee Dirks
           Directory, Portfolio Strategy
        Microsoft Research | Connections
              ldirks@microsoft.com
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012

More Related Content

What's hot

Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceAndrea Wiggins
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data ScienceFeyzi R. Bagirov
 
Big Data Content Organization, Discovery, and Management
Big Data Content Organization, Discovery, and ManagementBig Data Content Organization, Discovery, and Management
Big Data Content Organization, Discovery, and ManagementAccess Innovations, Inc.
 
Forethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital ScholarshipForethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital ScholarshipDavid De Roure
 
2013 bio it world
2013 bio it world2013 bio it world
2013 bio it worldChris Dwan
 
Citizen Science And a Manufacturing Revolution: Major trends research notes
Citizen Science And a Manufacturing Revolution: Major trends research notesCitizen Science And a Manufacturing Revolution: Major trends research notes
Citizen Science And a Manufacturing Revolution: Major trends research notesChris Jones
 
Citizen Science Phenotypes
Citizen Science PhenotypesCitizen Science Phenotypes
Citizen Science PhenotypesAndrea Wiggins
 
 Blockchain Overview: Possibilities and Issues
 Blockchain Overview: Possibilities and Issues Blockchain Overview: Possibilities and Issues
 Blockchain Overview: Possibilities and IssuesBohyun Kim
 
Data Management for Citizen Science
Data Management for Citizen ScienceData Management for Citizen Science
Data Management for Citizen ScienceAndrea Wiggins
 
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:  SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012: Gigaom
 
2 7-2013-big data and e-discovery
2 7-2013-big data and e-discovery2 7-2013-big data and e-discovery
2 7-2013-big data and e-discoveryExterro
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingLisa Haddow
 
RIKM3 Leveraging the relationship between RM, IM and KM
RIKM3 Leveraging the relationship between RM, IM and KMRIKM3 Leveraging the relationship between RM, IM and KM
RIKM3 Leveraging the relationship between RM, IM and KMDavid Williams
 
Behind the Scenes with Data.gov
Behind the Scenes with Data.govBehind the Scenes with Data.gov
Behind the Scenes with Data.govJeanne Holm
 
091020 E Research Otago
091020 E Research Otago091020 E Research Otago
091020 E Research OtagoNick Jones
 
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...Andrea Wiggins
 
Data wars - management information to data driven intelligence
Data wars - management information to data driven intelligenceData wars - management information to data driven intelligence
Data wars - management information to data driven intelligenceKen Chad Consulting Ltd
 
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...Andrea Wiggins
 

What's hot (20)

Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data Science
 
Big Data Content Organization, Discovery, and Management
Big Data Content Organization, Discovery, and ManagementBig Data Content Organization, Discovery, and Management
Big Data Content Organization, Discovery, and Management
 
Forethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital ScholarshipForethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital Scholarship
 
2013 bio it world
2013 bio it world2013 bio it world
2013 bio it world
 
Crowdsourcing Science
Crowdsourcing ScienceCrowdsourcing Science
Crowdsourcing Science
 
Citizen Science And a Manufacturing Revolution: Major trends research notes
Citizen Science And a Manufacturing Revolution: Major trends research notesCitizen Science And a Manufacturing Revolution: Major trends research notes
Citizen Science And a Manufacturing Revolution: Major trends research notes
 
Citizen Science Phenotypes
Citizen Science PhenotypesCitizen Science Phenotypes
Citizen Science Phenotypes
 
 Blockchain Overview: Possibilities and Issues
 Blockchain Overview: Possibilities and Issues Blockchain Overview: Possibilities and Issues
 Blockchain Overview: Possibilities and Issues
 
Data Management for Citizen Science
Data Management for Citizen ScienceData Management for Citizen Science
Data Management for Citizen Science
 
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:  SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
 
2 7-2013-big data and e-discovery
2 7-2013-big data and e-discovery2 7-2013-big data and e-discovery
2 7-2013-big data and e-discovery
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of Stirling
 
RIKM3 Leveraging the relationship between RM, IM and KM
RIKM3 Leveraging the relationship between RM, IM and KMRIKM3 Leveraging the relationship between RM, IM and KM
RIKM3 Leveraging the relationship between RM, IM and KM
 
Behind the Scenes with Data.gov
Behind the Scenes with Data.govBehind the Scenes with Data.gov
Behind the Scenes with Data.gov
 
091020 E Research Otago
091020 E Research Otago091020 E Research Otago
091020 E Research Otago
 
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
 
Little eScience
Little eScienceLittle eScience
Little eScience
 
Data wars - management information to data driven intelligence
Data wars - management information to data driven intelligenceData wars - management information to data driven intelligence
Data wars - management information to data driven intelligence
 
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
 

Viewers also liked

Creating Moments That Matter Research Studies | 2013 Google Nielsen Report
Creating Moments That Matter Research Studies | 2013 Google Nielsen ReportCreating Moments That Matter Research Studies | 2013 Google Nielsen Report
Creating Moments That Matter Research Studies | 2013 Google Nielsen ReportRichard Bouchez
 
M2 roadshow us jesse haines, google
M2 roadshow us   jesse haines, googleM2 roadshow us   jesse haines, google
M2 roadshow us jesse haines, googlemobilesquared Ltd
 
Google's Mobile Search Presentation from #MMSEM11
Google's Mobile Search Presentation from #MMSEM11Google's Mobile Search Presentation from #MMSEM11
Google's Mobile Search Presentation from #MMSEM11Marcel Media
 
The mobile-movement research-studies
The mobile-movement research-studiesThe mobile-movement research-studies
The mobile-movement research-studiesNigel Mark Dias
 
Are you mobile ready 280311
Are you mobile ready 280311Are you mobile ready 280311
Are you mobile ready 280311PhilipNagle
 
Bricks and Mobile - State of Retail Mobile
Bricks and Mobile - State of Retail MobileBricks and Mobile - State of Retail Mobile
Bricks and Mobile - State of Retail MobileRemodista
 
Mobile Search: The Landscape, testing, & Getting results
Mobile Search: The Landscape, testing, & Getting resultsMobile Search: The Landscape, testing, & Getting results
Mobile Search: The Landscape, testing, & Getting resultsaiCommerce
 

Viewers also liked (7)

Creating Moments That Matter Research Studies | 2013 Google Nielsen Report
Creating Moments That Matter Research Studies | 2013 Google Nielsen ReportCreating Moments That Matter Research Studies | 2013 Google Nielsen Report
Creating Moments That Matter Research Studies | 2013 Google Nielsen Report
 
M2 roadshow us jesse haines, google
M2 roadshow us   jesse haines, googleM2 roadshow us   jesse haines, google
M2 roadshow us jesse haines, google
 
Google's Mobile Search Presentation from #MMSEM11
Google's Mobile Search Presentation from #MMSEM11Google's Mobile Search Presentation from #MMSEM11
Google's Mobile Search Presentation from #MMSEM11
 
The mobile-movement research-studies
The mobile-movement research-studiesThe mobile-movement research-studies
The mobile-movement research-studies
 
Are you mobile ready 280311
Are you mobile ready 280311Are you mobile ready 280311
Are you mobile ready 280311
 
Bricks and Mobile - State of Retail Mobile
Bricks and Mobile - State of Retail MobileBricks and Mobile - State of Retail Mobile
Bricks and Mobile - State of Retail Mobile
 
Mobile Search: The Landscape, testing, & Getting results
Mobile Search: The Landscape, testing, & Getting resultsMobile Search: The Landscape, testing, & Getting results
Mobile Search: The Landscape, testing, & Getting results
 

Similar to ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012

Bleeding, Leading, or Not Competing
Bleeding, Leading, or Not CompetingBleeding, Leading, or Not Competing
Bleeding, Leading, or Not CompetingRobert H. McDonald
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011Lee Dirks
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationPhilip Bourne
 
Research, the Cloud, and the IRB
Research, the Cloud, and the IRBResearch, the Cloud, and the IRB
Research, the Cloud, and the IRBMichael Zimmer
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
 
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731jeffreylancaster
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12Lee Dirks
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Rising tide of data update 20171024
Rising tide of data update 20171024Rising tide of data update 20171024
Rising tide of data update 20171024Keith Russell
 
Rising tide of data update
Rising tide of data update Rising tide of data update
Rising tide of data update ARDC
 
Simbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational ResearchSimbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational Researchjpk
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web DataMarieke Guy
 
Databases & Challenges of a Digital Age
Databases & Challenges of a Digital AgeDatabases & Challenges of a Digital Age
Databases & Challenges of a Digital AgeWendy Lile
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital LibraryEd Fay
 

Similar to ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012 (20)

CAEPIA 2011
CAEPIA 2011CAEPIA 2011
CAEPIA 2011
 
Bleeding, Leading, or Not Competing
Bleeding, Leading, or Not CompetingBleeding, Leading, or Not Competing
Bleeding, Leading, or Not Competing
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di...
Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di...Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di...
Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di...
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
 
Research, the Cloud, and the IRB
Research, the Cloud, and the IRBResearch, the Cloud, and the IRB
Research, the Cloud, and the IRB
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Rising tide of data update 20171024
Rising tide of data update 20171024Rising tide of data update 20171024
Rising tide of data update 20171024
 
Rising tide of data update
Rising tide of data update Rising tide of data update
Rising tide of data update
 
Simbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational ResearchSimbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational Research
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
 
Databases & Challenges of a Digital Age
Databases & Challenges of a Digital AgeDatabases & Challenges of a Digital Age
Databases & Challenges of a Digital Age
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital Library
 

Recently uploaded

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 

Recently uploaded (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012

  • 1. at the 2012 National Library / ExLibris Meeting Director, Portfolio Strategy MSR Connections
  • 2. (and me) • • • • •
  • 4. Worldwide Presence MSR India MSR New England Redmond • Redmond, Washington Sept 1991 • Cambridge, United Kingdom July 1997 MSR Cambridge, UK • Beijing, China Nov 1998 • Silicon Valley, California July 2001 MSR Asia (Beijing) • Bangalore, India Jan 2005 • Cambridge, Massachusetts July 2008 • New York City, NY May 2012 Silicon Valley, California
  • 5. • • • http://research.microsoft.com/
  • 6.
  • 7. | Outreach. Collaboration. Innovation. • • • • http://research.microsoft.com/collaboration/
  • 8. Engagement and Collaboration Focus Core Computer Natural User Earth, Education & Health & Science Interface Energy & Scholarly Wellbeing Environment Communication
  • 11. …Thus far we seem to be worse off than before—for we can enormously extend the record; yet even in its present bulk we can hardly consult it. This is a much larger matter than merely the extraction of data for the purposes of scientific research; it involves the entire process by which man profits by his inheritance of acquired knowledge. The prime action of use is selection, and here we are halting indeed. There may be millions of fine thoughts, and the account of the experience on which they are based, all encased within stone walls of acceptable architectural form; but if the scholar can get at only one a week by diligent search, his syntheses are not likely to keep up with the current scene… As We May Think by Vannevar Bush The Atlantic, July 1945 http://www.theatlantic.com/doc/194507/bush
  • 12. According to study called How Much Information by the University of California at San Diego, “…consumption totaled 3.6 zettabytes and 10,845 trillion words, corresponding to 100,500 words and 34 gigabytes for an average person on an average day. A zettabyte is 10 to the 21st power bytes, a million million gigabytes. These estimates are from an analysis of more than 20 different sources of information, from very old (newspapers and books) to very new (portable computer games, satellite radio, and Internet video)." [Note: Information at work is not included!]
  • 13. “It’s not information overload. It’s filter failure.” Clay Shirky at Web 2.0 Expo 2008
  • 14. http://en.wikipedia.org/wiki/Big_data • information technology [1][2][3] data sets [4] [5] • [update] petabytes exabytes [9] meteorology genomics [10] connectomics [11] [12] Internet search finance business informatics • remote sensing radio-frequency identification [13][14] • [15] [update] quintillion [16]
  • 15. So, what about libraries?
  • 16. Present The Future: an Explosion of Data Experiments Simulations Archives Literature Instruments The Challenge: Enable Discovery Petabytes Deliver the capability to mine, search and analyze this data in near real-time.
  • 17.
  • 18. What resources are available?
  • 19. DataUp: Data Curation Add-in for Microsoft Excel • • California Digital Library’s Curation Center • • DataONE •
  • 20.
  • 21. • Figshare is the first online repository for storing and sharing all of your preliminary findings in the form of individual figures, datasets, media or filesets. Post preprint figures on Figshare to claim priority and receive feedback on your findings prior to http://figshare.com/ formal publication. • Figshare allows researchers to publish all of their research outputs in seconds in an easily citable, sharable and discoverable manner. All file formats can be published, including videos and datasets that are often demoted to the supplemental materials section in current publishing models. • Figshare uses Creative Commons licensing to allow frictionless sharing of research data while allowing users to maintain their ownership. Figshare gives http://www.digital-science.com/ users unlimited public space and 1GB of private storage space for free.
  • 22. • • • • http://datacite.org/ • • http://databib.org/ • Registry of research data • repositories (hosted by • [Digital Purdue University) Object Identifier] • •
  • 23.  Developed by the Institute of Quantitative Social Science (IQSS) at Harvard University http://thedata.org/
  • 24.
  • 25. DataFlow http://www.dataflow.ox.ac.u k/ DataStage DataBank • •
  • 28.
  • 29. WorldWideScience.org is a global science gateway connecting you to national and international scientific databases and portals. WorldWideScience.org accelerates scientific discovery and progress by providing one-stop searching of global science sources. The WorldWideScience Alliance, a multilateral partnership, consists of participating member countries and provides the governance structure for WorldWideScience.org. WorldWideScience.org was developed and is maintained by the Office of Scientific and Technical Information (OSTI), an element of the Office of Science within the U.S. Department of Energy. Please contact webmaster@worldwidescience.org if you represent a national or international science database or portal and would like your source searched by WorldWideScience.org. In 3+ years since launch, the site has grown to 65+ countries, 400+ million pages – 96.5% of which is *not* available via commercial search engines – and can now be translated into multiple world languages (on demand).
  • 30. Who does the work?
  • 31. “Future Career Opportunities and Educational Requirements for Digital Curation” 1. 2. 3. 4. http://sites.nationalacademies.org/PGA/brdi/PGA_069853/
  • 32. Symposium on Digital Curation The Future Workforce Steven Miller IBM © 2012 IBM Corporation 32
  • 33. Symposium on Digital Curation © 2012 IBM Corporation 33
  • 34. Symposium on Digital Curation © 2012 IBM Corporation 34
  • 35. Symposium on Digital Curation © 2012 IBM Corporation 35
  • 36. Symposium on Digital Curation © 2012 IBM Corporation 36
  • 37. Symposium on Digital Curation Enterprise Governance Architect Define & manage the quality, consistency, usability, security, & availability of information Ensure compliance with local, state, federal, and international regulations Define and protect key organizational information assets Define and manage processes to ensure data quality and remediate data errors Define and manage appropriate levels of security at many levels Define processes to protect against security issues such as identity and data theft Define processes to ensure appropriate testing occurs before implementing © 2012 IBM Corporation 37
  • 38. Symposium on Digital Curation Enterprise Architect – Data Governance  Perform baseline logical reviews on key system, content, data, and process assets.  Create and maintain a comprehensive governance architecture for the Enterprise Conceptual Information Model, Content Assets, and Data Assets.  Ensure governed assets adhere to architectural principles and “Golden Rules”  Work with Domain Architects as key interaction point for communication, evangelism, governance and feedback into central architecture  Work with Business / Product Strategy in order to stay up to date with business / product direction in order to anticipate long-lead-time technology needs.  Work with peers within other enterprise information management pillars to develop and maintain business strategy, policies, standards and guidelines pertaining to global enterprise information  Ensure information model, information assets, governance architecture and program are aligned to the business goals across the company and the various business units © 2012 IBM Corporation 38
  • 39. Symposium on Digital Curation Data Curator & Analyst  Develop and maintain tools/codes for day-to-day extraction, curation and management of phenotypic, genomic, breeding process, and logistical data  Be instrumental for extracting and providing clean data to statistical analysis team, IT team, breeders and managements as per request  Develop matrix to measure and track the quality improvements of phenotypic and genomic data  Proactively increase awareness of value of the data quality among breeders and researchers across the company  Work closely with corporate IT groups and statistical teams to identify and implement methods to automate tracking of breeding pipeline and increase quality of pipeline data in order to reduce time in structuring, characterizing and cleaning data.  Create and present summary statistics and reports to researchers and management © 2012 IBM Corporation 39
  • 40. Symposium on Digital Curation Senior Data Steward  Support, build, and sustain relationships with analysts, leads, supervisors and managers within the TFS Business organization on designated projects Lead all activities (planning, analysis, testing & reconciliation) in support of the delivery of small, medium, and large data and reporting projects and initiatives Serve as liaison between Business Intelligence and TFS Business units to support requests for data and analysis Analyze, monitor, profile and administer the metadata, quality and reconciliation of data within assigned areas on designated projects Prepare and execute detailed data assessments and corrective action plans Co-develop and execute the process of training business users on how to fully leverage and use TFS' business intelligence tools/reports/applications Serve as a subject matter expert on TFS data for specific subject areas © 2012 IBM Corporation 40
  • 41. "Data Services for the Sciences: A Needs Assessment” Study by Brian Westra (University of Oregon, July 2010) 1. Data storage and backup 2. Making this data findable by others 3. Connecting data acquisition to data storage 4. Allowing or controlling access to this data by others 5. Documenting and tracking updates to the asset 6. Data analysis/manipulation 7. Finding and accessing related data from others 8. Connecting data storage to data analysis 9. Linking this data to publications or other assets 10. Insuring data is secure/trustworthy 11. Other Westra, B. "Data Services for the Sciences: A Needs Assessment“ (30-July-2010) Ariadne Issue 64 [URL: http://www.ariadne.ac.uk/issue64/westra/]
  • 42. How do we prepare for this new world of work?
  • 43. Workforce Demand and Career Opportunities in University and Research Libraries NAS Symposium on Digital Curation Anne R. Kenney July 19, 2012
  • 44. 7 NEW ROLES FOR LIBRARIANS* 1. Acquisitions and Rights Advisors 2. Instructional Partners in Learning Spaces 3. Observers/anthropologists of Information Users and Producers 4. Systems Builders 5. Content Producers and Disseminators 6. Organizational Designers 7. Collaborative Network Creators and Participants Walters and Skinner, New Roles for New Times: Digital Curation for Preservation, ARL, Mar 2001
  • 45. RATINGS OF IMPORTANCE AND FREQUENCY OF ESCIENCE INTERNSHIP TASKS From Youngseek Kim, et al, “Education for eScience Professionals”, IJDC 6:1 (2011) http://www.ijdc.net/index.php/ijdc/article/view/168
  • 46. MOST SIGNIFICANT SKILLS GAPS IN SUPPORTING EVOLVING RESEARCHERS’ INFORMATION NEEDS 1. Ability to advise on preserving research outputs 2. Knowledge to advise on data management and curation, including ingest, discovery, access, dissemination, preservation, and portability 3. Knowledge to support researchers in complying with the various mandates of funders, including open access requirements 4. Knowledge to advise on potential data manipulation tools used in the discipline/subject Mary Auckland, “Re-skilling for Research,” RLUK, January 2012 © Information School / University of Sheffield 2012
  • 47. MOST SIGNIFICANT SKILLS GAPS (CONTINUED) 5. Knowledge to advise on data mining 6. Knowledge to advocate, and advise on, the use of metadata 7. Ability to advise on the preservation of project records, e.g. correspondence 8. Knowledge of sources of research funding to assist researchers to identify potential funders 9. Skills to develop metadata schema, and advise on discipline/subject standards and practices, for individual research projects Mary Auckland, “Re-skilling for Research,” RLUK, January 2012
  • 48. REQUISITE EXPERTISE FOR DIGITAL HUMANITIES AND SOCIAL SCIENCES Requisite Expertise Domain/subject expertise Analytical expertise Data expertise Project management expertise Williford and Henry, “One Culture: Computationally Intensive Research in the Humanities and Social Sciences,” CLIR, 2012
  • 49. 39 schools worldwide and growing http://www.ischools.org/ The iSchools organization was founded in 2005 by a collective of Information Schools dedicated to advancing the information field in the 21st Century. These schools, colleges, and departments have been newly created or are evolving from programs formerly focused on specific tracks such as information technology, library science, informatics, and information science. While each individual iSchoolhas its own strengths and specializations, together they share a fundamental interest in the relationships between information, people, and technology.
  • 50. Digital Curation as a Core Competency Symposium on Digital Curation in the Era of Big Data: Career Opportunities & Educational Requirements July 19, 2012 Dean Elizabeth D. Liddy iSchool, Syracuse University
  • 51. 5 Stages of the Data Life Cycle Data Archiving / Preservation Data Presentation / Visualization Data Analytics Data Management Data Collection
  • 52.
  • 53. Three Additional Vital Competencies Data Archiving / Preservation Data Presentation / Visualization Data Analytics Data Management Data Collection
  • 54.
  • 55.
  • 56. Preserving Access to Our Digital Future: Building an International Digital Curation Curriculum DigCCurr Matrix Competencies for Curators http://www.ils.unc.edu/digccurr/digccurr-matrix.html/
  • 57. http://digital-scholarship.org/dcrg/dcrg.htm Available under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
  • 58. Evolved Infrastructure Evolved Tools & Resources Evolved Librarians
  • 59. Servers are the new shelves.
  • 60. The Opportunity Before Us • Seek out and initiate data projects – Cross-domain partnerships – Enhance broad availability • Pursue value-added services – Data storage and backup services – Enhancing data mark-up and findability – Securing/controlling access to data – Maintaining provenance – Developing analytical and visualization tools – Seeking related data/research – Hosting and linking data to publications/assets – Ensuring that data is preserved for the long-term • Grow your people – Invest in training your existing staff – Change the technical profile of who you hire – Support the evolution of how we educate the field 60
  • 62. "If you don't like change, you're going to like irrelevance even less.“ —General Eric Shinseki Retired United States Army four-star general, currently US Secretary of Veterans Affairs
  • 63. Paintings by Xiaoze Xie Xiaoze Xie immigrated from the People’s Republic of China in 1992, where he was born and studied art and architecture. He has MFA degrees from Beijing and North Texas University, and taught at Bucknell University before assuming his current post at Stanford. His works are in the collections of the Museum of Fine Arts, Houston, the Scottsdale Museum of Contemporary Art and distinguished private collections. Xie’s oil paintings bring together serene qualities of traditional still-life painting and photography. From the long tradition of still-life painting he employs a rich and selected palette to represent the books, which take on a nearly symbolic role. Webpage: http://art.stanford.edu/profile/Xiaoze+Xie/ Email: xzxie@stanford.edu
  • 64. “We’d now like to open the floor to shorter speeches disguised as questions.” Published in The New Yorker 10/18/2010 by Steve Macone
  • 65. Thank you! Lee Dirks Directory, Portfolio Strategy Microsoft Research | Connections ldirks@microsoft.com

Editor's Notes

  1. XiaozeXieLos Angeles Public Library (R740), 2009, oil on canvas, 32 x 64 inches
  2. Rob Vargas created it from a study called How Much Information by the University of California at San Diego
  3. TITLE:  Stanford Art Library (NA7764-NA8206) ARTIST:  XiaozeXieWORK DATE:  2009 CATEGORY:  Paintings MATERIALS:  oil on canvas SIZE:  h: 30 x w: 60 in / h: 76.2 x w: 152.4 cm REGION:  Chinese STYLE:  Contemporary (ca. 1945-present) PRICE*:  Contact Gallery for Price GALLERY:  415.433.2710    Send EmailONLINE CATALOGUE(S):  XiaozeXie  Jan 6 - Jan 30, 2010
  4. Chinese Library No. 42, 2009oil on canvas32” x 61”
  5. The MoMA Library (3), 2007, oil on canvas, 40 x 60 inches
  6. http://www.newyorkerstore.com/wed-now-like-to-open-the-floor-to-shorter-speeches-disguised-as-questions/invt/136206/