SlideShare a Scribd company logo
1 of 17
Shared Data: What it Means
for the Future of Libraries
Robin Fay @georgiawebgurl
Head, DBM/Cataloging / UGA Libraries
Peter Murray
Lyrasis
Draft Content for Discussion group 05.01.2013 / robinfay
Agenda
• Overview of big data
▫ What is big data? What is shared data?
▫ Implications and challenges
▫ Background: Alistair Croll talk at ALA
Midwinter
http://www.youtube.com/watch?v=Ic_Bl
PesEls
• Discussion framed around Alistair’s
presentation topics
Draft Content for Discussion group 04.30.2013
How did our data get big?
• Technology that has unforeseen consequences
• Technology changes.
• We leave digital trails wherever we go.
• Think> internet browsing history, email,
medical records, bank transactions, buying
history at shopping sites, Amazon reviews,
Facebook photos, comments on websites, and
much more.
Draft Content for Discussion group 05.01.2013
How did our data get big?
• “Collectively the data
that we leave behind
is Big Data. “
• and of course.. There
is the data that
others (people and
machines) create
about us.
• Big Data is about us
and has far reaching
consequences.
Draft Content for Discussion group 05.01.2013
What is Big Data?
• It is a not a technology
– it is a shift in how we
view and use
information
• Taking large amounts of
information spread
across many different
resources in different
formats making them
explore
• It doesn’t have to be
“that big just bigger
than what you can go
through by hand”
Draft Content for Discussion group 05.01.2013
3 attributes of Big Data
• Large
• Fast (manual
time needed)
• and
unstructured
(formats
differ)
=3 Vs of Big
Data
Draft Content for Discussion group 05.01.2013
Big Data
• Relational (relationships) database - our ILS systems
are often relational databases
• Mathematical database – computations
• Big Data is the intersection of two
• Health– analyzing health records to identify allergies,
sickness, etc
• Philanthropy (datakind) – analyze behavior of
farmers and knowledge workers to evaluate the impact
(ROI) of philanthropic work
• Think about potential for library use: we have patron
data, bibliographic data and more!
Concerns of Big Data
• Privacy – erodes privacy potentially leaking private
information
• Justify stereotypes (data can be misused or used in a
negative) and polarize social groups
• Facebook open graph search – pulling together
information from diverse information to get lists of
seemingly innocent ways such as movie watching
habits or music can be used in negative ways to
reinforce stereotypes or drawn conclusions about
people
• “Personalization can look like prejudice”
• We live in grey areas
• Computers do not understand that
Draft Content for Discussion group 05.01.2013
Which side of the fence?
• Big Data is going to change our lives!
• Are you
• a semantic idealist ? if we can taxonomize
and organize it, we can make sense of it
▫ Wolfram Alpha – we can ask it and it will reason
(mathematical)
• A chaotic nihilist? Algorithms will handle it –
correct data will bubble up given enough information
▫ Watson – doesn’t know answers but will analyze to
interpret answer
Draft Content for Discussion group 05.01.2013
So, how would you file a cup of coffee?
• Depends upon how you will use
the information!
• Understandings do not take
advantage of digital information
which slows semantic idealism –
much information not organized
so we have to rely algorithms (for
now) but it is vunerable.
• Tagging is often done by
machines – even in libraries we
batch load, harvest, update data
globally.
Draft Content for Discussion group 05.01.2013
Humans and technology
• Our reasoning can be flawed - we make
decisions evolutionary – we look at
simple correlations and patterns (false
positives)
• If comments after a post are highly
negative, responders are more likely to
take polarizing viewpoints
• Even with math is good, data can be
wrong
Draft Content for Discussion group 05.01.2013
Shared data
• We are a mosaic of data from other resources
• Unified digital history – record of all of our
data and could aggregate health information and
share with doctors – just one example
• Veracity (can verify) and Value (how we can
make sense of our data)
• Shared data : connecting networks will collect
data; algorithms will tag and assign metadata
but it will be up to humans to add value - this
can then be shared in ways that are useful
Draft Content for Discussion group 05.01.2013
Linked data makes it possible
• Linked data keeps us from having to re-enter or
copy information
It makes data:
• reusable
• easy to correct (correct one record instead of
multiples)
• efficient
• and potentially useful to others
Draft Content for Discussion group 05.01.2013
Linked data makes it possible
• It can build relationships in different ways -
allowing us to create temporary collections (a
user could organize their search results in a way
that makes sense to them) or more permanent
(collocating ALL works by a particular author
more easily; pulling together photographs more
easily)
• It can help make sense of Big Data and
facilitate sharing data.
Linked data makes it possible
• Linked data keeps us from having to re-enter or
copy information
It makes data:
• reusable
• easy to correct (correct one record instead of
multiples)
• efficient
• and potentially useful to others
Thinking of data in the library environment
• Automation and new technologies
• The web has changed
• Large scale bibliographic databases
• User expectations and needs
• Patron data
• Cooperative cataloging
• Greater variety of media in library collections
(electronic!)
• FRBR is our data model – semantic web
friendly!
Draft Content for Discussion group 05.01.2013
Discussion points
• Obviously, WorldCat is a shared data resource
we have all been using for years. What are
some other examples of big data, shared data,
or linked data that libraries use now?
2. What are some examples of data that
libraries could share that we aren't sharing
already?
3. What are some of the pitfalls of data sharing
on a massive scale?
Draft Content for Discussion group 05.01.2013

More Related Content

What's hot

Democratizing Open Data
Democratizing Open DataDemocratizing Open Data
Democratizing Open Data
Alvaro Graves
 
Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...
Maggie Verster
 
On line footprint @upc
On line footprint @upcOn line footprint @upc
On line footprint @upc
Silvia Puglisi
 
Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...
Research Data Alliance
 

What's hot (20)

New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0
 
The Role of the Library in a Digital World
The Role of the Library in a Digital WorldThe Role of the Library in a Digital World
The Role of the Library in a Digital World
 
It’s Local, It’s Personal -- Search Engine & Information Service Trends
It’s Local, It’s Personal -- Search Engine & Information Service TrendsIt’s Local, It’s Personal -- Search Engine & Information Service Trends
It’s Local, It’s Personal -- Search Engine & Information Service Trends
 
Freedman Center for Digital Scholarship Colloquium - 14_1106
Freedman Center for Digital Scholarship Colloquium - 14_1106Freedman Center for Digital Scholarship Colloquium - 14_1106
Freedman Center for Digital Scholarship Colloquium - 14_1106
 
Introduction to digital scholarship tools
Introduction to digital scholarship toolsIntroduction to digital scholarship tools
Introduction to digital scholarship tools
 
Data management
Data management Data management
Data management
 
Democratizing Open Data
Democratizing Open DataDemocratizing Open Data
Democratizing Open Data
 
Using a digital library to organise your life
Using a digital library to organise your lifeUsing a digital library to organise your life
Using a digital library to organise your life
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...
 
Using Social Media to Develop Your Academic Profile and Engage Others in Your...
Using Social Media to Develop Your Academic Profile and Engage Others in Your...Using Social Media to Develop Your Academic Profile and Engage Others in Your...
Using Social Media to Develop Your Academic Profile and Engage Others in Your...
 
Building a Distributed Data Portal
Building a Distributed Data PortalBuilding a Distributed Data Portal
Building a Distributed Data Portal
 
On line footprint @upc
On line footprint @upcOn line footprint @upc
On line footprint @upc
 
HumanityRoad training - Basic Crisis Information Management
HumanityRoad training - Basic Crisis Information ManagementHumanityRoad training - Basic Crisis Information Management
HumanityRoad training - Basic Crisis Information Management
 
Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...
 
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Using social media to address professional issues in LIS
Using social media to address professional issues in LISUsing social media to address professional issues in LIS
Using social media to address professional issues in LIS
 
Datascience and python
Datascience and pythonDatascience and python
Datascience and python
 
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
 

Similar to Shared Data & Big Data for Libraries

Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
Thinkful
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
Regan Harper
 

Similar to Shared Data & Big Data for Libraries (20)

Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Tutorial Data Management and workflows
Tutorial Data Management and workflowsTutorial Data Management and workflows
Tutorial Data Management and workflows
 
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
Carl idigpres
Carl idigpresCarl idigpres
Carl idigpres
 
CARLIdigpres
CARLIdigpresCARLIdigpres
CARLIdigpres
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
AI and Libraries - Yasser Ayyash.pptx
AI and Libraries - Yasser Ayyash.pptxAI and Libraries - Yasser Ayyash.pptx
AI and Libraries - Yasser Ayyash.pptx
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
 
ERN-Data-Ethics.pptx
ERN-Data-Ethics.pptxERN-Data-Ethics.pptx
ERN-Data-Ethics.pptx
 
ASA conference Feb 2013
ASA conference Feb 2013ASA conference Feb 2013
ASA conference Feb 2013
 
Community Informatics for Community Informaticians (keynote at CIRN 2010, Pra...
Community Informatics for Community Informaticians (keynote at CIRN 2010, Pra...Community Informatics for Community Informaticians (keynote at CIRN 2010, Pra...
Community Informatics for Community Informaticians (keynote at CIRN 2010, Pra...
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with Data
 

More from robin fay

Go google - tips and tricks for getting started with google
Go google - tips and tricks for getting started with googleGo google - tips and tricks for getting started with google
Go google - tips and tricks for getting started with google
robin fay
 

More from robin fay (20)

RWOs & linkeddata for libraries
RWOs & linkeddata for librariesRWOs & linkeddata for libraries
RWOs & linkeddata for libraries
 
Project Management for Libraries
Project Management for LibrariesProject Management for Libraries
Project Management for Libraries
 
Controlled Vocabularies & Cataloging
Controlled Vocabularies & Cataloging Controlled Vocabularies & Cataloging
Controlled Vocabularies & Cataloging
 
Catexpress - Quick overview
Catexpress - Quick overviewCatexpress - Quick overview
Catexpress - Quick overview
 
IOT, Real World Things, & Linked data
IOT, Real World Things, & Linked dataIOT, Real World Things, & Linked data
IOT, Real World Things, & Linked data
 
Cataloging101 foundations: Authorities
Cataloging101 foundations: AuthoritiesCataloging101 foundations: Authorities
Cataloging101 foundations: Authorities
 
BIBFRAME, Linked data, RDA
BIBFRAME, Linked data, RDA BIBFRAME, Linked data, RDA
BIBFRAME, Linked data, RDA
 
Cataloging101 foundations frbr - 2019 version
Cataloging101 foundations frbr - 2019 versionCataloging101 foundations frbr - 2019 version
Cataloging101 foundations frbr - 2019 version
 
Challenges and opportunities in library discovery services gen
Challenges and opportunities in library discovery services genChallenges and opportunities in library discovery services gen
Challenges and opportunities in library discovery services gen
 
Tech Bits: Taking your mobile photography to the next level
Tech Bits: Taking your mobile photography to the next levelTech Bits: Taking your mobile photography to the next level
Tech Bits: Taking your mobile photography to the next level
 
Copyright, Public Domain & Creative Commons for Educators
Copyright, Public Domain & Creative Commons for Educators Copyright, Public Domain & Creative Commons for Educators
Copyright, Public Domain & Creative Commons for Educators
 
Multimedia, Virtual Reality and 3D Technologies in Higher Ed
Multimedia, Virtual Reality and 3D Technologies in Higher EdMultimedia, Virtual Reality and 3D Technologies in Higher Ed
Multimedia, Virtual Reality and 3D Technologies in Higher Ed
 
Building a digital_library_from_scratch
Building a digital_library_from_scratchBuilding a digital_library_from_scratch
Building a digital_library_from_scratch
 
Digitizing our past
Digitizing our past Digitizing our past
Digitizing our past
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)
 
Cataloging roundtable discussion questions
Cataloging roundtable discussion questionsCataloging roundtable discussion questions
Cataloging roundtable discussion questions
 
Metadata then & now
Metadata then & now Metadata then & now
Metadata then & now
 
Omeka introduction (revised) 2015
Omeka introduction (revised) 2015Omeka introduction (revised) 2015
Omeka introduction (revised) 2015
 
Go google - tips and tricks for getting started with google
Go google - tips and tricks for getting started with googleGo google - tips and tricks for getting started with google
Go google - tips and tricks for getting started with google
 
Organizing your online stuff
Organizing your online stuffOrganizing your online stuff
Organizing your online stuff
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Shared Data & Big Data for Libraries

  • 1. Shared Data: What it Means for the Future of Libraries Robin Fay @georgiawebgurl Head, DBM/Cataloging / UGA Libraries Peter Murray Lyrasis Draft Content for Discussion group 05.01.2013 / robinfay
  • 2. Agenda • Overview of big data ▫ What is big data? What is shared data? ▫ Implications and challenges ▫ Background: Alistair Croll talk at ALA Midwinter http://www.youtube.com/watch?v=Ic_Bl PesEls • Discussion framed around Alistair’s presentation topics Draft Content for Discussion group 04.30.2013
  • 3. How did our data get big? • Technology that has unforeseen consequences • Technology changes. • We leave digital trails wherever we go. • Think> internet browsing history, email, medical records, bank transactions, buying history at shopping sites, Amazon reviews, Facebook photos, comments on websites, and much more. Draft Content for Discussion group 05.01.2013
  • 4. How did our data get big? • “Collectively the data that we leave behind is Big Data. “ • and of course.. There is the data that others (people and machines) create about us. • Big Data is about us and has far reaching consequences. Draft Content for Discussion group 05.01.2013
  • 5. What is Big Data? • It is a not a technology – it is a shift in how we view and use information • Taking large amounts of information spread across many different resources in different formats making them explore • It doesn’t have to be “that big just bigger than what you can go through by hand” Draft Content for Discussion group 05.01.2013
  • 6. 3 attributes of Big Data • Large • Fast (manual time needed) • and unstructured (formats differ) =3 Vs of Big Data Draft Content for Discussion group 05.01.2013
  • 7. Big Data • Relational (relationships) database - our ILS systems are often relational databases • Mathematical database – computations • Big Data is the intersection of two • Health– analyzing health records to identify allergies, sickness, etc • Philanthropy (datakind) – analyze behavior of farmers and knowledge workers to evaluate the impact (ROI) of philanthropic work • Think about potential for library use: we have patron data, bibliographic data and more!
  • 8. Concerns of Big Data • Privacy – erodes privacy potentially leaking private information • Justify stereotypes (data can be misused or used in a negative) and polarize social groups • Facebook open graph search – pulling together information from diverse information to get lists of seemingly innocent ways such as movie watching habits or music can be used in negative ways to reinforce stereotypes or drawn conclusions about people • “Personalization can look like prejudice” • We live in grey areas • Computers do not understand that Draft Content for Discussion group 05.01.2013
  • 9. Which side of the fence? • Big Data is going to change our lives! • Are you • a semantic idealist ? if we can taxonomize and organize it, we can make sense of it ▫ Wolfram Alpha – we can ask it and it will reason (mathematical) • A chaotic nihilist? Algorithms will handle it – correct data will bubble up given enough information ▫ Watson – doesn’t know answers but will analyze to interpret answer Draft Content for Discussion group 05.01.2013
  • 10. So, how would you file a cup of coffee? • Depends upon how you will use the information! • Understandings do not take advantage of digital information which slows semantic idealism – much information not organized so we have to rely algorithms (for now) but it is vunerable. • Tagging is often done by machines – even in libraries we batch load, harvest, update data globally. Draft Content for Discussion group 05.01.2013
  • 11. Humans and technology • Our reasoning can be flawed - we make decisions evolutionary – we look at simple correlations and patterns (false positives) • If comments after a post are highly negative, responders are more likely to take polarizing viewpoints • Even with math is good, data can be wrong Draft Content for Discussion group 05.01.2013
  • 12. Shared data • We are a mosaic of data from other resources • Unified digital history – record of all of our data and could aggregate health information and share with doctors – just one example • Veracity (can verify) and Value (how we can make sense of our data) • Shared data : connecting networks will collect data; algorithms will tag and assign metadata but it will be up to humans to add value - this can then be shared in ways that are useful Draft Content for Discussion group 05.01.2013
  • 13. Linked data makes it possible • Linked data keeps us from having to re-enter or copy information It makes data: • reusable • easy to correct (correct one record instead of multiples) • efficient • and potentially useful to others Draft Content for Discussion group 05.01.2013
  • 14. Linked data makes it possible • It can build relationships in different ways - allowing us to create temporary collections (a user could organize their search results in a way that makes sense to them) or more permanent (collocating ALL works by a particular author more easily; pulling together photographs more easily) • It can help make sense of Big Data and facilitate sharing data.
  • 15. Linked data makes it possible • Linked data keeps us from having to re-enter or copy information It makes data: • reusable • easy to correct (correct one record instead of multiples) • efficient • and potentially useful to others
  • 16. Thinking of data in the library environment • Automation and new technologies • The web has changed • Large scale bibliographic databases • User expectations and needs • Patron data • Cooperative cataloging • Greater variety of media in library collections (electronic!) • FRBR is our data model – semantic web friendly! Draft Content for Discussion group 05.01.2013
  • 17. Discussion points • Obviously, WorldCat is a shared data resource we have all been using for years. What are some other examples of big data, shared data, or linked data that libraries use now? 2. What are some examples of data that libraries could share that we aren't sharing already? 3. What are some of the pitfalls of data sharing on a massive scale? Draft Content for Discussion group 05.01.2013

Editor's Notes

  1. Did we get the internet exactly wrong? We started out using wired connections for local communications and satellite for long distance; we now rely on satellites and wifi for long distance.
  2. Did we get the internet exactly wrong? We started out using wired connections for local communications and satellite for long distance; we now rely on satellites and wifi for long distance.
  3. Did we get the internet exactly wrong? We started out using wired connections for local communications and satellite for long distance; we now rely on satellites and wifi for long distance.
  4. Robin is apparently a semantic idealist!