Data visualisations as a gateway
        to programming
           Mia Ridge @mia_out
        THATCamp Feminisms West
  Scripps College, California, March 2013
AKA: a whirlwind tour of data
visualisation

(and some bits to tempt you into playing
with code)
‘Start small, make things, and
then when you’re done, make
some more things.’
Jake Levine, http://www.niemanlab.org/2013/03/jake-levine-why-learning-to-code-isnt-as-important-as-learning-to-build-something/
Probably impossible things
• Asking a question that’s actually stupid
• Breaking the computer
Some points about code
• Computers are annoyingly pedantic
• Scripting isn't rocket science (but it is 'hard
  fun')
Overview
•   What is data visualisation?
•   Tools and types of visualisations
•   A bit of programming jargon
•   Activity options: play with data in ManyEyes
    or tweak timeline/map code to try basic
    programming
Registering with Many Eyes
• In your browser, go to
  http://www-958.ibm.com/software/analytics/manyeyes/register
  and register for a Many Eyes account
  – Check your email to make sure the registration has
    come through for later use

• A dataset is already loaded into Many Eyes for you
  to experiment with, but you may want to create
  tweaked versions of it to achieve particular effects
Who am I?




http://bit.ly/13JqWpr Tool from http://neatline.org/
Who are you?
• One sentence on your interest in data
  visualisation: do you have any potential uses
  in mind?
What is data visualisation?
• '…the graphical display of abstract information for
  two purposes: sense-making (also called data
  analysis) and communication' (Stephen Few)
• '…showing quantitative and qualitative
  information so that a viewer can see
  patterns, trends, or anomalies, constancy or
  variation, in ways that other forms – text and
  tables – do not allow.' (Michael Friendly)
• '…interactive, visual representations of abstract
  data to amplify cognition' (Card et al., 1999)
Scholarly data visualisations
• Visualisations as ‘distant reading’ where
  distance is ‘a specific form of knowledge:
  fewer elements, hence a sharper sense of
  their overall interconnection’ (Moretti, 2005)
• Inspiring curiosity and research questions
• But - what do they leave out?
Types of visualisations
• Different types of data in: quantitative,
  qualitative, geographic, time series, entities
  (people, places, events, concepts, things)
• Static, interactive
• Exploratory, explanatory: find new insights, or
  tell a story?
• Pragmatic, analytic? Abstract, emotive?
• http://infosthetics.com/archives/infovis/
Visualisation types in Many Eyes




http://www-958.ibm.com/software/analytics/manyeyes/page/Visualization_Options.html
Considerations for humanities data
• Commercial tools often assume
  complete, born-digital datasets – no missing
  fields, consistent data entry over time
• Humanities and GLAM
  (galleries, libraries, museums, archives)
  records contain uncertainty and fuzziness (e.g.
  date ranges, uncertain places, creators, etc)
Messiness in data
• 'Begun in Kiryu, Japan, finished in France'
• 'Bali? Java? Mexico?'
• Variations on USA:
    –   U.S.
    –   U.S.A
    –   U.S.A.
    –   USA
    –   United States of America
    –   USA ?
    –   United States (case)
• Inconsistency in uncertainty
    – U.S.A. or England
    – U.S.A./England ?
    – England & U.S.A.
Computers don't cope
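One way to picture why computers don't cope, and what a first pass at coping might look like, is the rough sketch below. It is written in Python purely for illustration; the lookup table, the canonical label and the function name are assumptions, not part of the workshop materials. Values the table does not recognise are left untouched for a person to review.

```python
import re

# Hypothetical lookup table; the canonical label is an arbitrary choice.
CANONICAL = {
    "us": "United States of America",
    "usa": "United States of America",
    "united states": "United States of America",
    "united states of america": "United States of America",
}

def normalise_country(raw):
    """Return (name, is_uncertain) for one messy place value."""
    # A question mark anywhere is treated as a sign of uncertainty.
    uncertain = "?" in raw
    # Drop question marks and full stops, collapse spaces, ignore case.
    key = re.sub(r"[?.]", "", raw)
    key = re.sub(r"\s+", " ", key).strip().lower()
    # Unknown values are returned as they were, for manual review.
    return CANONICAL.get(key, raw.strip()), uncertain

variants = ["U.S.", "U.S.A", "U.S.A.", "USA",
            "United States of America", "USA ?", "united states"]
cleaned = [normalise_country(v) for v in variants]
for original, (name, uncertain) in zip(variants, cleaned):
    print(original, "->", name, "(uncertain)" if uncertain else "")
```

Keeping the uncertainty as a separate flag, rather than deleting the question mark and forgetting it, is a design choice: it lets a visualisation show or filter doubtful records later.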
Cleaning data for visualisations
Humanities data often needs manual cleaning to:
• remove rows where vital information is missing
• tidy inconsistencies in term lists or spelling
• convert words to numbers (e.g. dates)
• remove hard returns and non-ASCII characters (or
  change data format)
• split multiple values in one field into other
  columns (e.g. author name, date in one field)
• expand coded values (e.g. countries, language)
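Several of the cleaning steps above can be sketched in a few lines of code. The example below is a minimal illustration in Python, not the method used in the workshop; the sample rows, the field names and the code table are all invented.

```python
# Invented sample rows; the field names are hypothetical.
rows = [
    {"creator": "Jane Example, 1901", "country": "GB"},
    {"creator": "", "country": "US"},  # vital information missing
    {"creator": "Mary Sample, 1925", "country": "US"},
]

# Hypothetical table for expanding coded values.
COUNTRY_CODES = {"GB": "United Kingdom", "US": "United States of America"}

def clean(rows):
    cleaned = []
    for row in rows:
        creator = row["creator"].strip()
        if not creator:
            continue  # remove rows where vital information is missing
        # Split "name, year" in one field into two separate columns.
        name, _, year = creator.rpartition(",")
        cleaned.append({
            "name": name.strip(),
            # Convert text to a number; raises ValueError if the year
            # is not numeric, which flags the row for manual attention.
            "year": int(year),
            # Expand coded values, keeping unknown codes as they are.
            "country": COUNTRY_CODES.get(row["country"], row["country"]),
        })
    return cleaned

result = clean(rows)
for row in result:
    print(row)
```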
What other data can you join to yours?
• Information from general sites like
  Wikipedia, Freebase, VIAF
• Information from other GLAMs
• Other information about the same
  event, place, person, object, etc
• General contextualising information –
  science, history, reviews, citations?
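Joining usually depends on both sources sharing an identifier. The sketch below, in Python, assumes a hypothetical shared "id" field and invented records; real sources such as Wikipedia or VIAF have their own identifiers and formats, which are not shown here.

```python
# Invented records from your own dataset, keyed by a shared identifier.
own_records = [
    {"id": "p1", "name": "Jane Example"},
    {"id": "p2", "name": "Mary Sample"},
]

# Invented extra information from another source, keyed the same way.
external = {"p1": {"birthplace": "Exampleton"}}

def join(records, extra):
    """Keep every record and add extra fields where a match exists."""
    return [{**rec, **extra.get(rec["id"], {})} for rec in records]

joined = join(own_records, external)
for rec in joined:
    print(rec)
```

Records without a match are kept rather than dropped, so gaps in the other source stay visible instead of silently shrinking your data.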
Dealing with complex data
• Find a visualisation type that can harbour the
  data in a meaningful way or reduce the data in
  a meaningful way.
  – e.g. go from individual values to distribution of
    values
  – e.g. introduce interaction: overview, zoom and
    filter, details on demand (Ben Shneiderman)
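Going from individual values to a distribution of values can be as simple as counting how many values fall into each group. The sketch below, in Python, groups invented years by decade and prints a rough text bar chart; the grouping by decade is an assumption chosen for illustration.

```python
from collections import Counter

# Invented sample values standing in for one date per record.
years = [1843, 1847, 1852, 1858, 1859, 1901]

def by_decade(years):
    """Count how many values fall in each decade."""
    return Counter((year // 10) * 10 for year in years)

distribution = by_decade(years)
for decade in sorted(distribution):
    # One # per record gives a quick overview of the distribution.
    print(decade, "#" * distribution[decade])
```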
Visualisation tools




http://selection.datavisualization.ch/
IBM Many Eyes
SIMILE example
• Data: http://bit.ly/ZwH6iy
• http://modernbluestockings.org.uk/Simile/Simileexample.html
Programming concepts
Variables and comments
• Variables: containers that store things
• Comments: leave messages for other programmers;
  the computer can't see them
• Operators: small, simple bits of functionality
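The three ideas above fit into a handful of lines. This sketch is written in Python for illustration rather than in the language of the workshop exercise files, and the values are invented.

```python
# This line is a comment: a note for people, skipped by the computer.
title = "Inspiring Women"  # a variable holding text
start_year = 1843          # a variable holding a number (invented value)
end_year = 1901            # another invented value

span = end_year - start_year                 # the - operator subtracts
label = title + ": " + str(span) + " years"  # the + operator joins text
print(label)
```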
Getting unstuck
• Try copy/pasting or typing the error message into
  Google.
• Make different versions as you go, use software
  to compare two versions of a file
• Asking for help: what steps would someone need
  to take to reproduce the problem? What did you
  expect the output to be and what happened
  instead?
• Most browsers have built-in tools to help you
  debug JavaScript.
Getting unstuck
• Make a copy of the exercise file first so you can
  always compare with one that works
• If it breaks or doesn't work:
   – Check that “quotes’ and {brackets) are matched
   – Check that any named thing is spelt consistently
   – Check upper/lower case
   – Ask the person next to you (sometimes explaining it
     helps you spot the issue)
   – If the last version works, use software to compare two
     versions of a file
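Comparing two versions of a file does not need special software if Python is installed: its standard difflib module can do it. The sketch below compares two invented snippets held in lists; with real files you would read the lines from disk instead. The snippet contents are hypothetical.

```python
import difflib

# Two invented versions of the same two-line file.
working = ['var title = "Timeline";', "var year = 1900;"]
broken = ['var title = "Timeline\';', "var year = 1900;"]

# Lines starting with - are only in the working version,
# lines starting with + are only in the broken one.
diff = list(difflib.unified_diff(
    working, broken, fromfile="working", tofile="broken", lineterm=""))
print("\n".join(diff))
```

Here the output points straight at the first line, where the closing quote no longer matches the opening one.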
Visualising ‘Inspiring Women’
• ManyEyes – online tool, no code required
• SIMILE – start with a working example, read
  through the commented code and try the
  exercises listed in the comments
‘Inspiring Women’ in ManyEyes
• Log into ManyEyes
• Go to http://ibm.co/ZP7UKI – visualisation
  options available from there
• Choose a type of visualisation and evaluate
  the results
  – What cleaning, extra data or transformation might
    be needed?
  – You may need to iterate with different versions of
    the data from http://bit.ly/ZwH6iy
Review: visualisation tools
• What did the tools you tried do well? Poorly?
• Were the tool and the data a good match for
  each other?
• Which tools might be useful in the future?
‘Start small, make things, and then
 when you’re done, make some more
                 things.’

Some links: http://bit.ly/UJwgEz

Thank you!
Mia Ridge, Open University
http://openobjects.org.uk/
@mia_out

More Related Content

What's hot

The Social Web and Cultural Heritage
The Social Web and Cultural Heritage The Social Web and Cultural Heritage
The Social Web and Cultural Heritage Julita Vassileva
 
Museums and Digital Technologies
Museums and Digital TechnologiesMuseums and Digital Technologies
Museums and Digital TechnologiesEd Rodley
 
Digital technology and museum collections
Digital technology and museum collections Digital technology and museum collections
Digital technology and museum collections Fiona Marshall
 
Digitalstorytelling and education: an introduction
Digitalstorytelling and education: an introductionDigitalstorytelling and education: an introduction
Digitalstorytelling and education: an introductionBryan Alexander
 
Library time bomb - CPX
Library time bomb - CPXLibrary time bomb - CPX
Library time bomb - CPXHamish Curry
 
Diversity of Wikipedia
Diversity of WikipediaDiversity of Wikipedia
Diversity of WikipediaIlona Buchem
 
P2PU School of Webcraft - SXSW 2011
P2PU School of Webcraft - SXSW 2011P2PU School of Webcraft - SXSW 2011
P2PU School of Webcraft - SXSW 2011johndbritton
 
Privacy and libraries
Privacy and librariesPrivacy and libraries
Privacy and librariesDorothea Salo
 
Evaluating Social Media: American Association of Museums (AAM) 2010
Evaluating Social Media: American Association of Museums (AAM) 2010Evaluating Social Media: American Association of Museums (AAM) 2010
Evaluating Social Media: American Association of Museums (AAM) 2010Dana Allen-Greil
 
Courage of our Connections
Courage of our ConnectionsCourage of our Connections
Courage of our ConnectionsRachel Frick
 

What's hot (20)

Generous Interfaces Evaluated
Generous Interfaces EvaluatedGenerous Interfaces Evaluated
Generous Interfaces Evaluated
 
The Social Web and Cultural Heritage
The Social Web and Cultural Heritage The Social Web and Cultural Heritage
The Social Web and Cultural Heritage
 
Museums and Digital Technologies
Museums and Digital TechnologiesMuseums and Digital Technologies
Museums and Digital Technologies
 
One Big Library
One Big LibraryOne Big Library
One Big Library
 
2013-07-15 AMM Session: The Digital Road Ahead
2013-07-15 AMM Session: The Digital Road Ahead2013-07-15 AMM Session: The Digital Road Ahead
2013-07-15 AMM Session: The Digital Road Ahead
 
Digital technology and museum collections
Digital technology and museum collections Digital technology and museum collections
Digital technology and museum collections
 
2012 Nelson-Atkins Tech Summit
2012 Nelson-Atkins Tech Summit2012 Nelson-Atkins Tech Summit
2012 Nelson-Atkins Tech Summit
 
Digitalstorytelling and education: an introduction
Digitalstorytelling and education: an introductionDigitalstorytelling and education: an introduction
Digitalstorytelling and education: an introduction
 
Mobile Social Media
Mobile Social MediaMobile Social Media
Mobile Social Media
 
Verso AAM Webinar Presentation v5
Verso AAM Webinar Presentation v5Verso AAM Webinar Presentation v5
Verso AAM Webinar Presentation v5
 
Library time bomb - CPX
Library time bomb - CPXLibrary time bomb - CPX
Library time bomb - CPX
 
Diversity of Wikipedia
Diversity of WikipediaDiversity of Wikipedia
Diversity of Wikipedia
 
Makerspaces2
Makerspaces2Makerspaces2
Makerspaces2
 
Paying for it
Paying for itPaying for it
Paying for it
 
P2PU School of Webcraft - SXSW 2011
P2PU School of Webcraft - SXSW 2011P2PU School of Webcraft - SXSW 2011
P2PU School of Webcraft - SXSW 2011
 
PandoraRevised
PandoraRevisedPandoraRevised
PandoraRevised
 
Privacy and libraries
Privacy and librariesPrivacy and libraries
Privacy and libraries
 
Evaluating Social Media: American Association of Museums (AAM) 2010
Evaluating Social Media: American Association of Museums (AAM) 2010Evaluating Social Media: American Association of Museums (AAM) 2010
Evaluating Social Media: American Association of Museums (AAM) 2010
 
2013-05-16 D Hegley Keynote Summit@Summit
2013-05-16 D Hegley Keynote Summit@Summit2013-05-16 D Hegley Keynote Summit@Summit
2013-05-16 D Hegley Keynote Summit@Summit
 
Courage of our Connections
Courage of our ConnectionsCourage of our Connections
Courage of our Connections
 

Viewers also liked

відкриття малих олімпійських ігор 2015
відкриття малих олімпійських ігор 2015відкриття малих олімпійських ігор 2015
відкриття малих олімпійських ігор 2015jekah
 
cfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and Grafana
cfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and Grafanacfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and Grafana
cfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and GrafanaArthur Lutz
 
Biorremediação
BiorremediaçãoBiorremediação
BiorremediaçãoRuan Reis
 
toni presentacion
toni presentaciontoni presentacion
toni presentaciontonican86
 
Restore Data from Damaged or Inaccessible CF Card
Restore Data from Damaged or Inaccessible CF CardRestore Data from Damaged or Inaccessible CF Card
Restore Data from Damaged or Inaccessible CF CardCard Data Recovery
 
MICE Presentation - Interlaken Congress & Events
MICE Presentation - Interlaken Congress & EventsMICE Presentation - Interlaken Congress & Events
MICE Presentation - Interlaken Congress & EventsMICEboard
 
A Ta Boutique - Startup Weekend Brest
A Ta Boutique - Startup Weekend BrestA Ta Boutique - Startup Weekend Brest
A Ta Boutique - Startup Weekend BrestAnDaolVras
 
EVALUACION REPRODUCCION
EVALUACION REPRODUCCIONEVALUACION REPRODUCCION
EVALUACION REPRODUCCIONcursocecam07
 

Viewers also liked (16)

Ghtptynfwbz
GhtptynfwbzGhtptynfwbz
Ghtptynfwbz
 
відкриття малих олімпійських ігор 2015
відкриття малих олімпійських ігор 2015відкриття малих олімпійських ігор 2015
відкриття малих олімпійських ігор 2015
 
cfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and Grafana
cfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and Grafanacfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and Grafana
cfgmgmtcamp 2016 - Roll out active Supervision with Salt, Graphite and Grafana
 
Etkinlik 1
Etkinlik 1Etkinlik 1
Etkinlik 1
 
Biorremediação
BiorremediaçãoBiorremediação
Biorremediação
 
toni presentacion
toni presentaciontoni presentacion
toni presentacion
 
Minőség
MinőségMinőség
Minőség
 
Use LxR
Use LxRUse LxR
Use LxR
 
Restore Data from Damaged or Inaccessible CF Card
Restore Data from Damaged or Inaccessible CF CardRestore Data from Damaged or Inaccessible CF Card
Restore Data from Damaged or Inaccessible CF Card
 
gci presentation
gci presentationgci presentation
gci presentation
 
20151215 hirosue draft ideas
20151215 hirosue draft ideas20151215 hirosue draft ideas
20151215 hirosue draft ideas
 
MICE Presentation - Interlaken Congress & Events
MICE Presentation - Interlaken Congress & EventsMICE Presentation - Interlaken Congress & Events
MICE Presentation - Interlaken Congress & Events
 
Medios de transmisión
Medios de transmisiónMedios de transmisión
Medios de transmisión
 
A Ta Boutique - Startup Weekend Brest
A Ta Boutique - Startup Weekend BrestA Ta Boutique - Startup Weekend Brest
A Ta Boutique - Startup Weekend Brest
 
EVALUACION REPRODUCCION
EVALUACION REPRODUCCIONEVALUACION REPRODUCCION
EVALUACION REPRODUCCION
 
Contenidos_Taller_de_Capacitacion
Contenidos_Taller_de_CapacitacionContenidos_Taller_de_Capacitacion
Contenidos_Taller_de_Capacitacion
 

Similar to Data visualisations as a gateway to programming

Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectbodaceacat
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSara-Jayne Terp
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data ScienceThinkful
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Thinkful
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Thinkful
 
MPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for AnalysisMPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for AnalysisShawn Day
 
116 Machine learning for Product Managers
116   Machine learning for Product Managers116   Machine learning for Product Managers
116 Machine learning for Product ManagersProductCamp Boston
 
Machine learning for product managers. Presented at Boston ProductCamp (June...
Machine learning for product  managers. Presented at Boston ProductCamp (June...Machine learning for product  managers. Presented at Boston ProductCamp (June...
Machine learning for product managers. Presented at Boston ProductCamp (June...Mukund Seshadri
 
Visual and interactive storytelling slides cmg 2015-final
Visual and interactive storytelling slides    cmg 2015-finalVisual and interactive storytelling slides    cmg 2015-final
Visual and interactive storytelling slides cmg 2015-finalKatherine-CWACanada
 
Trendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sourcesTrendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sourcesMarieke Guy
 
Creating & managing your scholarly web presence
Creating & managing your scholarly web presenceCreating & managing your scholarly web presence
Creating & managing your scholarly web presenceRebecca Kate Miller
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisOpen Analytics
 
Accessible Next Level Visualizations
Accessible Next Level VisualizationsAccessible Next Level Visualizations
Accessible Next Level VisualizationsTed Gies
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantLynne Thomas
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)Thinkful
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)Thinkful
 
Introduction to Information Architecture & Design - 12/06/14
Introduction to Information Architecture & Design - 12/06/14Introduction to Information Architecture & Design - 12/06/14
Introduction to Information Architecture & Design - 12/06/14Robert Stribley
 
Thinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCThinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCTJ Stalcup
 
Teaching with Technology
Teaching with TechnologyTeaching with Technology
Teaching with TechnologyDavid Bond
 

Similar to Data visualisations as a gateway to programming (20)

Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
MPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for AnalysisMPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for Analysis
 
116 Machine learning for Product Managers
116   Machine learning for Product Managers116   Machine learning for Product Managers
116 Machine learning for Product Managers
 
Machine learning for product managers. Presented at Boston ProductCamp (June...
Machine learning for product  managers. Presented at Boston ProductCamp (June...Machine learning for product  managers. Presented at Boston ProductCamp (June...
Machine learning for product managers. Presented at Boston ProductCamp (June...
 
Visual and interactive storytelling slides cmg 2015-final
Visual and interactive storytelling slides    cmg 2015-finalVisual and interactive storytelling slides    cmg 2015-final
Visual and interactive storytelling slides cmg 2015-final
 
Trendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sourcesTrendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sources
 
Creating & managing your scholarly web presence
Creating & managing your scholarly web presenceCreating & managing your scholarly web presence
Creating & managing your scholarly web presence
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysis
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
 
Accessible Next Level Visualizations
Accessible Next Level VisualizationsAccessible Next Level Visualizations
Accessible Next Level Visualizations
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
 
Introduction to Information Architecture & Design - 12/06/14
Introduction to Information Architecture & Design - 12/06/14Introduction to Information Architecture & Design - 12/06/14
Introduction to Information Architecture & Design - 12/06/14
 
Thinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCThinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DC
 
Teaching with Technology
Teaching with TechnologyTeaching with Technology
Teaching with Technology
 

More from Mia

Living with Machines year two update
Living with Machines year two updateLiving with Machines year two update
Living with Machines year two updateMia
 
Rethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryRethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryMia
 
Living with Machines: one year in
Living with Machines: one year inLiving with Machines: one year in
Living with Machines: one year inMia
 
Festival of Maintenance talk: Apps, microsites and collections online: innova...
Festival of Maintenance talk: Apps, microsites and collections online: innova...Festival of Maintenance talk: Apps, microsites and collections online: innova...
Festival of Maintenance talk: Apps, microsites and collections online: innova...Mia
 
Operationalising AI at a national library
Operationalising AI at a national libraryOperationalising AI at a national library
Operationalising AI at a national libraryMia
 
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...Mia
 
In search of the sweet spot: infrastructure at the intersection of cultural h...
In search of the sweet spot: infrastructure at the intersection of cultural h...In search of the sweet spot: infrastructure at the intersection of cultural h...
In search of the sweet spot: infrastructure at the intersection of cultural h...Mia
 
Living with Machines at The Past, Present and Future of Digital Scholarship w...
Living with Machines at The Past, Present and Future of Digital Scholarship w...Living with Machines at The Past, Present and Future of Digital Scholarship w...
Living with Machines at The Past, Present and Future of Digital Scholarship w...Mia
 
Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...Mia
 
A modest proposal: crowdsourcing in cultural heritage benefits us all.
A modest proposal: crowdsourcing in cultural heritage benefits us all.A modest proposal: crowdsourcing in cultural heritage benefits us all.
A modest proposal: crowdsourcing in cultural heritage benefits us all.Mia
 
Crowdsourcing at the British Library: lessons learnt and future directions
Crowdsourcing at the British Library: lessons learnt and future directionsCrowdsourcing at the British Library: lessons learnt and future directions
Crowdsourcing at the British Library: lessons learnt and future directionsMia
 
Crowdsourcing 'In the Spotlight' at the British Library
Crowdsourcing 'In the Spotlight' at the British LibraryCrowdsourcing 'In the Spotlight' at the British Library
Crowdsourcing 'In the Spotlight' at the British LibraryMia
 
Crowdsourcing: the British Library experience
Crowdsourcing: the British Library experienceCrowdsourcing: the British Library experience
Crowdsourcing: the British Library experienceMia
 
Chair's welcome, MCG's Museums+Tech 2017
Chair's welcome, MCG's Museums+Tech 2017Chair's welcome, MCG's Museums+Tech 2017
Chair's welcome, MCG's Museums+Tech 2017Mia
 
Historical thinking in crowdsourcing and citizen history projects
Historical thinking in crowdsourcing and citizen history projectsHistorical thinking in crowdsourcing and citizen history projects
Historical thinking in crowdsourcing and citizen history projectsMia
 
Cross-sector collaboration for digital museum and library projects
Cross-sector collaboration for digital museum and library projectsCross-sector collaboration for digital museum and library projects
Cross-sector collaboration for digital museum and library projectsMia
 
Connected heritage: How should Cultural Institutions Open and Connect Data?
Connected heritage: How should Cultural Institutions Open and Connect Data?Connected heritage: How should Cultural Institutions Open and Connect Data?
Connected heritage: How should Cultural Institutions Open and Connect Data?Mia
 
Wish upon a star: making crowdsourcing in cultural heritage a reality
Wish upon a star: making crowdsourcing in cultural heritage a realityWish upon a star: making crowdsourcing in cultural heritage a reality
Wish upon a star: making crowdsourcing in cultural heritage a realityMia
 
Doing Digital Research @ British Library
Doing Digital Research @ British LibraryDoing Digital Research @ British Library
Doing Digital Research @ British LibraryMia
 
Digitised Manuscripts and the British Library's new IIIF viewer
Digitised Manuscripts and the British Library's new IIIF viewer Digitised Manuscripts and the British Library's new IIIF viewer
Digitised Manuscripts and the British Library's new IIIF viewer Mia
 

More from Mia (20)

Living with Machines year two update
Living with Machines year two updateLiving with Machines year two update
Living with Machines year two update
 
Rethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryRethink research, illuminate history with the British Library
Rethink research, illuminate history with the British Library
 
Living with Machines: one year in
Living with Machines: one year inLiving with Machines: one year in
Living with Machines: one year in
 
Festival of Maintenance talk: Apps, microsites and collections online: innova...
Festival of Maintenance talk: Apps, microsites and collections online: innova...Festival of Maintenance talk: Apps, microsites and collections online: innova...
Festival of Maintenance talk: Apps, microsites and collections online: innova...
 
Operationalising AI at a national library
Operationalising AI at a national libraryOperationalising AI at a national library
Operationalising AI at a national library
 
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...
 
In search of the sweet spot: infrastructure at the intersection of cultural h...
In search of the sweet spot: infrastructure at the intersection of cultural h...In search of the sweet spot: infrastructure at the intersection of cultural h...
In search of the sweet spot: infrastructure at the intersection of cultural h...
 
Living with Machines at The Past, Present and Future of Digital Scholarship w...
Living with Machines at The Past, Present and Future of Digital Scholarship w...Living with Machines at The Past, Present and Future of Digital Scholarship w...
Living with Machines at The Past, Present and Future of Digital Scholarship w...
 
Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...
 
A modest proposal: crowdsourcing in cultural heritage benefits us all.
A modest proposal: crowdsourcing in cultural heritage benefits us all.A modest proposal: crowdsourcing in cultural heritage benefits us all.
A modest proposal: crowdsourcing in cultural heritage benefits us all.
 
Crowdsourcing at the British Library: lessons learnt and future directions

Data visualisations as a gateway to programming

  • 1. Data visualisations as a gateway to programming Mia Ridge @mia_out THATCamp Feminisms West Scripps College, California, March 2013
  • 2. AKA: a whirlwind tour of data visualisation (and some bits to tempt you into playing with code)
  • 3. ‘Start small, make things, and then when you’re done, make some more things.’ Jake Levine, http://www.niemanlab.org/2013/03/jake-levine-why-learning-to-code-isnt-as-important-as-learning-to-build-something/
  • 4. Probably impossible things • Asking a question that’s actually stupid • Breaking the computer
  • 5. Some points about code • Computers are annoyingly pedantic • Scripting isn't rocket science (but it is 'hard fun')
  • 6. Overview • What is data visualisation? • Tools and types of visualisations • A bit of programming jargon • Activity options: play with data in ManyEyes or tweak timeline/map code to try basic programming
  • 7. Registering with Many Eyes • In your browser, go to http://www-958.ibm.com/software/analytics/manyeyes/register and register for a Many Eyes account – Check your email to make sure the registration has come through for later use • There’s a dataset loaded into ManyEyes that you can try different things with, but you might find that you want to tweak new versions to achieve particular effects
  • 8. Who am I? http://bit.ly/13JqWpr Tool from http://neatline.org/
  • 9. Who are you? • One sentence on your interest in data visualisation, do you have any potential uses in mind?
  • 10. What is data visualisation? • '…the graphical display of abstract information for two purposes: sense-making (also called data analysis) and communication' (Stephen Few) • '…showing quantitative and qualitative information so that a viewer can see patterns, trends, or anomalies, constancy or variation, in ways that other forms – text and tables – do not allow.' (Michael Friendly) • '…interactive, visual representations of abstract data to amplify cognition' (Card et al., 1999)
  • 11. Scholarly data visualisations • Visualisations as ‘distant reading’ where distance is ‘a specific form of knowledge: fewer elements, hence a sharper sense of their overall interconnection’ (Moretti, 2005) • Inspiring curiosity and research questions • But - what do they leave out?
  • 12. Types of visualisations • Different types of data in: quantitative, qualitative, geographic, time series, entities (people, places, events, concepts, things) • Static, interactive • Exploratory, explanatory: find new insights, or tell a story? • Pragmatic, analytic? Abstract, emotive? • http://infosthetics.com/archives/infovis/
  • 13. Visualisation types in Many Eyes http://www-958.ibm.com/software/analytics/manyeyes/page/Visualization_Options.html
  • 14. Considerations for humanities data • Commercial tools often assume complete, born-digital datasets – no missing fields, consistent data entry over time • Humanities and GLAM (galleries, libraries, museums, archives) records contain uncertainty and fuzziness (e.g. date ranges, uncertain places, creators, etc)
  • 15. Messiness in data • 'Begun in Kiryu, Japan, finished in France' • 'Bali? Java? Mexico?' • Variations on USA: – U.S. – U.S.A – U.S.A. – USA – United States of America – USA ? – United States (case) • Inconsistency in uncertainty – U.S.A. or England – U.S.A./England ? – England & U.S.A.
  • 17. Cleaning data for visualisations Humanities data often needs manual cleaning to: • remove rows where vital information is missing • tidy inconsistencies in term lists or spelling • convert words to numbers (e.g. dates) • remove hard returns and non-ASCII characters (or change data format) • split multiple values in one field into other columns (e.g. author name, date in one field) • expand coded values (e.g. countries, language)
  • 18. What other data can you join to yours? • Information from general sites like Wikipedia, Freebase, VIAF • Information from other GLAMs • Other information about the same event, place, person, object, etc • General contextualising information – science, history, reviews, citations?
  • 19. Dealing with complex data • Find a visualisation type that can harbour the data in a meaningful way or reduce the data in a meaningful way. – e.g. go from individual values to distribution of values – e.g. introduce interaction: overview, zoom and filter, details on demand (Ben Shneiderman)
  • 22. SIMILE example • Data: http://bit.ly/ZwH6iy • http://modernbluestockings.org.uk/Simile/Simileexample.html
  • 24. Variables and comments • Variables: containers that store things • Comments: leave messages for other programmers; the computer can't see them • Operators: small, simple bits of functionality
  • 25. Getting unstuck • Try copy/pasting or typing the error message into Google. • Make different versions as you go, use software to compare two versions of a file • Asking for help: what steps would someone need to take to reproduce the problem? What did you expect the output to be and what happened instead? • Most browsers have built-in tools to help you debug JavaScript.
  • 26. Getting unstuck • Make a copy of the exercise file first so you can always compare with one that works • If it breaks or doesn't work: – Check that “quotes’ and {brackets) are matched – Check that any named thing is spelt consistently – Check upper/lower case – Ask the person next to you (sometimes explaining it helps you spot the issue) – If the last version works, use software to compare two versions of a file
  • 27. Visualising ‘Inspiring Women’ • ManyEyes – online tool, no code required • SIMILE – start with a working example, read through the commented code and try the exercises listed in the comments
  • 28. ‘Inspiring Women’ in ManyEyes • Log into ManyEyes • Go to http://ibm.co/ZP7UKI – visualisation options available from there • Choose a type of visualisation and evaluate the results – What cleaning, extra data or transformation might be needed? – You may need to iterate with different versions of the data from http://bit.ly/ZwH6iy
  • 29. Review: visualisation tools • What did the tools you tried do well? Poorly? • Were the tool and the data a good match for each other? • Which tools might be useful in the future?
  • 30. ‘Start small, make things, and then when you’re done, make some more things.’ Some links: http://bit.ly/UJwgEz Thank you! Mia Ridge, Open University http://openobjects.org.uk/ @mia_out
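
The 'Messiness in data' and 'Cleaning data for visualisations' slides can be made concrete with a small sketch. It is written in Python rather than the JavaScript used in the workshop exercises, and it is only an illustration, not the workshop's own code: the lookup table `CANONICAL` and the function `normalise_country` are hypothetical names, and the choice of 'United States' as the preferred label is an assumption. It also shows the jargon from the 'Variables and comments' slide in use: variables store values, comments (lines starting with `#`) are notes for human readers, and operators such as `in` and `=` do small jobs.

```python
import re
from collections import Counter

# Hypothetical lookup table: tidied-up spelling -> preferred label.
CANONICAL = {
    "us": "United States",
    "usa": "United States",
    "united states": "United States",
    "united states of america": "United States",
}


def normalise_country(raw):
    """Return (label, uncertain) for one messy country value."""
    # Assumption: a question mark anywhere in the value signals uncertainty.
    uncertain = "?" in raw
    # Lower-case the text, then drop full stops and question marks.
    key = re.sub(r"[.?]", "", raw.lower())
    # Collapse runs of spaces and trim the ends.
    key = " ".join(key.split())
    # Values not in the table are passed through unchanged for manual review.
    label = CANONICAL.get(key, raw.strip())
    return label, uncertain


# The 'Variations on USA' values from the slide.
variants = ["U.S.", "U.S.A", "U.S.A.", "USA",
            "United States of America", "USA ?", "United States"]
cleaned = [normalise_country(value) for value in variants]

# Going from individual values to a distribution of values.
counts = Counter(label for label, _ in cleaned)
print(counts)
```

Values such as 'U.S.A. or England' are deliberately left alone by this sketch: deciding what they mean is a research question, not a spelling fix.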

Editor's Notes

  1. Learn the basics of programming by fiddling with existing visualisations and prepared exercises. Background: participants will be thinking about how to structure data for use in software, learning basic programming concepts, and moving towards tinkering with scripts. This is a great workshop for humanists who want a friendly intro to the world of programming. Find out more at http://www.miaridge.com/resources-for-data-visualisation-for-analysis-in-scholarly-research/
  2. This is the “don't be scared” slide! Computers are really picky about spelling, white space, matching quote marks, how sentences end... Think of your most pedantic friend, and multiply that by 1000. It's like dealing with a grumpy six-year-old - it might be tricky to negotiate, but it's not going to kill either of you. Thinking computationally is like cooking a few courses for a fancy dinner party – you learn what needs to be prepped in advance or just before serving, which steps must be done in a particular order and what can be done at any time. Hard fun – the phrase comes from gaming – when something is challenging it's even more rewarding when you finally crack it. A lot of my 'don't be scared' message is aimed at getting you over those first hurdles and into the rewarding stuff. Persistence (or stubbornness) is one of the key characteristics of a good programmer. The process of finding a path through something you're still figuring out is something programmers and researchers have in common.
  3. Short workshop, leaving loads out – have prepared two routes you can go – one is using pre-made data in a tool called ManyEyes to learn about how different types of visualisations work, the other is about loading up a page that will draw a timeline based on data in a Google Spreadsheet, and playing with bits of code to start to learn how it all comes together on a web page. When you’re working with your own data, about 80% of your time is spent massaging it into shape. Researching data also takes a long time – several evenings spent putting together this list, and it’s nowhere near complete and lots of values are still missing. Starts to bring in questions about writing history – it’s not like working with born-digital scientific etc. datasets. There’s a bit of me talking at the start, but I want to let you get stuck into trying things out as soon as possible. This does mean it’s up to you to get the most out of it – ask questions, let me know when you get stuck, follow your own curiosity in thinking about what to try in the time. Knowing your way around a browser will help but no hardcore technical skills are required. Making good visualisations takes time, but I hope you’ll get a taste of what can be done.
  4. You can load this and have a play while I talk. I created this as an excuse to play with software called Neatline that’s designed for hand-crafted visualisations with maps and timelines. One nice thing about this is that it illustrates how far some technical skills can take you – and it’s not all about code, some of it has a big overlap with things like design and library science. Currently PhD student in Digital Humanities in the Department of History, Open University. PhD and MSc (Human-Computer Interaction) research on crowdsourcing. Call myself a cultural heritage technologist (Science Museum, Museum of London, Melbourne Museum) because it encompasses my background as programmer and business analyst, my later interest in user experience design and research, and now my Digital Humanities research.
  5. Data visualisation is about creating insight, or the formation of a mental model – a new way of thinking about data. Few, Stephen. 2013. ‘Data Visualization for Human Perception’. Ed. Mads Soegaard and Rikke Friis Dam. The Encyclopedia of Human-Computer Interaction, 2nd Ed. Aarhus, Denmark: The Interaction Design Foundation. Accessed January 14. http://www.interaction-design.org/encyclopedia/data_visualization_for_human_perception.html. Michael Friendly quoted at http://www.visualcomplexity.com/vc/blog/?p=1076 If interested in the history of visualisation, find out more at http://datavis.ca/milestones/ (Milestones in the history of data visualisation) or http://www.cabinetmagazine.org/issues/13/timelines.php (CABINET // A Timeline of Timelines)
  6. Hopefully have some ideas now for how visualisations can enable 'scholars to ask increasingly complex research questions by analysing large scale datasets with freely available tools.’ Thinking now about how visualisations can be used to understand, analyse and present large-scale datasets in the humanities and science, and the value of visualisation tools in understanding the shape of a data set. In digital humanities, part of discourse around distant and close reading. Enables overview of many sources over long periods of time, highlighting changes in style, genre or content. Visualisation allows a view of large numbers of items and with tools like entity recognition, can help put them in spatial, historical or cultural context.  Ultimately about enabling spotting of patterns; patterns can lead to hypothesis.
  7. Lots of different ways to think about types... Do you want to find new insights, or to communicate or convince? Can be exploratory (find stories) or explanatory (tell stories) in purpose, and sit anywhere on an axis from analytic/pragmatic to abstract/emotive. Source: http://www.slideshare.net/visualisingdata/andy-kirks-facebook-talk A Tale of Two Types of Visualization and Much Confusion, Robert Kosara: 'two major types of data-based visualization, and understanding the differences. … Pragmatic Visualization…even if understanding this requires some work and experience, the goal of this method is to communicate the data, as efficiently as possible. ... If a visualization is designed to visually represent data, and to do that in such a way as to gain new insights into that data, it shall be called a pragmatic visualization. The basic idea is that using the human visual system (instead of automatic means like data mining or statistics), we can gain insight into data, and develop an understanding of the data and the structures in it. To determine whether a visualization is pragmatic, we simply ask if it allows us to efficiently read the data (or at least the relationships between subsets) from the display.' Cf Artistic Visualization
  8. Scatterplots: good for relationships between variables. Matrix chart: good for multi-dimensional data. Bubble chart: good for data with big variations in numbers. Line, stack graphs: good for changes in numbers over time. Pie charts: good for showing proportions. Treemap: good for hierarchical structures. Word tree: good for unstructured text. Phrase Net: display common relationships between words in text. Maps: display data by location.
  9. What types of data are suitable for visualisation? What issues do researchers commonly encounter when applying tools designed for the commercial sector to typically fuzzy, incomplete and complex humanities data? Data within one dataset might have been prepared by different departments, in different original systems or at different times, so when cleaning data, some content might be more likely to drop out than others.
  10. Examples from the Cooper Hewitt collection. I spent 3/5 of my time at the Cooper Hewitt just trying to get the data clean enough to vaguely represent the collection. The problem is that computers think U.S., U. S. , U.S.A., U. S. A. , United States, United States of America are six different places. Fields also contain things like internal notes about potential duplicates, unexpected extra information - notes on what type of location, etc. Lots of inconsistencies - uncertainty and date ranges expressed in different ways. More common museum issues - What year is 'early 18th century'? What do you do with '1836 (probably)'?
  11. Tools die when they encounter messy data
  12. There are also lots of software libraries for creating visualisations (http://selection.datavisualization.ch/ lets you toggle between ones that require you to code and ones that don’t) but many require some programming knowledge. If you want to do really interesting things, invent new types of visualisations or find ways of presenting your specific data, you might need to get stuck into some code. Finding someone to work with can be a good way of learning if you don’t have any training available to you.
  13. Visualization Options Available in Many Eyes: http://www-958.ibm.com/software/data/cognos/manyeyes/page/Visualization_Options.html Data formats for uploading data: 1. Prepare your data. First, find the data set that you want to put into Many Eyes. The size limit is 5 megabytes. Data tables: If your data is a list of values, first format it into a table with informative column headers. If your columns have different units of measure, be sure to include the units in the headers. Use a spreadsheet program such as Microsoft Excel or a text file where columns are separated with tabs. If this is your first upload, read the format guidelines. If you have a specific visualization in mind, take a look at its explanation page for additional information. Free text: If your data is free text (such as an essay or a speech), open the data in a word processor or web browser, select the text, and copy it to the clipboard by typing control-C (Windows) or command-C (Macintosh). http://www-958.ibm.com/software/analytics/manyeyes/datasets/new
  14. The code is heavily (and chattily) commented with things to try so that you can start to see how the code affects what happens on the page.
  15. CSDiff (Windows)
  16. It physically hurts me to see unmatched quotes because they have been the cause of so much trauma in the past
  17. Visualisation type - review previous slides, think about whether you're: comparing categories; assessing hierarchies & part-to-whole relationships; showing changes over time; charting connections and relationships; mapping geo-spatial data. You might get further working in pairs… [Exercises must include: creating a data visualisation (learn how to use online tools to create visualisations that explore British Library datasets such as the British National Bibliography or 19th Century books, designed to result in something to take home to mum); using Google Refine to clean and prepare data. Do, clean, re-do? How to design so that failure is a learning experience? Small, controlled 'compare and contrast' experiments with ManyEyes? Do exercise on discussing how visualisations are good or bad in terms of design?]
  18. Find out more at http://www.miaridge.com/resources-for-data-visualisation-for-analysis-in-scholarly-research/
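
Note 10 asks what year 'early 18th century' is, and what to do with '1836 (probably)'. One possible answer, sketched here in Python, is to store every date as an earliest year, a latest year and an uncertainty flag, so that a timeline tool can still plot it. The function name `parse_fuzzy_year` is hypothetical, and the conventions are assumptions made for illustration: the 18th century is treated as 1700 to 1799, 'early' means the first third of a century, and any century-level date counts as uncertain. A real project would need to agree its own rules.

```python
import re

# Assumed convention: offsets within a century for each qualifier.
CENTURY_PARTS = {
    "early": (0, 33),
    "mid": (34, 66),
    "late": (67, 99),
    None: (0, 99),  # no qualifier: the whole century
}


def parse_fuzzy_year(text):
    """Return (earliest, latest, uncertain), or None if unrecognised."""
    value = text.strip().lower()

    # Case 1: a four-digit year, possibly hedged with "probably" or "?".
    year = re.search(r"\b(\d{4})\b", value)
    if year:
        y = int(year.group(1))
        hedged = bool(re.search(r"probably|\?", value))
        return (y, y, hedged)

    # Case 2: "early/mid/late Nth century", or just "Nth century".
    century = re.search(
        r"\b(early|mid|late)?\s*(\d{1,2})(?:st|nd|rd|th) century", value
    )
    if century:
        part = century.group(1)
        start = (int(century.group(2)) - 1) * 100
        low, high = CENTURY_PARTS[part]
        return (start + low, start + high, True)

    # Anything else is left for a person to look at.
    return None


for example in ["1836 (probably)", "early 18th century", "Bali? Java? Mexico?"]:
    print(example, "->", parse_fuzzy_year(example))
```

Returning `None` for unrecognised values keeps them visible for manual review instead of silently dropping the row, which matters because some content is more likely to drop out during cleaning than others.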