1. Telling Stories With Data
Class #1
March 20th, 2017
David Newbury — @workergnome 1
2. What We're Doing Today:
— Syllabus Review
— (Brief) History of Data Visualization
— (Tiny) Theory of Visualization
— (Nerdy) Overview of Concepts
— (Fake) Data Exploration
David Newbury — @workergnome 2
23. Dataviz is constructed reality.
You are telling a story, not (just) stating facts.
David Newbury — @workergnome 23
24. data art
as opposed to
data visualization
as opposed to
statistical graphics
David Newbury — @workergnome 24
25. Statistical
Graphics
How do I create Statistical Graphs
in SAS 9.1.3 without Proc Gplot.
UCLA: Statistical Consulting
Group.
http://www.ats.ucla.edu/stat/sas/
notes2/
David Newbury — @workergnome 25
26. Data Art
Dear Data
Giorgia Lupi & Stefanie Posavec.
http://www.dear-data.com
David Newbury — @workergnome 26
27. Two Uses1). help people grasp things outside their reach
David Newbury — @workergnome 27
28. Two Uses1). help people grasp things outside their reach
2.) tell stories
David Newbury — @workergnome 28
30. Dataviz is constructed reality.
Do you care how true your story is?
Do you care how accurate your story is?
Are you trying to teach, entertain, or convince?
David Newbury — @workergnome 30
32. What can you visualise?
David Newbury — @workergnome 32
33. Potential Subjects.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
David Newbury — @workergnome 33
34. Potential Subjects.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
...and other things that begin with S.
David Newbury — @workergnome 34
35. What are you interested in?
I'm interested in subways.
David Newbury — @workergnome 35
37. What question about your subject are you interested in?
— Are subways more efficient than owning a car?
— How often do I ride the subway in a year?
— What's locations have the best access to subways?
— What's the average subway commute in Pittsburgh?
David Newbury — @workergnome 37
39. Dimension
Which bits of information about a subject
are you going to focus on?
David Newbury — @workergnome 39
40. Possible Dimensions
number of cars
duration of ride
date of a ride
different lines
number of stops
cost per ride
number of stops per day
time between stops
cleanliness
David Newbury — @workergnome 40
41. Scope
Out of the infinite ways to look at your subject,
how are you going to choose one?
David Newbury — @workergnome 41
42. Possible Scopes
All trains in a day
All the rides that I've been on this year
My train this morning
All of the stops in the city
Each line
Every train stop in the past 50 years
David Newbury — @workergnome 42
44. Choose one.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
...and other things that begin with S.
David Newbury — @workergnome 44
45. TRY IT.1. Write down your subject
2. Write down your question
3. Write down as many dimensions as you can
4. Write down possible scopes for your data
David Newbury — @workergnome 45
48. Types of Data
number of cars - Numeric
duration of ride - Numeric
date of a ride - Date
different lines - Category
number of stops - Numeric
cost per ride - Category
number of stops per day - Numeric
time between stops - Numeric
cleanliness - String
David Newbury — @workergnome 48
51. number of cars - Measure
duration of ride - Measure
date of a ride - Measure
different lines - Categories
number of stops - Measure
cost per ride - Categories
number of stops per day - Measure
time between stops - Measure
cleanliness - Categories
David Newbury — @workergnome 51
53. A hidden dimension:
David (1), Daniel (2), Dawn (3), Danique (4)
Position of the item in the group.
David Newbury — @workergnome 53
54. TRY IT.1. Choose a scope for your data.
2. Identify which dimensions are relevant.
3. Is the dimension is a category or a measure?
David Newbury — @workergnome 54
56. We need to map our data
from a domain
to a range.
David Newbury — @workergnome 56
57. Domain
number of cars - 1...8
duration of ride - 30 sec...2 hours
date of a ride - - 24ft...200ft
different lines - Red line, Blue line, Green line, Silver
Line, Yellow Line
number of stops - **2..20
cost per ride - "$2.50, $1.75, $3.00, $0.00"
number of stops per day - ??...???
time between stops - 30 sec..5 minutes
David Newbury — @workergnome 57
58. Range
Domain is the possible input values
Range is the possible output values
David Newbury — @workergnome 58
59. Data
3, 7, 10, 6, 2
Position of the item in the group.
Domain
[0-10]
[1-5]
Range
X: 400px
Y: 800px
Mapping
X: item position
Y: numeric value
David Newbury — @workergnome 59
60. Data
3, 7, 10, 6, 2
Position of the item in the group.
Area
David Newbury — @workergnome 60
61. Data
3, 7, 10, 6, 2
Position of the item in the group.
Color
David Newbury — @workergnome 61
62. Data
val1: 3, 7, 10, 6, 2
val2: 5, 8, 1, 8, 3
val3: Cat, Dog, Cat, Cat, Dog
Position of the item in the group.
Mapping
X: item position
Y: val1
Size: val2
Color: val3
David Newbury — @workergnome 62
63. Dimensions beyond X and Y.
Color
Size
Shape
Labels
Patterns
Icons
Anything Else You Can Imagine
David Newbury — @workergnome 63
64. TRY IT.1. Identify your domains
2. For each domain, choose a range
3. Draw it!
David Newbury — @workergnome 64