Data 101: Introduction to Data Visualization
- 3. What We're Doing Today:
—(Brief) History of Data Visualization
—(Tiny) Theory of Visualization
—(Nerdy) Overview of Concepts
—(Fake) Data Exploration
—(Incomplete) Overview of Tools
Data 101. David Newbury — @workergnome 3
- 4. What We're not Doing Today:
—Writing Code
—Thinking about Mapping
—Worrying about Data Provenance
Data 101. David Newbury — @workergnome 4
- 5. Which is biggest?
15012, 8271, 30193, 1189, 9913, 16000, 92481, 49801,
100407, 2910, 3809, 8018, 61528, 18083, 38691, 1800
Data 101. David Newbury — @workergnome 5
- 28. Dataviz is constructed reality.
You are telling a story, not (just) stating facts.
Data 101. David Newbury — @workergnome 28
- 29. data art
as opposed to
data visualization
as opposed to
statistical graphics
Data 101. David Newbury — @workergnome 29
- 30. Statistical
Graphics
How do I create Statistical
Graphs in SAS 9.1.3 without
Proc Gplot. UCLA: Statistical
Consulting Group.
http://www.ats.ucla.edu/stat/
sas/notes2/
Data 101. David Newbury — @workergnome 30
- 32. Two Uses1). help people grasp things outside their reach
Data 101. David Newbury — @workergnome 32
- 33. Two Uses1). help people grasp things outside their reach
2.) tell stories
Data 101. David Newbury — @workergnome 33
- 35. Dataviz is constructed reality.
Do you care how true your story is?
Do you care how accurate your story is?
Are you trying to teach, entertain, or convince?
Data 101. David Newbury — @workergnome 35
- 41. What can you visualise?
Data 101. David Newbury — @workergnome 41
- 42. Potential Subjects.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
Data 101. David Newbury — @workergnome 42
- 43. Potential Subjects.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
...and other things that begin with S.
Data 101. David Newbury — @workergnome 43
- 45. Scope
Out of the infinite stories about any subject,
which parts are you going to choose?
Data 101. David Newbury — @workergnome 45
- 46. Possible Scopes
All trains in a day
All the rides that I've been on this year
My train this morning
All of the stops in the city
Each line
Every train stop in the past 50 years
Data 101. David Newbury — @workergnome 46
- 47. Dimension
Which bits of information about a subject
are you going to focus on?
Data 101. David Newbury — @workergnome 47
- 48. Possible Dimensions
number of cars
duration of ride
date of a ride
different lines
number of stops
cost per ride
number of stops per day
time between stops
Data 101. David Newbury — @workergnome 48
- 51. Types of Data
number of cars - Numeric
duration of ride - Numeric
date of a ride - Date
different lines - Category
number of stops - Numeric
cost per ride - Category
number of stops per day - Numeric
time between stops - Numeric
Data 101. David Newbury — @workergnome 51
- 54. number of cars - Measure
duration of ride - Measure
date of a ride - Measure
different lines - Categories
number of stops - Measure
cost per ride - Categories
number of stops per day - Measure
time between stops - Measure
cleanliness - Categories
Data 101. David Newbury — @workergnome 54
- 56. A hidden dimension:
David (1), Daniel (2), Dawn (3), Danique (4)
Position of the item in the group.
Data 101. David Newbury — @workergnome 56
- 59. Choose one.
subways, sheep, the solar system,
shoes, sleep, skyline,
snow, supermarket, sausages,
school,the sea, spiders,
staircases, syrup, soap,
sawmills, stereos...
...and other things that begin with S.
Data 101. David Newbury — @workergnome 59
- 61. We need to map our data
from a domain
to a range.
Data 101. David Newbury — @workergnome 61
- 62. Domain
number of cars - 1...8
duration of ride - 30 sec...2 hours
date of a ride - - 24ft...200ft
different lines - Red line, Blue line, Green line, Silver
Line, Yellow Line
number of stops - **2..20
cost per ride - "$2.50, $1.75, $3.00, $0.00"
number of stops per day - ??...???
Data 101. David Newbury — @workergnome 62
- 63. Range
Domain is the possible input values
Range is the possible output values
Data 101. David Newbury — @workergnome 63
- 64. Data
3, 7, 10, 6, 2
Position of the item in the group.
Domain
[0-10]
[1-5]
Range
X: 400px
Y: 800px
Mapping
X: item position
Y: numeric value
Data 101. David Newbury — @workergnome 64
- 65. Data
3, 7, 10, 6, 2
Position of the item in the group.
Area
Data 101. David Newbury — @workergnome 65
- 66. Data
3, 7, 10, 6, 2
Position of the item in the group.
Color
Data 101. David Newbury — @workergnome 66
- 67. Data
3, 7, 10, 6, 2
Position of the item in the group.
Multiples
Dimensions
Data 101. David Newbury — @workergnome 67
- 68. Data
val1: 3, 7, 10, 6, 2
val2: 5, 8, 1, 8, 3
val3: Cat, Dog, Cat, Cat, Dog
Position of the item in the group.
Mapping
X: item position
Y: val1
Size: val2
Color: val3
Data 101. David Newbury — @workergnome 68
- 69. Dimensions beyond X and Y.
Color
Size
Shape
Labels
Patterns
Icons
Anything Else You Can Imagine
Data 101. David Newbury — @workergnome 69