[Slides for a presentation to the STEAM Vent Meetup on Jan 8th, 2015.] Presents a (somewhat fictional) progression of ideas for using graphs to analyze data, and shows how blindly following this progression leads to failed analysis because we did not treat supporting human reasoning as an end goal. The audience is a general group knowledgeable in math and science.
The Bigger Data Story
1. the BIGger data story
Ben Keller [@vinegarbin bjkeller.github.io linkedin.com/in/bjkeller ]
STEAM Vent
8 January 2015
Creative Commons Attribution-ShareAlike 4.0 International License
9. graphs
[figure: graph with labels a, b, c, d and 1, 2, 3]
used to represent relationships
in our algebra, represents which multiplications work
ab, abc, ad, aba, …
while others don’t
ba = bd = cb = … = 0
14. similarity graph
[figure: users Brian, Charles; items cherries, apples]
model similarity of items by shared likes of users
to construct new edges weighted by number of shared users
[figure: items cherries, apples]
giving a new graph representing similarity between items
[figure: items apples, bananas, cherries, doughnuts, eggs]
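The construction on this slide can be sketched in a few lines of Python. This is only an illustration, not the code behind the talk: the names Brian and Charles come from the slide, but the specific likes assigned to them and the function name `item_similarity` are assumptions.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical likes: the user names appear on the slide,
# the items each one likes are assumed for illustration.
likes = {
    "Brian": {"apples", "cherries"},
    "Charles": {"apples", "bananas", "cherries"},
}


def item_similarity(likes):
    """Return {(item_a, item_b): number of users who like both items}."""
    weights = defaultdict(int)
    for items in likes.values():
        # Each user adds 1 to the edge between every pair of items they like.
        for a, b in combinations(sorted(items), 2):
            weights[(a, b)] += 1
    return dict(weights)


similarity = item_similarity(likes)
```

With these assumed likes, the edge between apples and cherries gets weight 2 because both users like both items, while the edges involving bananas get weight 1.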
16. making recommendations
recommend items similar to those a user likes
make the list by ranking items by weight
[figure: user Abby; items apples, bananas, cherries, doughnuts, eggs]
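A minimal sketch of the ranking step, under stated assumptions: the edge weights, Abby's likes, and the `recommend` function are all hypothetical, and summing weights over the items a user already likes is one possible scoring choice that the slide does not specify.

```python
from collections import defaultdict

# Hypothetical item-similarity edges (weights are assumed shared-user counts).
similarity = {
    frozenset({"apples", "cherries"}): 3,
    frozenset({"apples", "bananas"}): 1,
    frozenset({"cherries", "doughnuts"}): 2,
    frozenset({"doughnuts", "eggs"}): 4,
}


def recommend(liked, similarity):
    """Rank items the user has not liked by total edge weight to liked items."""
    scores = defaultdict(int)
    for pair, weight in similarity.items():
        known = pair & liked
        new = pair - liked
        # Only edges joining one liked item to one new item contribute.
        if len(known) == 1 and len(new) == 1:
            (item,) = new
            scores[item] += weight
    # Highest weight first; ties broken alphabetically.
    return sorted(scores, key=lambda item: (-scores[item], item))


abby_likes = {"apples", "cherries"}  # assumed for illustration
recommendations = recommend(abby_likes, similarity)
```

Here eggs is never recommended: its only edge goes to doughnuts, which Abby has not liked.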
18. genetic disease
• causal factors of disease are inherited
• assumed to manifest themselves as variations of the genome
• may combine in complex ways
20. a genetic disease question
[figure: paired regions on chr A and chr B]
have paired regions of genome where variations co-occur in bipolar disorder patients
how are these regions related by biology?
21. a genetic disease question
[figure: chr A, chr B; genes in regions; “biology” of genes]
23. a familiar graph
• model biological factors of genes by words found in descriptions of what gene does
• gives us a graph similar to starting graph for recommenders
• form similarity graph only between genes in regions
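The same idea carried over to genes might look like the sketch below. Everything concrete in it is hypothetical: the gene names, the descriptions, the stopword list, and the choice to weight an edge by the number of shared words (by analogy with shared users in the recommender graph).

```python
import re
from itertools import combinations

# Hypothetical gene descriptions, made up for illustration only.
descriptions = {
    "geneA1": "calcium channel subunit in neurons",
    "geneA2": "ribosomal protein",
    "geneB1": "voltage gated calcium channel",
    "geneX": "calcium binding protein",  # lies outside the regions
}
genes_in_regions = {"geneA1", "geneA2", "geneB1"}

STOPWORDS = {"a", "and", "in", "of", "the"}  # assumed minimal list


def words(text):
    """Lowercased set of words in a description, minus stopwords."""
    return set(re.findall(r"[a-z]+", text.lower())) - STOPWORDS


def gene_similarity(descriptions, genes):
    """Edges only between the given genes, weighted by shared words."""
    bags = {g: words(descriptions[g]) for g in sorted(genes)}
    edges = {}
    for g, h in combinations(bags, 2):
        shared = bags[g] & bags[h]
        if shared:
            edges[(g, h)] = len(shared)
    return edges


gene_edges = gene_similarity(descriptions, genes_in_regions)
```

With this toy data the only edge joins geneA1 and geneB1, through the shared words "calcium" and "channel"; geneX is ignored because it is not in a region.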
40. scientist wants to know
how groups here
affect values below
and how values here
are affected by groups
of values above
41. scientist wants to know
how groups here
affect values below
and how values here
are affected by groups
of values above
but cannot answer easily with the tools we’ve chosen
42. [figure: labels A, B, C, D]
we will ultimately solve this problem by supporting the scientist in her reasoning, not by choosing our favorite tool