The bioinformatics community faces numerous challenges in developing research and clinical software for analyzing genomic data. The DNAnexus Platform enables solutions to these problems. This presentation will cover its features: massively scalable on-demand cloud infrastructure, a fully configurable and scriptable genomics API, a command-line interface and SDKs, collaboration and community support, visualization and publication tools, enterprise-grade security, and compliance with clinical and diagnostic standards.
Genomics Applications in the Cloud with the DNAnexus Platform
1. Powering the genomics revolution
Genomics applications in the cloud with the DNAnexus Platform
Andrey Kislyuk
SFAF 2013
2. What drives DNAnexus?
Grad students, industry scientists, researchers.
We kept hitting bioinformatics roadblocks.
– Don’t know how much compute we’ll need
– Resource contention
– I/O bottlenecks
– Storage shortages
– Bad interfaces
– Byzantine IT
– Overpriced and underutilized
– Blocks clinical applications
3. What if there was a way to…
Break IT limitations
Publish your tools
Work together
17. Those are not just acronyms…
• All data encrypted with full-disk AES-256 at rest, SSL on the move
• Production access controls
• Third-party security audits
• Optional 2-Factor Auth
• LXC (Linux Containers) hypervisor
• Auditable by user
18. Your data is yours
We will never hold your data hostage
– Always exportable
– Always downloadable
– We’re not allowed to look at it
19. The DNAnexus Platform
Reliability
• Triple data redundancy
• Geographically distributed
• Job-level hardware fault tolerance
• Reproducible and auditable results for 6+ years
DNAnexus is ready for clinical data
21. Open-source stack
App wizard walks you through app creation
Learn by example: fork our repos
Collaborate: deploy apps from GitHub
22. Debug quickly
• SDK tools for debugging
• Jobs start in 5 seconds under most circumstances
• Real-time job logs
23. Data standards
• GTable: Columnar indexed data store for genome data
– Suitable for storing SAM, GFF/GTF, BED, VCF, etc.
• I/O specifications
– Contracts between apps enable composition
• Data type definitions and validators
Foundations for Interoperability
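The idea that I/O specifications act as contracts between apps can be illustrated with a minimal Python sketch. It is not the DNAnexus API: `Field`, `AppSpec`, `unmet_inputs`, `can_compose`, and the type labels "file" and "gtable" are hypothetical names chosen for illustration. The sketch assumes two apps compose when every input the downstream app declares is matched, by name and type, by an output the upstream app declares.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Field:
    """One named, typed entry in an app's input or output specification."""
    name: str
    data_class: str  # hypothetical type label, e.g. "file" or "gtable"


@dataclass(frozen=True)
class AppSpec:
    """Hypothetical app description: a name plus its I/O contract."""
    name: str
    inputs: tuple
    outputs: tuple


def unmet_inputs(upstream, downstream):
    """Return the downstream inputs that no upstream output satisfies."""
    produced = {(f.name, f.data_class) for f in upstream.outputs}
    return [f for f in downstream.inputs
            if (f.name, f.data_class) not in produced]


def can_compose(upstream, downstream):
    """Two apps compose when every downstream input is produced upstream."""
    return not unmet_inputs(upstream, downstream)


# Illustrative specs only; these are not real platform apps.
mapper = AppSpec(
    name="mapper",
    inputs=(Field("reads", "file"),),
    outputs=(Field("mappings", "gtable"),),
)
caller = AppSpec(
    name="variant_caller",
    inputs=(Field("mappings", "gtable"),),
    outputs=(Field("variants", "gtable"),),
)

if __name__ == "__main__":
    print(can_compose(mapper, caller))
    print(unmet_inputs(caller, mapper))
```

Checking the contract before any job runs is the point of the design: a mismatch between two apps surfaces as a list of unmet inputs rather than as a failure partway through a pipeline.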
29. Instant collaboration
• Eliminate data transfer headaches
• Collaborate on data, tools, workflows in one environment
• Enable bioinformatics experts to deliver tools to biologists
47. Reproducibility
Ever try to reproduce results from a bioinformatics paper?
How about CLIA compliance?
All objects are versioned
Analysis I/O is read-only
Jobs enter into project’s permanent record
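The three properties on this slide (versioned objects, read-only analysis I/O, a permanent job record) can be sketched in a few lines of Python. This is a toy model under stated assumptions, not how the platform is implemented: `VersionedStore`, `JobRecord`, and `digest` are hypothetical names, and SHA-256 digests stand in for whatever object identifiers the real system uses.

```python
import hashlib
from types import MappingProxyType


def digest(payload):
    """Content digest used here to pin down exactly which bytes a job saw."""
    return hashlib.sha256(payload).hexdigest()


class VersionedStore:
    """Toy object store: a write never overwrites, it appends a new version."""

    def __init__(self):
        self._versions = {}  # object name -> list of byte payloads

    def put(self, name, payload):
        self._versions.setdefault(name, []).append(bytes(payload))
        return len(self._versions[name])  # 1-based version number

    def get(self, name, version):
        return self._versions[name][version - 1]


class JobRecord:
    """Append-only job log; each entry is exposed as a read-only mapping."""

    def __init__(self):
        self._entries = []

    def record(self, app, inputs, outputs):
        entry = MappingProxyType({
            "app": app,
            "inputs": MappingProxyType(
                {k: digest(v) for k, v in inputs.items()}),
            "outputs": MappingProxyType(
                {k: digest(v) for k, v in outputs.items()}),
        })
        self._entries.append(entry)
        return entry

    def entries(self):
        return tuple(self._entries)  # callers cannot append or delete
```

Because earlier versions stay retrievable and each log entry pins its inputs and outputs by digest, a past analysis can be re-run against exactly the data it originally saw.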
48. Publishing
Authors who publish their software
• Don’t worry about supporting diverse installs
– You installed my package on WHAT?
• Leverage all Platform features
– Accessible UI
• Compose with other apps
– It’s an ecosystem
– Publish your workflows as apps, too
57. DNAnexus is the platform for delivering genomics results to users
58. Powering the genomics revolution
Acknowledgments
DNAnexus is
Andreas Sundquist
Arend Sidow
Serafim Batzoglou
Evan Worley
George Asimenos
Joseph Dale
Lee Bendekgey
Mike Furness
Michael Kawai