SlideShare ist ein Scribd-Unternehmen logo
1 von 35
Downloaden Sie, um offline zu lesen
A	
  501(c)(3)	
  not-­‐for-­‐profit	
  
                             operaCng	
  clouds	
  for	
  science.	
  


  The	
  Open	
  Science	
  Data	
  Cloud:	
  
Empowering	
  the	
  Long	
  Tail	
  of	
  Science	
  
               October	
  12,	
  2012	
  

               Robert	
  L.	
  Grossman	
  
           University	
  of	
  Chicago	
  
        and	
  Open	
  Cloud	
  ConsorCum	
  
QuesCon	
  1.	
  What	
  is	
  the	
  
cyberinfrastructure	
  required	
  to	
  manage,	
  
analyze,	
  archive	
  and	
  share	
  big	
  data?	
  	
  	
  
	
  
Call	
  this	
  analyCc	
  infrastructure.	
  
QuesCon	
  2.	
  What	
  is	
  the	
  analogy	
  of	
  the	
  
GLIF*	
  for	
  analyCc	
  infrastructure?	
  

*GLIF	
  (www.glif.is),	
  the	
  Global	
  Lambda	
  Integrated	
  Facility,	
  is	
  an	
  internaConal	
  
virtual	
  organizaCon	
  that	
  promotes	
  the	
  paradigm	
  of	
  lambda	
  networking.	
  GLIF	
  
provides	
  lambdas	
  internaConally	
  as	
  an	
  integrated	
  facility	
  to	
  support	
  data-­‐
intensive	
  scienCfic	
  research,	
  and	
  supports	
  middleware	
  development	
  for	
  
lambda	
  networking.	
  	
  
Number	
  



1000’s	
       Individual	
  scienCsts	
  &	
  
               small	
  projects	
  

100’s	
  
                                  Community	
  based	
  
                                  science	
  via	
  Science	
  as	
  a	
  
10’s	
                            Service	
  
                                                                      very	
  large	
  projects	
  
                                                                              Data	
  Size	
  
              Small	
              Medium	
  to	
  Large	
  	
   Very	
  Large	
  
             Public	
                 Shared	
  community	
               Dedicated	
  	
  
             infrastructure	
         infrastructure	
                    infrastructure	
  
The	
  long	
  tail	
  of	
  data	
  science	
  




A	
  few	
  large	
  data	
                      Many	
  smaller	
  data	
  
science	
  projects.	
                           science	
  projects.	
  
Part	
  1.	
  
What	
  Instrument	
  Do	
  we	
  Use	
  to	
  	
  
Make	
  Big	
  Data	
  Discoveries?	
  




How	
  do	
  we	
  build	
  a	
  “datascope?”	
  
TB?	
  
                                 PB?	
  
                                 EB?	
  
                                 ZB?	
  


What	
  is	
  big	
  data?	
  
Another	
  way:	
  




                                                        opencompute.org	
  

Think	
  of	
  data	
  as	
  big	
  if	
  you	
  measure	
  it	
  in	
  MW,	
  as	
  in	
  
   Facebook’s	
  Pineville	
  Data	
  Center	
  is	
  30	
  MW.	
  
An	
  algorithm	
  and	
  
compuCng	
  
infrastructure	
  is	
  “big-­‐
data	
  scalable”	
  if	
  adding	
  
a	
  rack	
  (or	
  container)	
  of	
  
data	
  (and	
  corresponding	
  
processors)	
  allows	
  you	
  
to	
  do	
  the	
  same	
  
computaCon	
  in	
  the	
  
same	
  Cme	
  but	
  over	
  
more	
  data.	
  
Commercial	
  Cloud	
  Service	
  Provider	
  (CSP)	
  	
  
        15	
  MW	
  Data	
  Center	
  

              Monitoring,	
  
                                                     AccounCng	
  and	
  
            network	
  security	
  
                                                         billing	
                                Customer	
  
             and	
  forensics	
  
                                                                                                   Facing	
  
                                                                                                   Portal	
  
               AutomaCc	
  
            provisioning	
  and	
                   100,000	
  servers	
  
             infrastructure	
                         1	
  PB	
  DRAM	
  
             management	
                          100’s	
  of	
  PB	
  of	
  disk	
   ~1	
  Tbps	
  egress	
  bandwidth	
  
                                                                                       	
  

 25	
  operators	
  for	
  15	
  MW	
  Commercial	
  Cloud	
           Data	
  center	
  network	
  
My	
  vote	
  for	
  a	
  datascope:	
  a	
  (bouCque)	
  
data	
  center	
  scale	
  facility	
  with	
  a	
  big-­‐
data	
  scalable	
  analyCc	
  infrastructure.	
  

What	
  would	
  a	
  global	
  integrated	
  
facility	
  for	
  datascopes	
  look	
  like?	
  
Some	
  Examples	
  of	
  Big	
  Data	
  Science	
  

Discipline	
                                                       Dura2on	
   Size	
                                                                                 #	
  Devices	
  
HEP	
  -­‐	
  LHC	
                                                10	
  years	
   15	
  PB/year*	
                                                                   One	
  

Astronomy	
  -­‐	
  LSST	
   10	
  years	
   12	
  PB/year**	
                                                                                                        One	
  

Genomics	
  -­‐	
  NGS	
                                           2-­‐4	
  years	
   0.5	
  TB/genome	
   1000’s	
  


*At	
  full	
  capacity,	
  the	
  Large	
  Hadron	
  Collider	
  (LHC),	
  the	
  world's	
  largest	
  parCcle	
  accelerator,	
  is	
  expected	
  to	
  produce	
  more	
  than	
  15	
  
million	
  Gigabytes	
  of	
  data	
  each	
  year.	
  	
  …	
  This	
  ambiCous	
  project	
  connects	
  and	
  combines	
  the	
  IT	
  power	
  of	
  more	
  than	
  140	
  computer	
  
centres	
  in	
  33	
  countries.	
  	
  Source:	
  hjp://press.web.cern.ch/public/en/Spotlight/SpotlightGrid_081008-­‐en.html	
  
	
  
**As	
  it	
  carries	
  out	
  its	
  10-­‐year	
  survey,	
  LSST	
  will	
  produce	
  over	
  15	
  terabytes	
  of	
  raw	
  astronomical	
  data	
  each	
  night	
  (30	
  terabytes	
  
processed),	
  resulCng	
  in	
  a	
  database	
  catalog	
  of	
  22	
  petabytes	
  and	
  an	
  image	
  archive	
  of	
  100	
  petabytes.	
  	
  Source:	
  hjp://www.lsst.org/
News/enews/teragrid-­‐1004.html	
  
One	
  large	
  instrument	
     Many	
  smaller	
  instruments	
  
Sci	
  CSP	
  services	
  




                                        Data	
  scienCst	
  




Datascope	
  –	
  Science	
  Cloud	
  Service	
  
Provider	
  (Sci	
  CSP)	
  
What	
  are	
  some	
  of	
  the	
  important	
  
differences	
  between	
  commercial	
  
 and	
  research-­‐focused	
  Sci	
  CSPs?	
  	
  
Science	
  CSP	
                           Commercial	
  CSP	
  
POV	
            DemocraCze	
  access	
  to	
               As	
  long	
  as	
  you	
  pay	
  the	
  bill;	
  
                 data.	
  	
  Integrate	
  data	
  to	
     as	
  long	
  as	
  the	
  business	
  
                 make	
  discoveries.	
  	
  Long	
         model	
  holds.	
  
                 term	
  archive.	
  
Data	
  &	
      Data	
  intensive	
                Internet	
  style	
  scale	
  out	
  
Storage	
               Science	
  Clouds	
  
                 compuCng	
  &	
  HP	
  storage	
   and	
  object-­‐based	
  storage	
  
Flows	
          Large	
  data	
  flows	
  in	
  and	
       Lots	
  of	
  small	
  web	
  flows	
  
                 out	
  
Streams	
        Streaming	
  processing	
                  NA	
  
                 required	
  
AccounCng	
      EssenCal	
                                 EssenCal	
  
Lock	
  in	
     Moving	
  environment	
                    Lock	
  in	
  is	
  good	
  
                 between	
  CSPs	
  essenCal	
  
Part	
  2.	
  
The	
  Open	
  Cloud	
  ConsorCum’s	
  	
  
Open	
  Science	
  Data	
  Cloud	
  
•  U.S	
  based	
  not-­‐for-­‐profit	
  corporaCon.	
  
•  Manages	
  cloud	
  compuCng	
  infrastructure	
  to	
  
   support	
  scienCfic	
  research:	
  Open	
  Science	
  
   Data	
  Cloud.	
  
•  Manages	
  cloud	
  compuCng	
  testbeds:	
  Open	
  
   Cloud	
  Testbed.	
  
	
  


www.opencloudconsorCum.org	
                                  18	
  
OCC	
  Members	
  &	
  Partners	
  
•  Companies:	
  Cisco,	
  Yahoo!,	
  Citrix,	
  …	
  
•  UniversiCes:	
  	
  University	
  of	
  Chicago,	
  
   Northwestern	
  Univ.,	
  Johns	
  Hopkins,	
  Calit2,	
  
   ORNL,	
  University	
  of	
  Illinois	
  at	
  Chicago,	
  …	
  
•  Federal	
  agencies	
  and	
  labs:	
  NASA,	
  LLNL,	
  ORNL	
  
•  InternaConal	
  Partners:	
  AIST	
  (Japan),	
  U.	
  
   Edinburgh,	
  U.	
  Amsterdam,	
  …	
  
•  Partners:	
  NaConal	
  Lambda	
  Rail	
  
                                                                       19	
  
OCC	
  2011	
  Resources	
  
Resource	
                   Type	
                    Comments	
  
OSDC	
  Adler	
  &	
         UClity	
  Cloud	
  	
     1248	
  cores	
  and	
  0.4	
  PB	
  disk	
  
Sullivan	
  
OCC	
  –	
  Y	
              Data	
  Cloud	
           928	
  cores	
  and	
  1.0	
  	
  PB	
  disk	
  
OCC	
  –	
  Matsu	
          Mixed	
                   1	
  rack	
  
OSDC	
  Root	
               Storage	
                 0.8	
  PB	
  


        •  OCC-­‐Adler,	
  Sullivan	
  &	
  Root	
  will	
  more	
  than	
  double	
  in	
  
           size	
  in	
  2012.	
  
Bionimbus	
  WG	
  




bionimbus.opensciencedatacloud.org	
  (biological	
  data)	
  
One	
  Million	
  Genomes	
  
•  Sequencing	
  a	
  million	
  genomes	
  would	
  most	
  
   likely	
  fundamentally	
  change	
  the	
  way	
  we	
  
   understand	
  genomic	
  variaCon.	
  
•  The	
  genomic	
  data	
  for	
  a	
  paCent	
  is	
  about	
  1	
  TB	
  
   (including	
  samples	
  from	
  both	
  tumor	
  and	
  
   normal	
  Cssue).	
  
•  One	
  million	
  genomes	
  is	
  about	
  1000	
  PB	
  or	
  1	
  EB	
  
•  With	
  compression,	
  it	
  may	
  be	
  about	
  100	
  PB	
  
•  At	
  $1000/genome,	
  the	
  sequencing	
  would	
  cost	
  
   about	
  $1B	
  
Big	
  data	
  driven	
  discovery	
  on	
  
                1,000,000	
  genomes	
  and	
  1	
  EB	
  of	
  data.	
  



Genomic-­‐                         Improved	
                         	
  Genomic-­‐	
  
 driven	
                        understanding	
                      driven	
  drug	
  
diagnosis	
                       of	
  genomic	
                    development	
  
                                    science	
  



                          Precision	
  diagnosis	
  and	
  
                          treatment.	
  	
  PrevenCve	
  
                                health	
  care.	
  
Project Matsu WG:
Clouds to Support Earth Science




matsu.opensciencedatacloud.org	
  
                                     24
UDR	
  




•  UDT	
  is	
  a	
  high	
  performance	
  network	
  transport	
  protocol	
  
•  UDR	
  =	
  rsync	
  +	
  UDT	
  	
  
•  It	
  is	
  easy	
  for	
  an	
  average	
  systems	
  administrator	
  to	
  keep	
  
   100’s	
  of	
  TB	
  of	
  distributed	
  data	
  synchronized.	
  	
  
•  We	
  are	
  using	
  it	
  to	
  distribute	
  c.	
  1	
  PB	
  from	
  the	
  OSDC	
  
OpenFlow-­‐Enabled	
  Hadoop	
  WG	
  
•  When	
  running	
  Hadoop	
  some	
  map	
  and	
  reduce	
  jobs	
  
   take	
  significantly	
  longer	
  than	
  others.	
  
•  These	
  are	
  stragglers	
  and	
  can	
  significantly	
  slow	
  down	
  
   a	
  MapReduce	
  computaCon.	
  	
  
•  Stragglers	
  are	
  common	
  (dirty	
  secret	
  about	
  Hadoop)	
  
•  Infoblox	
  and	
  UChicago	
  are	
  leading	
  a	
  OCC	
  Working	
  
   Group	
  on	
  OpenFlow-­‐enabled	
  Hadoop	
  that	
  will	
  
   provide	
  addiConal	
  bandwidth	
  to	
  stragglers.	
  	
  
•  We	
  have	
  a	
  testbed	
  for	
  a	
  wide	
  area	
  version	
  of	
  this	
  
   project.	
  
OSDC	
  PIRE	
  Project	
  
                                                    We	
  select	
  OSDC	
  PIRE	
  Fellows	
  
                                                    (US	
  ciCzens	
  or	
  permanent	
  
                                                    residents):	
  	
  
                                                    •  We	
  give	
  them	
  tutorials	
  and	
  
                                                       training	
  on	
  big	
  data	
  science.	
  
                                                    •  We	
  provide	
  them	
  
                                                       fellowships	
  to	
  work	
  with	
  
                                                       OSDC	
  internaConal	
  
                                                       partners.	
  
                                                    •  We	
  give	
  them	
  preferred	
  
                                                       access	
  to	
  the	
  OSDC.	
  

Nominate	
  your	
  favorite	
  scienCst	
  as	
  an	
  OSDC	
  PIRE	
  Fellow.	
  	
  
www.opensciencedatacloud.org	
  	
  (look	
  for	
  PIRE)	
  
Part	
  3.	
  
Cloud	
  Services	
  OperaCons	
  Centers	
  
Open	
  Science	
  Data	
  Cloud	
  
                                                         AccounCng	
  and	
  
                  Monitoring,	
                           billing	
  (OSDC)	
  
                 compliance,	
  &	
  
                   security	
                                                                        Customer	
  Facing	
  
                                                       Science	
  Cloud	
  SW	
  
                                                           &	
  Services	
                            Portal	
  (Tukey)	
  
                  AutomaCc	
  
               provisioning	
  and	
  
                                                            3	
  PB	
  2011	
  
                infrastructure	
                           10	
  PB	
  2012	
  	
  
                management	
                                                             ~100	
  Gbps	
  bandwidth	
  
                                                         able	
  to	
  scale	
  to	
  
                                                                                         	
  
                                                             100	
  PB?	
  

   5-­‐12	
  operators	
  to	
  operate	
  1-­‐5	
  MW	
  Science	
  Cloud	
   Data	
  center	
  network	
  



OSDC	
  Data	
  Stack	
  based	
  upon	
  OpenStack,	
  Hadoop,	
  GlusterFS,	
  UDT,	
  …	
  
Cloud	
  Services	
  	
  
          OperaCons	
  Centers	
  (CSOC)	
  

•  The	
  OSDC	
  operates	
  Cloud	
  Services	
  OperaCons	
  
   Center	
  (or	
  CSOC).	
  
•  It	
  is	
  a	
  CSOC	
  focused	
  on	
  supporCng	
  Science	
  
   Clouds	
  for	
  researchers.	
  
•  Compare	
  to	
  Network	
  OperaCons	
  Center	
  or	
  
   NOC.	
  
•  Both	
  are	
  an	
  important	
  part	
  of	
  cyber	
  
   infrastructure	
  for	
  big	
  data	
  science.	
  
OSDC	
  Racks	
  
                                               •  How	
  quickly	
  can	
  
                                                  we	
  set	
  up	
  a	
  rack?	
  
                                               •  How	
  efficiently	
  can	
  
                                                  we	
  operate	
  a	
  rack?	
  
                                                  (racks/admin)	
  


2012	
  OSDC	
  rack	
  design	
  (dray)	
  
•  950	
  TB	
  /	
  rack	
  
•  600	
  cores	
  /	
  rack	
  
EssenCal	
  Services	
  for	
  a	
  Science	
  CSP	
  
•  Support	
  for	
  data	
  intensive	
  compuCng	
  
•  Support	
  for	
  big	
  data	
  flows	
  
•  Account	
  management,	
  authenCcaCon	
  and	
  
   authorizaCon	
  services	
  
•  Health	
  and	
  status	
  monitoring	
  
•  Billing	
  and	
  accounCng	
  
•  Ability	
  to	
  rapidly	
  provision	
  infrastructure	
  
•  Security	
  services,	
  logging,	
  event	
  reporCng	
  
•  Access	
  to	
  large	
  amounts	
  of	
  public	
  data	
  
•  High	
  performance	
  storage	
  
•  Simple	
  data	
  export	
  and	
  import	
  services	
  
Please	
  Join	
  Us!	
  	
  
                   	
  
(Help	
  us	
  from	
  making	
  even	
  
     more	
  mistakes.)	
  
Acknowledgements	
  
Major	
  funding	
  and	
  support	
  for	
  the	
  Open	
  Science	
  Data	
  Cloud	
  (OSDC)	
  is	
  provided	
  by	
  the	
  
Gordon	
  and	
  Bejy	
  Moore	
  FoundaCon.	
  	
  This	
  funding	
  is	
  used	
  to	
  support	
  the	
  OSDC-­‐Adler,	
  
Sullivan	
  and	
  Root	
  faciliCes.	
  
	
  
AddiConal	
  funding	
  for	
  the	
  OSDC	
  has	
  been	
  provided	
  by	
  the	
  following	
  sponsors:	
  
	
  
•  The	
  OCC-­‐Y	
  Hadoop	
  Cluster	
  (approximately	
  1000	
  cores	
  and	
  1	
  PB	
  of	
  storage)	
  was	
  
     donated	
  by	
  Yahoo!	
  in	
  2011.	
  
•  Cisco	
  provides	
  the	
  OSDC	
  access	
  to	
  the	
  Cisco	
  C-­‐Wave,	
  which	
  connects	
  OSDC	
  data	
  
     centers	
  with	
  10	
  Gbps	
  wide	
  area	
  networks.	
  
•  NSF	
  awarded	
  the	
  OSDC	
  a	
  5-­‐year	
  (2010-­‐2016)	
  PIRE	
  award	
  to	
  train	
  scienCsts	
  to	
  use	
  
     the	
  OSDC	
  and	
  to	
  further	
  develop	
  the	
  underlying	
  technology.	
  
•  OSDC	
  technology	
  for	
  high	
  performance	
  data	
  transport	
  is	
  support	
  in	
  part	
  by	
  	
  NSF	
  
     Award	
  1127316.	
  
•  The	
  StarLight	
  Facility	
  in	
  Chicago	
  enables	
  the	
  OSDC	
  to	
  connect	
  to	
  over	
  30	
  high	
  
     performance	
  research	
  networks	
  around	
  the	
  world	
  at	
  10	
  Gbps	
  or	
  higher,	
  with	
  an	
  
     increasing	
  number	
  of	
  100	
  Gbps	
  connecCons.	
  
	
  
The	
  OSDC	
  is	
  managed	
  by	
  the	
  Open	
  Cloud	
  ConsorCum,	
  a	
  501(c)(3)	
  not-­‐for-­‐profit	
  
corporaCon.	
  If	
  you	
  are	
  interested	
  in	
  providing	
  funding	
  or	
  donaCng	
  equipment	
  or	
  
services,	
  please	
  contact	
  us	
  at	
  info@opensciencedatacloud.org.	
  
For	
  more	
  informaCon	
  
•  You	
  can	
  find	
  some	
  more	
  informaCon	
  on	
  my	
  blog:	
  
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  rgrossman.com.	
  
•  Some	
  of	
  my	
  technical	
  papers	
  are	
  also	
  available	
  there.	
  	
  
•  My	
  email	
  address	
  is	
  robert.grossman	
  at	
  uchicago	
  dot	
  edu.	
  
	
  




                                                                                                         Center for
                                                                                                         Research
                                                                                                         Informatics

Weitere ähnliche Inhalte

Was ist angesagt?

Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)Robert Grossman
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? Robert Grossman
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Robert Grossman
 
Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Robert Grossman
 
Large Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefLarge Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefRobert Grossman
 
OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3Robert Grossman
 
Bioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9pBioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9pRobert Grossman
 
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Geoffrey Fox
 
Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Robert Grossman
 
An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)Robert Grossman
 
Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)Robert Grossman
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataRobert Grossman
 
Project Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster ReliefProject Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster ReliefRobert Grossman
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for ScienceIan Foster
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...Ian Foster
 
Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Robert Grossman
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Robert Grossman
 
Big Data HPC Convergence
Big Data HPC ConvergenceBig Data HPC Convergence
Big Data HPC ConvergenceGeoffrey Fox
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the ContinuumIan Foster
 
Cloud Services for Big Data Analytics
Cloud Services for Big Data AnalyticsCloud Services for Big Data Analytics
Cloud Services for Big Data AnalyticsGeoffrey Fox
 

Was ist angesagt? (20)

Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care?
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11
 
Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)
 
Large Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefLarge Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster Relief
 
OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3
 
Bioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9pBioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9p
 
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
 
Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)
 
An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)
 
Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
 
Project Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster ReliefProject Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster Relief
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
 
Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
 
Big Data HPC Convergence
Big Data HPC ConvergenceBig Data HPC Convergence
Big Data HPC Convergence
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
 
Cloud Services for Big Data Analytics
Cloud Services for Big Data AnalyticsCloud Services for Big Data Analytics
Cloud Services for Big Data Analytics
 

Andere mochten auch

Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)Robert Grossman
 
AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016Robert Grossman
 
Why a Manifesto for Open Science?
Why a Manifesto for Open Science?Why a Manifesto for Open Science?
Why a Manifesto for Open Science?Leslie Chan
 
Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)Robert Grossman
 
Next-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plotNext-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plotLi Shen
 
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)Robert Grossman
 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analysesrjorton
 
Practical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large DatasetsPractical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large DatasetsRobert Grossman
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...Robert Grossman
 
Adversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World TalkAdversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World TalkRobert Grossman
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Robert Grossman
 
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...Robert Grossman
 

Andere mochten auch (13)

Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
 
AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016
 
Why a Manifesto for Open Science?
Why a Manifesto for Open Science?Why a Manifesto for Open Science?
Why a Manifesto for Open Science?
 
Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)
 
NGS - QC & Dataformat
NGS - QC & Dataformat NGS - QC & Dataformat
NGS - QC & Dataformat
 
Next-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plotNext-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plot
 
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analyses
 
Practical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large DatasetsPractical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large Datasets
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
 
Adversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World TalkAdversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World Talk
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
 
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
 

Ähnlich wie The Open Science Data Cloud: Empowering the Long Tail of Science

2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it worldChris Dwan
 
Positioning University of California Information Technology for the Future: S...
Positioning University of California Information Technology for the Future: S...Positioning University of California Information Technology for the Future: S...
Positioning University of California Information Technology for the Future: S...Larry Smarr
 
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...Larry Smarr
 
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...Larry Smarr
 
Grid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsGrid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsTal Lavian Ph.D.
 
Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22marpierc
 
The Pacific Research Platform:a Science-Driven Big-Data Freeway System
The Pacific Research Platform:a Science-Driven Big-Data Freeway SystemThe Pacific Research Platform:a Science-Driven Big-Data Freeway System
The Pacific Research Platform:a Science-Driven Big-Data Freeway SystemLarry Smarr
 
Impact of Grid Computing on Network Operators and HW Vendors
Impact of Grid Computing on Network Operators and HW VendorsImpact of Grid Computing on Network Operators and HW Vendors
Impact of Grid Computing on Network Operators and HW VendorsTal Lavian Ph.D.
 
Creating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data SuperhighwayCreating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data SuperhighwayLarry Smarr
 
Clouds, Grids and Data
Clouds, Grids and DataClouds, Grids and Data
Clouds, Grids and DataGuy Coates
 
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...Larry Smarr
 
Louise McCluskey, Kx Engineer at Kx Systems
Louise McCluskey, Kx Engineer at Kx SystemsLouise McCluskey, Kx Engineer at Kx Systems
Louise McCluskey, Kx Engineer at Kx SystemsDataconomy Media
 
Bionimbus - An Overview (2010-v6)
Bionimbus - An Overview (2010-v6)Bionimbus - An Overview (2010-v6)
Bionimbus - An Overview (2010-v6)Robert Grossman
 
OptIPuter Overview
OptIPuter OverviewOptIPuter Overview
OptIPuter OverviewLarry Smarr
 

Ähnlich wie The Open Science Data Cloud: Empowering the Long Tail of Science (20)

2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it world
 
Positioning University of California Information Technology for the Future: S...
Positioning University of California Information Technology for the Future: S...Positioning University of California Information Technology for the Future: S...
Positioning University of California Information Technology for the Future: S...
 
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
 
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
 
Grid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsGrid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applications
 
Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22
 
The Pacific Research Platform:a Science-Driven Big-Data Freeway System
The Pacific Research Platform:a Science-Driven Big-Data Freeway SystemThe Pacific Research Platform:a Science-Driven Big-Data Freeway System
The Pacific Research Platform:a Science-Driven Big-Data Freeway System
 
Cifar
CifarCifar
Cifar
 
Impact of Grid Computing on Network Operators and HW Vendors
Impact of Grid Computing on Network Operators and HW VendorsImpact of Grid Computing on Network Operators and HW Vendors
Impact of Grid Computing on Network Operators and HW Vendors
 
Grid computing & its applications
Grid computing & its applicationsGrid computing & its applications
Grid computing & its applications
 
1. GRID COMPUTING
1. GRID COMPUTING1. GRID COMPUTING
1. GRID COMPUTING
 
Jorge gomes
Jorge gomesJorge gomes
Jorge gomes
 
Jorge gomes
Jorge gomesJorge gomes
Jorge gomes
 
Jorge gomes
Jorge gomesJorge gomes
Jorge gomes
 
Creating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data SuperhighwayCreating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data Superhighway
 
Clouds, Grids and Data
Clouds, Grids and DataClouds, Grids and Data
Clouds, Grids and Data
 
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
A Campus-Scale High Performance Cyberinfrastructure is Required for Data-Int...
 
Louise McCluskey, Kx Engineer at Kx Systems
Louise McCluskey, Kx Engineer at Kx SystemsLouise McCluskey, Kx Engineer at Kx Systems
Louise McCluskey, Kx Engineer at Kx Systems
 
Bionimbus - An Overview (2010-v6)
Bionimbus - An Overview (2010-v6)Bionimbus - An Overview (2010-v6)
Bionimbus - An Overview (2010-v6)
 
OptIPuter Overview
OptIPuter OverviewOptIPuter Overview
OptIPuter Overview
 

Mehr von Robert Grossman

Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanyRobert Grossman
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsRobert Grossman
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataRobert Grossman
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedRobert Grossman
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchRobert Grossman
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?Robert Grossman
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...Robert Grossman
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...Robert Grossman
 
Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Robert Grossman
 

Mehr von Robert Grossman (9)

Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your Company
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data Platforms
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical Research
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011
 

Kürzlich hochgeladen

TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s LeadershipTDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadershipanjanibaddipudi1
 
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...Andy (Avraham) Blumenthal
 
05052024_First India Newspaper Jaipur.pdf
05052024_First India Newspaper Jaipur.pdf05052024_First India Newspaper Jaipur.pdf
05052024_First India Newspaper Jaipur.pdfFIRST INDIA
 
04052024_First India Newspaper Jaipur.pdf
04052024_First India Newspaper Jaipur.pdf04052024_First India Newspaper Jaipur.pdf
04052024_First India Newspaper Jaipur.pdfFIRST INDIA
 
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopkoEmbed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopkobhavenpr
 
Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...
Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...
Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...narsireddynannuri1
 
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...srinuseo15
 
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)Delhi Call girls
 
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's DevelopmentNara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Developmentnarsireddynannuri1
 
1971 war india pakistan bangladesh liberation.ppt
1971 war india pakistan bangladesh liberation.ppt1971 war india pakistan bangladesh liberation.ppt
1971 war india pakistan bangladesh liberation.pptsammehtumblr
 
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...
₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...
₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...Diya Sharma
 
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docxkfjstone13
 
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)Delhi Call girls
 
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...Axel Bruns
 
06052024_First India Newspaper Jaipur.pdf
06052024_First India Newspaper Jaipur.pdf06052024_First India Newspaper Jaipur.pdf
06052024_First India Newspaper Jaipur.pdfFIRST INDIA
 
Kishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdfKishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdfKISHAN REDDY OFFICE
 

Kürzlich hochgeladen (20)

TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s LeadershipTDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
 
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
 
05052024_First India Newspaper Jaipur.pdf
05052024_First India Newspaper Jaipur.pdf05052024_First India Newspaper Jaipur.pdf
05052024_First India Newspaper Jaipur.pdf
 
04052024_First India Newspaper Jaipur.pdf
04052024_First India Newspaper Jaipur.pdf04052024_First India Newspaper Jaipur.pdf
04052024_First India Newspaper Jaipur.pdf
 
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopkoEmbed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
 
Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...
Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...
Nurturing Families, Empowering Lives: TDP's Vision for Family Welfare in Andh...
 
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
 
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
 
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
 
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's DevelopmentNara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
 
1971 war india pakistan bangladesh liberation.ppt
1971 war india pakistan bangladesh liberation.ppt1971 war india pakistan bangladesh liberation.ppt
1971 war india pakistan bangladesh liberation.ppt
 
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
 
₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...
₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...
₹5.5k {Cash Payment} Independent Greater Noida Call Girls In [Delhi INAYA] 🔝|...
 
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
 
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
 
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
 
06052024_First India Newspaper Jaipur.pdf
06052024_First India Newspaper Jaipur.pdf06052024_First India Newspaper Jaipur.pdf
06052024_First India Newspaper Jaipur.pdf
 
Kishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdfKishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdf
 

The Open Science Data Cloud: Empowering the Long Tail of Science

  • 1. A  501(c)(3)  not-­‐for-­‐profit   operaCng  clouds  for  science.   The  Open  Science  Data  Cloud:   Empowering  the  Long  Tail  of  Science   October  12,  2012   Robert  L.  Grossman   University  of  Chicago   and  Open  Cloud  ConsorCum  
  • 2. QuesCon  1.  What  is  the   cyberinfrastructure  required  to  manage,   analyze,  archive  and  share  big  data?         Call  this  analyCc  infrastructure.  
  • 3. QuesCon  2.  What  is  the  analogy  of  the   GLIF*  for  analyCc  infrastructure?   *GLIF  (www.glif.is),  the  Global  Lambda  Integrated  Facility,  is  an  internaConal   virtual  organizaCon  that  promotes  the  paradigm  of  lambda  networking.  GLIF   provides  lambdas  internaConally  as  an  integrated  facility  to  support  data-­‐ intensive  scienCfic  research,  and  supports  middleware  development  for   lambda  networking.    
  • 4. Number   1000’s   Individual  scienCsts  &   small  projects   100’s   Community  based   science  via  Science  as  a   10’s   Service   very  large  projects   Data  Size   Small   Medium  to  Large     Very  Large   Public   Shared  community   Dedicated     infrastructure   infrastructure   infrastructure  
  • 5. The  long  tail  of  data  science   A  few  large  data   Many  smaller  data   science  projects.   science  projects.  
  • 6. Part  1.   What  Instrument  Do  we  Use  to     Make  Big  Data  Discoveries?   How  do  we  build  a  “datascope?”  
  • 7. TB?   PB?   EB?   ZB?   What  is  big  data?  
  • 8. Another  way:   opencompute.org   Think  of  data  as  big  if  you  measure  it  in  MW,  as  in   Facebook’s  Pineville  Data  Center  is  30  MW.  
  • 9. An  algorithm  and   compuCng   infrastructure  is  “big-­‐ data  scalable”  if  adding   a  rack  (or  container)  of   data  (and  corresponding   processors)  allows  you   to  do  the  same   computaCon  in  the   same  Cme  but  over   more  data.  
  • 10. Commercial  Cloud  Service  Provider  (CSP)     15  MW  Data  Center   Monitoring,   AccounCng  and   network  security   billing   Customer   and  forensics   Facing   Portal   AutomaCc   provisioning  and   100,000  servers   infrastructure   1  PB  DRAM   management   100’s  of  PB  of  disk   ~1  Tbps  egress  bandwidth     25  operators  for  15  MW  Commercial  Cloud   Data  center  network  
  • 11. My  vote  for  a  datascope:  a  (bouCque)   data  center  scale  facility  with  a  big-­‐ data  scalable  analyCc  infrastructure.   What  would  a  global  integrated   facility  for  datascopes  look  like?  
  • 12. Some  Examples  of  Big  Data  Science   Discipline   Dura2on   Size   #  Devices   HEP  -­‐  LHC   10  years   15  PB/year*   One   Astronomy  -­‐  LSST   10  years   12  PB/year**   One   Genomics  -­‐  NGS   2-­‐4  years   0.5  TB/genome   1000’s   *At  full  capacity,  the  Large  Hadron  Collider  (LHC),  the  world's  largest  parCcle  accelerator,  is  expected  to  produce  more  than  15   million  Gigabytes  of  data  each  year.    …  This  ambiCous  project  connects  and  combines  the  IT  power  of  more  than  140  computer   centres  in  33  countries.    Source:  hjp://press.web.cern.ch/public/en/Spotlight/SpotlightGrid_081008-­‐en.html     **As  it  carries  out  its  10-­‐year  survey,  LSST  will  produce  over  15  terabytes  of  raw  astronomical  data  each  night  (30  terabytes   processed),  resulCng  in  a  database  catalog  of  22  petabytes  and  an  image  archive  of  100  petabytes.    Source:  hjp://www.lsst.org/ News/enews/teragrid-­‐1004.html  
  • 13. One  large  instrument   Many  smaller  instruments  
  • 14. Sci  CSP  services   Data  scienCst   Datascope  –  Science  Cloud  Service   Provider  (Sci  CSP)  
  • 15. What  are  some  of  the  important   differences  between  commercial   and  research-­‐focused  Sci  CSPs?    
  • 16. Science  CSP   Commercial  CSP   POV   DemocraCze  access  to   As  long  as  you  pay  the  bill;   data.    Integrate  data  to   as  long  as  the  business   make  discoveries.    Long   model  holds.   term  archive.   Data  &   Data  intensive   Internet  style  scale  out   Storage   Science  Clouds   compuCng  &  HP  storage   and  object-­‐based  storage   Flows   Large  data  flows  in  and   Lots  of  small  web  flows   out   Streams   Streaming  processing   NA   required   AccounCng   EssenCal   EssenCal   Lock  in   Moving  environment   Lock  in  is  good   between  CSPs  essenCal  
  • 17. Part  2.   The  Open  Cloud  ConsorCum’s     Open  Science  Data  Cloud  
  • 18. •  U.S  based  not-­‐for-­‐profit  corporaCon.   •  Manages  cloud  compuCng  infrastructure  to   support  scienCfic  research:  Open  Science   Data  Cloud.   •  Manages  cloud  compuCng  testbeds:  Open   Cloud  Testbed.     www.opencloudconsorCum.org   18  
  • 19. OCC  Members  &  Partners   •  Companies:  Cisco,  Yahoo!,  Citrix,  …   •  UniversiCes:    University  of  Chicago,   Northwestern  Univ.,  Johns  Hopkins,  Calit2,   ORNL,  University  of  Illinois  at  Chicago,  …   •  Federal  agencies  and  labs:  NASA,  LLNL,  ORNL   •  InternaConal  Partners:  AIST  (Japan),  U.   Edinburgh,  U.  Amsterdam,  …   •  Partners:  NaConal  Lambda  Rail   19  
  • 20. OCC  2011  Resources   Resource   Type   Comments   OSDC  Adler  &   UClity  Cloud     1248  cores  and  0.4  PB  disk   Sullivan   OCC  –  Y   Data  Cloud   928  cores  and  1.0    PB  disk   OCC  –  Matsu   Mixed   1  rack   OSDC  Root   Storage   0.8  PB   •  OCC-­‐Adler,  Sullivan  &  Root  will  more  than  double  in   size  in  2012.  
  • 22. One  Million  Genomes   •  Sequencing  a  million  genomes  would  most   likely  fundamentally  change  the  way  we   understand  genomic  variaCon.   •  The  genomic  data  for  a  paCent  is  about  1  TB   (including  samples  from  both  tumor  and   normal  Cssue).   •  One  million  genomes  is  about  1000  PB  or  1  EB   •  With  compression,  it  may  be  about  100  PB   •  At  $1000/genome,  the  sequencing  would  cost   about  $1B  
  • 23. Big  data  driven  discovery  on   1,000,000  genomes  and  1  EB  of  data.   Genomic-­‐ Improved    Genomic-­‐   driven   understanding   driven  drug   diagnosis   of  genomic   development   science   Precision  diagnosis  and   treatment.    PrevenCve   health  care.  
  • 24. Project Matsu WG: Clouds to Support Earth Science matsu.opensciencedatacloud.org   24
  • 25. UDR   •  UDT  is  a  high  performance  network  transport  protocol   •  UDR  =  rsync  +  UDT     •  It  is  easy  for  an  average  systems  administrator  to  keep   100’s  of  TB  of  distributed  data  synchronized.     •  We  are  using  it  to  distribute  c.  1  PB  from  the  OSDC  
  • 26. OpenFlow-­‐Enabled  Hadoop  WG   •  When  running  Hadoop  some  map  and  reduce  jobs   take  significantly  longer  than  others.   •  These  are  stragglers  and  can  significantly  slow  down   a  MapReduce  computaCon.     •  Stragglers  are  common  (dirty  secret  about  Hadoop)   •  Infoblox  and  UChicago  are  leading  a  OCC  Working   Group  on  OpenFlow-­‐enabled  Hadoop  that  will   provide  addiConal  bandwidth  to  stragglers.     •  We  have  a  testbed  for  a  wide  area  version  of  this   project.  
  • 27. OSDC  PIRE  Project   We  select  OSDC  PIRE  Fellows   (US  ciCzens  or  permanent   residents):     •  We  give  them  tutorials  and   training  on  big  data  science.   •  We  provide  them   fellowships  to  work  with   OSDC  internaConal   partners.   •  We  give  them  preferred   access  to  the  OSDC.   Nominate  your  favorite  scienCst  as  an  OSDC  PIRE  Fellow.     www.opensciencedatacloud.org    (look  for  PIRE)  
  • 28. Part  3.   Cloud  Services  OperaCons  Centers  
  • 29. Open  Science  Data  Cloud   AccounCng  and   Monitoring,   billing  (OSDC)   compliance,  &   security   Customer  Facing   Science  Cloud  SW   &  Services   Portal  (Tukey)   AutomaCc   provisioning  and   3  PB  2011   infrastructure   10  PB  2012     management   ~100  Gbps  bandwidth   able  to  scale  to     100  PB?   5-­‐12  operators  to  operate  1-­‐5  MW  Science  Cloud   Data  center  network   OSDC  Data  Stack  based  upon  OpenStack,  Hadoop,  GlusterFS,  UDT,  …  
  • 30. Cloud  Services     OperaCons  Centers  (CSOC)   •  The  OSDC  operates  Cloud  Services  OperaCons   Center  (or  CSOC).   •  It  is  a  CSOC  focused  on  supporCng  Science   Clouds  for  researchers.   •  Compare  to  Network  OperaCons  Center  or   NOC.   •  Both  are  an  important  part  of  cyber   infrastructure  for  big  data  science.  
  • 31. OSDC  Racks   •  How  quickly  can   we  set  up  a  rack?   •  How  efficiently  can   we  operate  a  rack?   (racks/admin)   2012  OSDC  rack  design  (dray)   •  950  TB  /  rack   •  600  cores  /  rack  
  • 32. EssenCal  Services  for  a  Science  CSP   •  Support  for  data  intensive  compuCng   •  Support  for  big  data  flows   •  Account  management,  authenCcaCon  and   authorizaCon  services   •  Health  and  status  monitoring   •  Billing  and  accounCng   •  Ability  to  rapidly  provision  infrastructure   •  Security  services,  logging,  event  reporCng   •  Access  to  large  amounts  of  public  data   •  High  performance  storage   •  Simple  data  export  and  import  services  
  • 33. Please  Join  Us!       (Help  us  from  making  even   more  mistakes.)  
  • 34. Acknowledgements   Major  funding  and  support  for  the  Open  Science  Data  Cloud  (OSDC)  is  provided  by  the   Gordon  and  Bejy  Moore  FoundaCon.    This  funding  is  used  to  support  the  OSDC-­‐Adler,   Sullivan  and  Root  faciliCes.     AddiConal  funding  for  the  OSDC  has  been  provided  by  the  following  sponsors:     •  The  OCC-­‐Y  Hadoop  Cluster  (approximately  1000  cores  and  1  PB  of  storage)  was   donated  by  Yahoo!  in  2011.   •  Cisco  provides  the  OSDC  access  to  the  Cisco  C-­‐Wave,  which  connects  OSDC  data   centers  with  10  Gbps  wide  area  networks.   •  NSF  awarded  the  OSDC  a  5-­‐year  (2010-­‐2016)  PIRE  award  to  train  scienCsts  to  use   the  OSDC  and  to  further  develop  the  underlying  technology.   •  OSDC  technology  for  high  performance  data  transport  is  support  in  part  by    NSF   Award  1127316.   •  The  StarLight  Facility  in  Chicago  enables  the  OSDC  to  connect  to  over  30  high   performance  research  networks  around  the  world  at  10  Gbps  or  higher,  with  an   increasing  number  of  100  Gbps  connecCons.     The  OSDC  is  managed  by  the  Open  Cloud  ConsorCum,  a  501(c)(3)  not-­‐for-­‐profit   corporaCon.  If  you  are  interested  in  providing  funding  or  donaCng  equipment  or   services,  please  contact  us  at  info@opensciencedatacloud.org.  
  • 35. For  more  informaCon   •  You  can  find  some  more  informaCon  on  my  blog:                                                  rgrossman.com.   •  Some  of  my  technical  papers  are  also  available  there.     •  My  email  address  is  robert.grossman  at  uchicago  dot  edu.     Center for Research Informatics