SlideShare ist ein Scribd-Unternehmen logo
1 von 11
Downloaden Sie, um offline zu lesen
The	
  OPTIRAD	
  Pla-orm:	
  Cloud-­‐hosted	
  IPython	
  Notebooks	
  
for	
  collabora?ve	
  EO	
  Data	
  Analysis	
  and	
  Processing	
  
ESA	
  EO	
  Open	
  Science	
  2.0	
  Conference	
  12-­‐14	
  October	
  2015	
  
	
  
Philip	
  Kershaw	
  (CEDA),	
  John	
  Holt	
  (Tessella	
  plc.)	
  José	
  Gómez-­‐Dans,	
  Philip	
  Lewis	
  (UCL)	
  
Nicola	
  Pounder,	
  Jon	
  Styles	
  (Assimila	
  Ltd.)	
  
JASMIN	
  (STFC/Stephen	
  Kill)	
  
Introduc?on	
  
•  OPTIRAD	
  =	
  OPTImisa?on	
  environment	
  for	
  joint	
  retrieval	
  of	
  
mul?-­‐sensor	
  RADiances	
  
–  Collabora?on:	
  CEDA,	
  UCL,	
  Assimila	
  Ltd,	
  FastOpt	
  and	
  VU	
  Amsterdam	
  
–  Funded	
  by	
  ESA	
  
•  Overview	
  of	
  technical	
  solu?on	
  
–  Introduc?on	
  to	
  IPython	
  (Jupyter)	
  Notebook	
  
–  Deployment	
  on	
  JASMIN-­‐CEMS	
  science	
  cloud	
  
•  Make	
  the	
  case,	
  IPython	
  Notebook	
  +	
  Cloud	
  =	
  powerful	
  
combina?on	
  for	
  EO	
  Open	
  Science	
  2.0	
  
OPTIRAD	
  Goals	
  
Address	
  the	
  challenge	
  of	
  
producing	
  consistent	
  EO	
  
land	
  surface	
  informa?on	
  
products	
  from	
  
heterogeneous	
  EO	
  data	
  
input:	
  
Collabora?on:	
  provide	
  a	
  collabora?ve	
  
research	
  environment	
  as	
  a	
  means	
  to	
  
engender	
  closer	
  working	
  between	
  algorithm	
  
specialists,	
  modellers	
  and	
  end	
  users.	
  	
  	
  
Compu?ng	
  resources:	
  processing	
  at	
  high	
  
spa?al	
  and	
  temporal	
  resolu?ons	
  with	
  
computa?onally	
  expensive	
  algorithms.	
  	
  	
  
Usability	
  and	
  access:	
  easy	
  execu?on	
  and	
  
development	
  of	
  exis?ng	
  Python	
  code	
  and	
  
the	
  provision	
  of	
  interac?ve	
  tutorials	
  for	
  new	
  
users	
  
IPython	
  Notebook	
  
•  Provides	
  Python	
  kernels	
  accessible	
  via	
  a	
  
web	
  browser	
  	
  
•  Sessions	
  can	
  be	
  saved	
  and	
  shared	
  
	
  
•  Trivial	
  access	
  to	
  parallel	
  processing	
  
capabili?es	
  –	
  IPython.parallel	
  (ipyparallel)	
  
•  IPython	
  Jupyter	
  Notebook	
  
•  Support	
  for	
  other	
  languages	
  such	
  as	
  
R	
  
•  New	
  JupyterHub	
  allows	
  mul?-­‐user	
  
management	
  of	
  notebooks	
  
•  Gained	
  trac?on	
  as	
  a	
  teaching	
  and	
  
collabora?ve	
  tool	
  
	
  
IPython	
  Notebook	
  +	
  Cloud	
  
•  Cloud’s	
  characteris?cs:	
  
–  Broad	
  network	
  access,	
  resource	
  pooling,	
  elas?city,	
  scale	
  –	
  compute	
  and	
  
storage	
  	
  
–  Good	
  fit	
  for	
  Big	
  Data	
  science	
  applica?ons	
  
	
  
•  Cloud-­‐hosted	
  Notebook	
  -­‐	
  a	
  model	
  already	
  demonstrated	
  with	
  
public	
  cloud	
  services	
  e.g.	
  
–  Wakari,	
  Azure,	
  Rackspace	
  
	
  
•  Central	
  hos?ng	
  allows	
  central	
  management	
  of	
  socware	
  packages	
  
–  no	
  installa?on	
  steps	
  needed	
  for	
  the	
  user	
  
	
  
•  Algorithm	
  prototyping	
  environment	
  next	
  to	
  Big	
  Data	
  
–  Acts	
  as	
  a	
  precursor	
  to	
  opera?onal	
  processing	
  services	
  
Notebook:	
  	
  
a	
  user	
  –	
  applica?on	
  perspec?ve	
  
Support	
  a	
  spectrum	
  of	
  usage	
  models	
  
Different	
  
classes	
  of	
  
user	
  
Long-­‐tail	
  of	
  science	
  users	
  è	
  
Design	
  and	
  development	
  	
  
considera?ons	
  
•  Host	
  on	
  JASMIN-­‐CEMS	
  	
  
–  Data	
  analysis	
  facility	
  and	
  science	
  cloud	
  at	
  Rutherford	
  Appleton	
  Lab,	
  UK	
  
–  Advantage	
  of	
  proximity	
  to	
  locally	
  hosted	
  EO	
  and	
  climate	
  science	
  datasets	
  
–  Integra?on	
  with	
  environmental	
  sciences	
  community	
  
	
  
•  Lightweight	
  development	
  and	
  deployment	
  philosophy	
  
–  Build	
  on	
  Open	
  Source	
  and	
  community	
  efforts	
  to	
  use	
  what’s	
  already	
  available	
  
	
  
•  How	
  to	
  meet	
  mul?-­‐user	
  support	
  requirement?	
  
–  Buy	
  off-­‐the-­‐shelf:	
  run	
  Wakari	
  on	
  JASMIN-­‐CEMS	
  pla-orm	
  or	
  
–  Try	
  JupyterHub:	
  mul?-­‐user	
  IPython	
  Notebook	
  solu?on	
  or	
  
–  Roll	
  our	
  own	
  solu?on	
  
	
  
•  How	
  to	
  integrate	
  parallel	
  processing?	
  
–  IPython.parallel	
  (ipyparallel)	
  Python	
  API	
  accessed	
  via	
  the	
  Notebook	
  
OPTIRAD	
  JASMIN	
  Cloud	
  Tenancy	
  
Docker	
  Container	
  
VM:	
  Swarm	
  pool	
  0	
  
VM:	
  Swarm	
  pool	
  0	
  
Deployment	
  Architecture	
  
JupyterHub	
  
VM:	
  Swarm	
  pool	
  0	
  
Docker	
  Container	
  
IPython	
  
Notebook	
  
Kernel	
  
Docker	
  Container	
  
IPython	
  
Notebook	
  
Kernel	
  
Kernel	
  
Kernel	
   Parallel	
  
Controller	
  
Parallel	
  
Controller	
  
VM:	
  Swarm	
  
pool	
  0	
  
VM:	
  Swarm	
  
pool	
  0	
  
VM:	
  slave	
  0	
  
Parallel	
  
Engine	
  
Parallel	
  
Engine	
  
Nodes	
  for	
  parallel	
  
Processing	
  
Notebooks	
  and	
  kernels	
  in	
  
containers	
  
Swarm	
  manages	
  alloca?on	
  
of	
  containers	
  for	
  notebooks	
  
Manage	
  users	
  
and	
  provision	
  of	
  
notebooks	
  
Swarm	
  
Firewall	
  	
  
VM:	
  shared	
  services	
  
NFS	
   LDAP	
  
Browser	
  access	
  
Conclusions	
  +	
  Next	
  Steps	
  
•  Experiences	
  from	
  project	
  delivery	
  
–  Off-­‐shelf	
  solu?on	
  using	
  JupyterHub	
  paid	
  off	
  
–  JupyterHub	
  and	
  Swarm	
  was	
  new	
  but	
  
–  Installa?on	
  straigh-orward	
  +	
  opera?onally	
  robust	
  
•  Challenges	
  and	
  future	
  development	
  
–  Extend	
  use	
  of	
  containers	
  for	
  parallel	
  compute	
  
–  Challenge:	
  managing	
  cloud	
  elas?city	
  with	
  both	
  containers	
  and	
  host	
  
VMs	
  
–  Provide	
  object	
  storage	
  –	
  CEPH	
  likely	
  to	
  be	
  adopted	
  
–  Expand	
  from	
  OPTIRAD	
  pilot	
  to	
  wider	
  user	
  community	
  
–  Deploy	
  with	
  toolboxes	
  e.g.	
  Sen?nels	
  or	
  CIS.	
  
Demo	
  .	
  .	
  .	
  
•  A	
  tutorial	
  on	
  EO	
  data	
  
assimila?on	
  
–  Notebook	
  blurs	
  the	
  
tradi?onal	
  separa?on	
  
between	
  tutorial	
  
documenta?on	
  and	
  
using	
  the	
  target	
  system	
  
–  The	
  two	
  are	
  one	
  self-­‐
contained	
  interac?ve	
  
unit	
  J	
  
Further	
  informa?on	
  
•  OPTIRAD:	
  
–  Op?misa?on	
  Environment	
  For	
  Joint	
  Retrieval	
  Of	
  Mul?-­‐Sensor	
  Radiances	
  
(OPTIRAD),	
  Proceedings	
  of	
  the	
  ESA	
  2014	
  Conference	
  on	
  Big	
  Data	
  from	
  
Space	
  (BiDS’14)	
  hip://dx.doi.org/10.2788/1823	
  	
  
•  JASMIN	
  paper	
  (Sept	
  2013)	
  
–  hip://home.badc.rl.ac.uk/lawrence/sta?c/2013/10/14/
LawEA13_Jasmin.pdf	
  
–  Cloud	
  paper	
  to	
  follow	
  soon	
  
•  Cloud-­‐hosted	
  JupyterHub	
  with	
  Docker	
  for	
  teaching:	
  
–  hips://developer.rackspace.com/blog/deploying-­‐jupyterhub-­‐for-­‐
educa?on/	
  	
  	
  
•  JASMIN	
  and	
  CEDA:	
  
–  hip://jasmin.ac.uk/	
  	
  
–  hip://www.ceda.ac.uk	
  
•  @PhilipJKershaw	
  	
  

Weitere ähnliche Inhalte

Kürzlich hochgeladen

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 

Kürzlich hochgeladen (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 

Empfohlen

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 

Empfohlen (20)

Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 

The OPTIRAD Platform: Cloud-hosted IPython Notebooks for collaborative EO Data Analysis and Processing

  • 1. The  OPTIRAD  Pla-orm:  Cloud-­‐hosted  IPython  Notebooks   for  collabora?ve  EO  Data  Analysis  and  Processing   ESA  EO  Open  Science  2.0  Conference  12-­‐14  October  2015     Philip  Kershaw  (CEDA),  John  Holt  (Tessella  plc.)  José  Gómez-­‐Dans,  Philip  Lewis  (UCL)   Nicola  Pounder,  Jon  Styles  (Assimila  Ltd.)   JASMIN  (STFC/Stephen  Kill)  
  • 2. Introduc?on   •  OPTIRAD  =  OPTImisa?on  environment  for  joint  retrieval  of   mul?-­‐sensor  RADiances   –  Collabora?on:  CEDA,  UCL,  Assimila  Ltd,  FastOpt  and  VU  Amsterdam   –  Funded  by  ESA   •  Overview  of  technical  solu?on   –  Introduc?on  to  IPython  (Jupyter)  Notebook   –  Deployment  on  JASMIN-­‐CEMS  science  cloud   •  Make  the  case,  IPython  Notebook  +  Cloud  =  powerful   combina?on  for  EO  Open  Science  2.0  
  • 3. OPTIRAD  Goals   Address  the  challenge  of   producing  consistent  EO   land  surface  informa?on   products  from   heterogeneous  EO  data   input:   Collabora?on:  provide  a  collabora?ve   research  environment  as  a  means  to   engender  closer  working  between  algorithm   specialists,  modellers  and  end  users.       Compu?ng  resources:  processing  at  high   spa?al  and  temporal  resolu?ons  with   computa?onally  expensive  algorithms.       Usability  and  access:  easy  execu?on  and   development  of  exis?ng  Python  code  and   the  provision  of  interac?ve  tutorials  for  new   users  
  • 4. IPython  Notebook   •  Provides  Python  kernels  accessible  via  a   web  browser     •  Sessions  can  be  saved  and  shared     •  Trivial  access  to  parallel  processing   capabili?es  –  IPython.parallel  (ipyparallel)   •  IPython  Jupyter  Notebook   •  Support  for  other  languages  such  as   R   •  New  JupyterHub  allows  mul?-­‐user   management  of  notebooks   •  Gained  trac?on  as  a  teaching  and   collabora?ve  tool    
  • 5. IPython  Notebook  +  Cloud   •  Cloud’s  characteris?cs:   –  Broad  network  access,  resource  pooling,  elas?city,  scale  –  compute  and   storage     –  Good  fit  for  Big  Data  science  applica?ons     •  Cloud-­‐hosted  Notebook  -­‐  a  model  already  demonstrated  with   public  cloud  services  e.g.   –  Wakari,  Azure,  Rackspace     •  Central  hos?ng  allows  central  management  of  socware  packages   –  no  installa?on  steps  needed  for  the  user     •  Algorithm  prototyping  environment  next  to  Big  Data   –  Acts  as  a  precursor  to  opera?onal  processing  services  
  • 6. Notebook:     a  user  –  applica?on  perspec?ve   Support  a  spectrum  of  usage  models   Different   classes  of   user   Long-­‐tail  of  science  users  è  
  • 7. Design  and  development     considera?ons   •  Host  on  JASMIN-­‐CEMS     –  Data  analysis  facility  and  science  cloud  at  Rutherford  Appleton  Lab,  UK   –  Advantage  of  proximity  to  locally  hosted  EO  and  climate  science  datasets   –  Integra?on  with  environmental  sciences  community     •  Lightweight  development  and  deployment  philosophy   –  Build  on  Open  Source  and  community  efforts  to  use  what’s  already  available     •  How  to  meet  mul?-­‐user  support  requirement?   –  Buy  off-­‐the-­‐shelf:  run  Wakari  on  JASMIN-­‐CEMS  pla-orm  or   –  Try  JupyterHub:  mul?-­‐user  IPython  Notebook  solu?on  or   –  Roll  our  own  solu?on     •  How  to  integrate  parallel  processing?   –  IPython.parallel  (ipyparallel)  Python  API  accessed  via  the  Notebook  
  • 8. OPTIRAD  JASMIN  Cloud  Tenancy   Docker  Container   VM:  Swarm  pool  0   VM:  Swarm  pool  0   Deployment  Architecture   JupyterHub   VM:  Swarm  pool  0   Docker  Container   IPython   Notebook   Kernel   Docker  Container   IPython   Notebook   Kernel   Kernel   Kernel   Parallel   Controller   Parallel   Controller   VM:  Swarm   pool  0   VM:  Swarm   pool  0   VM:  slave  0   Parallel   Engine   Parallel   Engine   Nodes  for  parallel   Processing   Notebooks  and  kernels  in   containers   Swarm  manages  alloca?on   of  containers  for  notebooks   Manage  users   and  provision  of   notebooks   Swarm   Firewall     VM:  shared  services   NFS   LDAP   Browser  access  
  • 9. Conclusions  +  Next  Steps   •  Experiences  from  project  delivery   –  Off-­‐shelf  solu?on  using  JupyterHub  paid  off   –  JupyterHub  and  Swarm  was  new  but   –  Installa?on  straigh-orward  +  opera?onally  robust   •  Challenges  and  future  development   –  Extend  use  of  containers  for  parallel  compute   –  Challenge:  managing  cloud  elas?city  with  both  containers  and  host   VMs   –  Provide  object  storage  –  CEPH  likely  to  be  adopted   –  Expand  from  OPTIRAD  pilot  to  wider  user  community   –  Deploy  with  toolboxes  e.g.  Sen?nels  or  CIS.  
  • 10. Demo  .  .  .   •  A  tutorial  on  EO  data   assimila?on   –  Notebook  blurs  the   tradi?onal  separa?on   between  tutorial   documenta?on  and   using  the  target  system   –  The  two  are  one  self-­‐ contained  interac?ve   unit  J  
  • 11. Further  informa?on   •  OPTIRAD:   –  Op?misa?on  Environment  For  Joint  Retrieval  Of  Mul?-­‐Sensor  Radiances   (OPTIRAD),  Proceedings  of  the  ESA  2014  Conference  on  Big  Data  from   Space  (BiDS’14)  hip://dx.doi.org/10.2788/1823     •  JASMIN  paper  (Sept  2013)   –  hip://home.badc.rl.ac.uk/lawrence/sta?c/2013/10/14/ LawEA13_Jasmin.pdf   –  Cloud  paper  to  follow  soon   •  Cloud-­‐hosted  JupyterHub  with  Docker  for  teaching:   –  hips://developer.rackspace.com/blog/deploying-­‐jupyterhub-­‐for-­‐ educa?on/       •  JASMIN  and  CEDA:   –  hip://jasmin.ac.uk/     –  hip://www.ceda.ac.uk   •  @PhilipJKershaw