SlideShare ist ein Scribd-Unternehmen logo
1 von 25
CyberGIS Center for Advanced Digital and Spatial Studies
A CyberGIS-Jupyter Framework for
Geospatial Analytics at Scale
Dandong Yin1,2, Yan Liu1,2,3, Anand Padmanabhan1,2,3,
Jeff Terstriep1,3, Johnathan Rush1,3, and Shaowen Wang1,2,3
1CyberGIS Center for Advanced Digital and Spatial Studies
2Department of Geography and Geographic Information Science
3National Center for Supercomputing Applications (NCSA)
University of Illinois at Urbana-Champaign
Practice & Experience in Advanced Research Computing (PEARC17)
July 11, 2017, New Orleans, Louisiana, USA
CyberGIS Center for Advanced Digital and Spatial Studies
Background
• Gateways facilitate and accelerate scientific research and education enabled by
advanced cyberinfrastructure (CI)
• End-user-oriented development approaches often consume significant resource
and time to achieve these desirable goals
2
Advanced Cyberinfrastructure
Gateway Gateway Gateway
• To fully leverage CI, it is necessary to not
only reduce the barrier of accessing CI via
gateways, but also reduce the barrier of
developing gateway applications so that
common researchers could efficiently
deliver their own applications to their
communities
CyberGIS Center for Advanced Digital and Spatial Studies
End-user-orientedmodel
• Final products need to be of fine quality in order to serve diverse users
3Topolens: A CyberGIS Gateway Application
• Usually implemented with
web-application frameworks
• Frontend UI design
• Backend computation
• Database design
• Costly to develop, optimize,
and maintain
• Limited to specific purposes
• Difficult to extend and re-use
CyberGIS Center for Advanced Digital and Spatial Studies
A new gateway paradigm
• In many scientific research scenarios, agility, reproducibility and extensibility are
increasingly important
4
Advanced Cyberinfrastructure
• Community-driven gateway development
• A series of building blocks to be re-organized for
new applications
• A public platform for domain scientists to
develop, reproduce and share their own
analytics at scale
Gateway Gateway
Gateway
• Common and friendly user interfaces for
collaborators, researchers, and gateway
developers
CyberGIS Center for Advanced Digital and Spatial Studies
CyberGIS
• CyberGIS -- geographic information
science and systems (GIS) based on
advanced CI
• Innovate new-generation GIS
• Focus on computational and data-
intensive geospatial problem-solving
within various research and education
domains
• Bridge gaps between geospatial big data,
software and applications through
• innovative cyberGIS supercomputer --
ROGER
5
Wang, S. (2010) A CyberGIS Framework for the Synthesis of
Cyberinfrastructure, GIS, and Spatial Analysis. Annals of the Association
of American Geographers, 100(3): 535-557
CyberGIS Center for Advanced Digital and Spatial Studies
• An interactive scripting interface
• Declarative UI widgets
• Supports general computation and
visualization
• Increasingly popular in data- and
computational sciences
Jupyter Notebook
6
CyberGIS Center for Advanced Digital and Spatial Studies
• An interactive scripting interface
• Declarative UI widgets
• Supports general computation and
visualization
• Increasingly popular in data- and
computational sciences
CyberGIS-Jupyter
7
CyberGIS Center for Advanced Digital and Spatial Studies
CyberGIS-Jupyter: Technologies
8
Batch Job management
Cloud-based architecture
Data/storage synchronization
CyberGIS Center for Advanced Digital and Spatial Studies
CyberGIS-Jupyter: Innovation
• Geovisualization
• Interactive map generation inside notebooks
• Support multiple formats of geospatial data
• Layer management, transparency and styles
9
No existing technologies found that match all the
criteria above. Therefore, we developed a library
named Floret to serve the purpose (shown in
demo).
CyberGIS Center for Advanced Digital and Spatial Studies
Features
• Adaptive interfaces for CI access embedded in notebooks
• Programming or GUI at choice
• Manage batch jobs at fingertips
• Cloud-based environment
• Reproducible computing environments with Docker
• Elastic resource-provisioning with OpenStack
• Shared storage
• Big data support
• Fault tolerance
• Collaborative sharing
• Generative geovisualization by programming
• Embedded web-mapping services
• Support multiple geographic data formats
10
CyberGIS Center for Advanced Digital and Spatial Studies
Architecture - 1
• Gateway applications are standardized as packages deployed on hybrid HPC
11
CyberGIS Center for Advanced Digital and Spatial Studies
Architecture - 2
12
CyberGIS Center for Advanced Digital and Spatial Studies
• Highly extensible
• Predefined UI templates
User interface
13
CyberGIS Center for Advanced Digital and Spatial Studies
• PAM with LDAP + SSSD
• Whitelist of authorized users
Authentication and authorization
14
CyberGIS Center for Advanced Digital and Spatial Studies
• Using JupyterHub with swarm spawner
• Openstack VMs join/leave swarm as needed
Container management
15
CyberGIS Center for Advanced Digital and Spatial Studies
• Widgets for composing job scripts (PBS files)
• Submit and retrieve information (via qsub and qstat)
Computation management
16
CyberGIS Center for Advanced Digital and Spatial Studies
• GPFS -> NFS -> VM Volume -> Container Volume
• Enable volume expansion, fault-tolerance and low-latency feedback
Data persistency
17
CyberGIS Center for Advanced Digital and Spatial Studies
Case study
18
• Height Above Nearest Drainage (HAND) at 10m for continental US
• Flood analysis map derived from 10m USGS 3DEP national elevation dataset (180 billion cells) and
National Hydrography Dataset (2.67 million stream reaches)
• Goals
a. Collaborative methodology development
b. HPC for data analytics
c. Deliver methodology and data products to
different user communities
• Collaborators
• Researchers
• Decision makers
• Students
CyberGIS Center for Advanced Digital and Spatial Studies
• Input data hosts
• ROGER GPFS storage
• Docker image
• GDAL, GEOS, PROJ4,
• HDF4, SQLite, MPI, etc.
• Sample notebook
• Explicit computation steps
• Seamless scaling up
• Output visualization
• Floret maps
HAND application package
19
CyberGIS Center for Advanced Digital and Spatial Studies
Comparison - 1
Collaboration
Requirements
Traditional Enhanced
Methodology
development
● Writeups as shared documents
● Communicated via emails, teleconf
● Methodology notebooks
○ Math formula
○ Code snippets
○ Sample results
Software
development
● Source codes only; computing
environment needs to be maintained and
synchronized manually
● Function notebooks with both source
codes and computing environments
effectively synchronized between
collaborators
Computation
● Conducted by a dedicated person, a
bottleneck
● Notebook interface to workflow
computation on advanced CI
○ Everyone can launch
20
CyberGIS Center for Advanced Digital and Spatial Studies
Comparison - 2
Result validation
● Data: direct download or via iRODS
● Validation results: shared document in
Google Drive
● Integrated validation notebooks with
reproducible input, statistics, and output
Visualization
● Local: download and use desktop GIS.
Not scalable for large outputs.
● Online: Tile Map Service (TMS); web-
GIS. Only available for major output data
● Integrated data, code, and visualization
notebooks
○ Traditional visualization libraries
○ Jupyter IFrame cells
○ CyberGIS Floret library
Collaboration
Requirements
Traditional Enhanced
21
CyberGIS Center for Advanced Digital and Spatial Studies
Feedbacks
• The CyberGIS-Jupyter framework was a major platform that supported UCGIS
summer school 2017
• Received impressively positive feedbacks from summer school attendees and
organizers
• According to an on-site survey, over 80% attendees strongly confirm the
importance of CyberGIS-Jupyter to enhancing cyberGIS-enabled research and
education
22
CyberGIS Center for Advanced Digital and Spatial Studies
Conclusions
• A new gateway development framework for facilitating and accelerating CI-
enabled research and education
•Domain scientists can easily adapt and create their own applications
•Students/collaborators can easily understand, reproduce and extend such gateway capabilities
• Leveraging cutting-edge technologies
•Jupyter notebooks and associated interactive interfaces
•Cloud-based elastic architecture
• To boost geospatial analytics and discoveries at desirable computing and
community scale
23
CyberGIS Center for Advanced Digital and Spatial Studies
Acknowledgements
• This work is supported in part by the National Science Foundation (NSF) under
grant numbers 1047916 and 1443080. The computational work used the NSF-
supported ROGER supercomputer (1429699).
• This work is also supported in part by the ECSS program of XSEDE, which is
supported by NSF grant number 1053575.
24
CyberGIS Center for Advanced Digital and Spatial Studies
DEMO
https://jupyter.cybergis.tk
25

Weitere ähnliche Inhalte

Ähnlich wie PEARC17: a CyberGIS-Jupyter framework for geospatial analytics at scale

GIS & Facilities - NJAPPA Talk - April 26 2012
GIS & Facilities - NJAPPA Talk - April 26 2012GIS & Facilities - NJAPPA Talk - April 26 2012
GIS & Facilities - NJAPPA Talk - April 26 2012John Reiser
 
SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...Sandra Gesing
 
Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...
Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...
Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...Larry Smarr
 
Rack Cluster Deployment for SDSC Supercomputer
Rack Cluster Deployment for SDSC SupercomputerRack Cluster Deployment for SDSC Supercomputer
Rack Cluster Deployment for SDSC SupercomputerRebekah Rodriguez
 
Open Geographies and Open Software
Open Geographies and Open SoftwareOpen Geographies and Open Software
Open Geographies and Open SoftwareJames Crone
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSEd Dodds
 
COBWEB technology platform and future development needs
COBWEB technology platform and future development needsCOBWEB technology platform and future development needs
COBWEB technology platform and future development needsEDINA, University of Edinburgh
 
COBWEB technology platform and future development needs, ISPRA 2016
COBWEB technology platform and future development needs, ISPRA 2016COBWEB technology platform and future development needs, ISPRA 2016
COBWEB technology platform and future development needs, ISPRA 2016COBWEB Project
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Larry Smarr
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube
 
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven RamageGeospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven RamageSteven Ramage
 
Big data analytics and machine intelligence v5.0
Big data analytics and machine intelligence   v5.0Big data analytics and machine intelligence   v5.0
Big data analytics and machine intelligence v5.0Amr Kamel Deklel
 
Enabling the physical world to the Internet and potential benefits for agricu...
Enabling the physical world to the Internet and potential benefits for agricu...Enabling the physical world to the Internet and potential benefits for agricu...
Enabling the physical world to the Internet and potential benefits for agricu...Andreas Kamilaris
 
Pioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCPioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCinside-BigData.com
 
GlobusWorld 2020 Keynote
GlobusWorld 2020 KeynoteGlobusWorld 2020 Keynote
GlobusWorld 2020 KeynoteGlobus
 
The Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an InstituteThe Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an InstituteSandra Gesing
 

Ähnlich wie PEARC17: a CyberGIS-Jupyter framework for geospatial analytics at scale (20)

GIS & Facilities - NJAPPA Talk - April 26 2012
GIS & Facilities - NJAPPA Talk - April 26 2012GIS & Facilities - NJAPPA Talk - April 26 2012
GIS & Facilities - NJAPPA Talk - April 26 2012
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...
 
Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...
Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...
Amy Walton - NSF’s Computational Ecosystem for 21st Century Science & Enginee...
 
Rack Cluster Deployment for SDSC Supercomputer
Rack Cluster Deployment for SDSC SupercomputerRack Cluster Deployment for SDSC Supercomputer
Rack Cluster Deployment for SDSC Supercomputer
 
Open Geographies and Open Software
Open Geographies and Open SoftwareOpen Geographies and Open Software
Open Geographies and Open Software
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
 
COBWEB technology platform and future development needs
COBWEB technology platform and future development needsCOBWEB technology platform and future development needs
COBWEB technology platform and future development needs
 
COBWEB technology platform and future development needs, ISPRA 2016
COBWEB technology platform and future development needs, ISPRA 2016COBWEB technology platform and future development needs, ISPRA 2016
COBWEB technology platform and future development needs, ISPRA 2016
 
SGCI OAC webinar 4 18-19
SGCI OAC webinar 4 18-19SGCI OAC webinar 4 18-19
SGCI OAC webinar 4 18-19
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013
 
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven RamageGeospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
 
Introduction to GIS
Introduction to GISIntroduction to GIS
Introduction to GIS
 
Big data analytics and machine intelligence v5.0
Big data analytics and machine intelligence   v5.0Big data analytics and machine intelligence   v5.0
Big data analytics and machine intelligence v5.0
 
Enabling the physical world to the Internet and potential benefits for agricu...
Enabling the physical world to the Internet and potential benefits for agricu...Enabling the physical world to the Internet and potential benefits for agricu...
Enabling the physical world to the Internet and potential benefits for agricu...
 
Pioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCPioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSC
 
GlobusWorld 2020 Keynote
GlobusWorld 2020 KeynoteGlobusWorld 2020 Keynote
GlobusWorld 2020 Keynote
 
The Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an InstituteThe Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an Institute
 

Kürzlich hochgeladen

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Kürzlich hochgeladen (20)

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

PEARC17: a CyberGIS-Jupyter framework for geospatial analytics at scale

  • 1. CyberGIS Center for Advanced Digital and Spatial Studies A CyberGIS-Jupyter Framework for Geospatial Analytics at Scale Dandong Yin1,2, Yan Liu1,2,3, Anand Padmanabhan1,2,3, Jeff Terstriep1,3, Johnathan Rush1,3, and Shaowen Wang1,2,3 1CyberGIS Center for Advanced Digital and Spatial Studies 2Department of Geography and Geographic Information Science 3National Center for Supercomputing Applications (NCSA) University of Illinois at Urbana-Champaign Practice & Experience in Advanced Research Computing (PEARC17) July 11, 2017, New Orleans, Louisiana, USA
  • 2. CyberGIS Center for Advanced Digital and Spatial Studies Background • Gateways facilitate and accelerate scientific research and education enabled by advanced cyberinfrastructure (CI) • End-user-oriented development approaches often consume significant resource and time to achieve these desirable goals 2 Advanced Cyberinfrastructure Gateway Gateway Gateway • To fully leverage CI, it is necessary to not only reduce the barrier of accessing CI via gateways, but also reduce the barrier of developing gateway applications so that common researchers could efficiently deliver their own applications to their communities
  • 3. CyberGIS Center for Advanced Digital and Spatial Studies End-user-orientedmodel • Final products need to be of fine quality in order to serve diverse users 3Topolens: A CyberGIS Gateway Application • Usually implemented with web-application frameworks • Frontend UI design • Backend computation • Database design • Costly to develop, optimize, and maintain • Limited to specific purposes • Difficult to extend and re-use
  • 4. CyberGIS Center for Advanced Digital and Spatial Studies A new gateway paradigm • In many scientific research scenarios, agility, reproducibility and extensibility are increasingly important 4 Advanced Cyberinfrastructure • Community-driven gateway development • A series of building blocks to be re-organized for new applications • A public platform for domain scientists to develop, reproduce and share their own analytics at scale Gateway Gateway Gateway • Common and friendly user interfaces for collaborators, researchers, and gateway developers
  • 5. CyberGIS Center for Advanced Digital and Spatial Studies CyberGIS • CyberGIS -- geographic information science and systems (GIS) based on advanced CI • Innovate new-generation GIS • Focus on computational and data- intensive geospatial problem-solving within various research and education domains • Bridge gaps between geospatial big data, software and applications through • innovative cyberGIS supercomputer -- ROGER 5 Wang, S. (2010) A CyberGIS Framework for the Synthesis of Cyberinfrastructure, GIS, and Spatial Analysis. Annals of the Association of American Geographers, 100(3): 535-557
  • 6. CyberGIS Center for Advanced Digital and Spatial Studies • An interactive scripting interface • Declarative UI widgets • Supports general computation and visualization • Increasingly popular in data- and computational sciences Jupyter Notebook 6
  • 7. CyberGIS Center for Advanced Digital and Spatial Studies • An interactive scripting interface • Declarative UI widgets • Supports general computation and visualization • Increasingly popular in data- and computational sciences CyberGIS-Jupyter 7
  • 8. CyberGIS Center for Advanced Digital and Spatial Studies CyberGIS-Jupyter: Technologies 8 Batch Job management Cloud-based architecture Data/storage synchronization
  • 9. CyberGIS Center for Advanced Digital and Spatial Studies CyberGIS-Jupyter: Innovation • Geovisualization • Interactive map generation inside notebooks • Support multiple formats of geospatial data • Layer management, transparency and styles 9 No existing technologies found that match all the criteria above. Therefore, we developed a library named Floret to serve the purpose (shown in demo).
  • 10. CyberGIS Center for Advanced Digital and Spatial Studies Features • Adaptive interfaces for CI access embedded in notebooks • Programming or GUI at choice • Manage batch jobs at fingertips • Cloud-based environment • Reproducible computing environments with Docker • Elastic resource-provisioning with OpenStack • Shared storage • Big data support • Fault tolerance • Collaborative sharing • Generative geovisualization by programming • Embedded web-mapping services • Support multiple geographic data formats 10
  • 11. CyberGIS Center for Advanced Digital and Spatial Studies Architecture - 1 • Gateway applications are standardized as packages deployed on hybrid HPC 11
  • 12. CyberGIS Center for Advanced Digital and Spatial Studies Architecture - 2 12
  • 13. CyberGIS Center for Advanced Digital and Spatial Studies • Highly extensible • Predefined UI templates User interface 13
  • 14. CyberGIS Center for Advanced Digital and Spatial Studies • PAM with LDAP + SSSD • Whitelist of authorized users Authentication and authorization 14
  • 15. CyberGIS Center for Advanced Digital and Spatial Studies • Using JupyterHub with swarm spawner • Openstack VMs join/leave swarm as needed Container management 15
  • 16. CyberGIS Center for Advanced Digital and Spatial Studies • Widgets for composing job scripts (PBS files) • Submit and retrieve information (via qsub and qstat) Computation management 16
  • 17. CyberGIS Center for Advanced Digital and Spatial Studies • GPFS -> NFS -> VM Volume -> Container Volume • Enable volume expansion, fault-tolerance and low-latency feedback Data persistency 17
  • 18. CyberGIS Center for Advanced Digital and Spatial Studies Case study 18 • Height Above Nearest Drainage (HAND) at 10m for continental US • Flood analysis map derived from 10m USGS 3DEP national elevation dataset (180 billion cells) and National Hydrography Dataset (2.67 million stream reaches) • Goals a. Collaborative methodology development b. HPC for data analytics c. Deliver methodology and data products to different user communities • Collaborators • Researchers • Decision makers • Students
  • 19. CyberGIS Center for Advanced Digital and Spatial Studies • Input data hosts • ROGER GPFS storage • Docker image • GDAL, GEOS, PROJ4, • HDF4, SQLite, MPI, etc. • Sample notebook • Explicit computation steps • Seamless scaling up • Output visualization • Floret maps HAND application package 19
  • 20. CyberGIS Center for Advanced Digital and Spatial Studies Comparison - 1 Collaboration Requirements Traditional Enhanced Methodology development ● Writeups as shared documents ● Communicated via emails, teleconf ● Methodology notebooks ○ Math formula ○ Code snippets ○ Sample results Software development ● Source codes only; computing environment needs to be maintained and synchronized manually ● Function notebooks with both source codes and computing environments effectively synchronized between collaborators Computation ● Conducted by a dedicated person, a bottleneck ● Notebook interface to workflow computation on advanced CI ○ Everyone can launch 20
  • 21. CyberGIS Center for Advanced Digital and Spatial Studies Comparison - 2 Result validation ● Data: direct download or via iRODS ● Validation results: shared document in Google Drive ● Integrated validation notebooks with reproducible input, statistics, and output Visualization ● Local: download and use desktop GIS. Not scalable for large outputs. ● Online: Tile Map Service (TMS); web- GIS. Only available for major output data ● Integrated data, code, and visualization notebooks ○ Traditional visualization libraries ○ Jupyter IFrame cells ○ CyberGIS Floret library Collaboration Requirements Traditional Enhanced 21
  • 22. CyberGIS Center for Advanced Digital and Spatial Studies Feedbacks • The CyberGIS-Jupyter framework was a major platform that supported UCGIS summer school 2017 • Received impressively positive feedbacks from summer school attendees and organizers • According to an on-site survey, over 80% attendees strongly confirm the importance of CyberGIS-Jupyter to enhancing cyberGIS-enabled research and education 22
  • 23. CyberGIS Center for Advanced Digital and Spatial Studies Conclusions • A new gateway development framework for facilitating and accelerating CI- enabled research and education •Domain scientists can easily adapt and create their own applications •Students/collaborators can easily understand, reproduce and extend such gateway capabilities • Leveraging cutting-edge technologies •Jupyter notebooks and associated interactive interfaces •Cloud-based elastic architecture • To boost geospatial analytics and discoveries at desirable computing and community scale 23
  • 24. CyberGIS Center for Advanced Digital and Spatial Studies Acknowledgements • This work is supported in part by the National Science Foundation (NSF) under grant numbers 1047916 and 1443080. The computational work used the NSF- supported ROGER supercomputer (1429699). • This work is also supported in part by the ECSS program of XSEDE, which is supported by NSF grant number 1053575. 24
  • 25. CyberGIS Center for Advanced Digital and Spatial Studies DEMO https://jupyter.cybergis.tk 25

Hinweis der Redaktion

  1. In this presentation we will share our work of using Jupyter as a new type of gateway to cyberinfrastructure and cyberGIS. We deployed JupyterHub on ROGER supercomputing system, leveraging Openstack cloud,
  2. 5 PB storage in container