SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Service and Support for Science IT
Scientific Cloud Experiences
Dr. Peter Kunszt
Director S3IT
Outline
• Introduction
– What is Science IT
– How are we organized
• UZH ScienceCloud Infrastructure and
Implementation
• Science Data and Security/Privacy
Challenge : Scale Up
• High Throughput Instruments
– Much larger data volumes
– Increased data complexity
• Large Collaborations
– More people
– More experiments and measurements
– More coverage
Fire and forget...
• Scientists do not want to be bothered with
infrastructure details
• IT JUST NEEDS TO WORK!
Widening Complexity Gap: IT-Research
Local IT
Resources
Research Labs
Core Facilities
Miracle
SCIENCE IT
What is Science IT ?
FILL THE GAP
Dedicated Support Center
for Science IT
• SPEED : faster time to solution
• ACCESS : to infrastructure,
software, expertise
• ENABLE : use IT technology and
software for new ideas
Speed
Access
Enablement
Supporting Science
• Be a partner to research projects for Science IT
• Provide services to individual researchers, groups and
consortia
– Consultancy for advanced usage of IT in Science
– Research software development and support
– Access to competitive IT infrastructure
– Access to a library of tools and software
– Project management and collaboration support
– Training and education on the usage of infrastructure and software
• Collaborate internally, nationally and internationally with
partners, suppliers and other Science IT units
• Maintain high level of internal expertise on topics relevant to
Science IT
• Advise UZH Governance on evolution of needs, assist in
prioritization
Organization Structures are
Changing
Org A
Org B
Org C
OrgD
Old world: Hierarchical New world: Federated
http://www.fedsm.eu/
S3IT Organization
Core
Team
Site
Team
Site
Team
EE
EE
EE
EE
EE
...
...
EE = Embedded Expert
Working directly in projects
or on-site in groups on
specific tasks
Site Teams
Joint teams with other units
providing local support and
some global services
Core Team
Directorate, Office, core
services, central
infrastructure and
consultancy, project mgmt
Partner Interactions
Core
Facilities
Core
Facilities
Core
Facilities
Agreements
Research
GroupsProjects
Projects
Projects
Partners / Clients
Research
Groups
Research
Groups
Research
Groups
Services
Faculties
Institutes
Departments
Faculties
Institutes
Departments
Central IT
Partners / Suppliers
CSCS
internal
external
VendorsVendorsVendorsVendors
S3IT Core Business: Project
Support
• Infrastructure is important but ‚just‘ a means to an end
• Science IT Support: Applications, access, integration
• Data analysis
• Simulations
• Data Integration
• Application scaling, making use of big infrastructures
• Workflows, automation
• Visualization
• Software design and usage advice, Code Clinic
• Training and education
• ...
14
Understand the science..
.. to map Science IT services!
Mapping Security and Privacy
• Most science follows 3 stages
– Conception, preparation, proposition stage – private
– Project stage (3-5y) – share in group
– Publication of results – open to all
• Some have additional constraints (regulations)
– Medicine – patient data records need consent
(different per country)
– Law and business – confidentiality in projects
– Engineering, pharmacology, etc.. – patents
Infrastructure
• Supercomputing
– Used as a scientific instrument by
• theoretical physics, astrophysics, mathematics, computational
chemistry, biochemistry, quantum chemistry
• Continuous usage
• Cluster computing
– Used as a workhorse by many groups
• Life science, biochem, geoscience, medicine, digital humanities,
banking and finance, art history, ...
• Data analysis, statistical analysis, parameter studies, etc
• Non-continuous usage
• Server computing
– Used as interactive computers by many groups
• All groups. Interactive processing, visualization, steering of
computation. Commercial and open-source tools.
• Daily usage, non-continuous.
Storage Classes
• Large, cheap data store for projects O(xPB)
– No need to be backed up: Easy to regenerate but
time-consuming
• Reliable project data store O(1PB)
– With secondary copy
– Only addition, no changes
• Working storage O(x100TB)
– Active data, databases, server-side processes
• Fast storage for streaming analysis O(100TB)
– Fast changing data, immediate analysis, rare!
Datacenter Consolidation
OCI – S3IT
ZMB
BIOC
MATH
PHYS
IMLS / Neuro
Consolidate
into
Central
Datacenter
Aim: Scale and Secure!
UZH ScienceCloud Implementation
• OpenStack – based on Canonical
• Deployment using Ansible
• Vagrant-like system for configuration:
Elasticluster (developed at UZH)
• Flexible submission and workflow framework
for job control: GC3pie (developed at UZH)
• Database management framework openBIS
for data lifecycle management (developed at
ETH/SystemsX.ch)
Business Model
• Supercomputing
– Investment every 4 years into the system
– Research groups to find 3rd party funding
• Commodity Cloud and Storage
– Subscription / year : Cores, TB
– Per use fee
– Subsidized, not TCO – covering operations
• Servers / Pets
– Yearly or monthly fee
– Size matters
• Yearly acquisition / rollover
– Easy to plan
Experience so far:
• Supercomputing needed only by few groups
– Can be completely outsourced to national center, done as of 2015
• Cloud is suitable for most Science Workloads
– User support scales well
– Can cover very many use cases
– Build dedicated boxes for exceptions, don‘t be driven by them
– Flexibility is key
• Must use local infrastructure for secure, data intensive and
memory intensive workloads
– Data locality needed for COST and (rarely) policy reasons –
exception: medical data
– Hybrid cloud – burst available for CPU intensive jobs
– Deal with heterogeneity
Future Cloud Strategy: HYBRID
• Run sizeable local cloud infrastructure for internal
workloads
• Burst peak loads to public cloud providers
– For selected workloads coherent with policy and cost
Advantages
• Plannable local infrastructure (plan for full usage)
• Flexibility in scaling, quick provisioning of needed
capacity
Open Questions
• Policies. What workloads can be burst to public clouds?
Under what conditions
– Calculations, simulations usually OK
– Data analyis: depends on data (network issues being
resolved)
– Check compliance of cloud providers. ISO, HIPAA, etc
– Adherence to swiss cantonal data protection regulations
• Cost. How to buy public cloud services?
– Public procurement of agreements?
– How not to be bound to a single provider?
– Is this necessary at all?
• How do i charge my users?
– For internal and for external use?
– Aim: consolidate their workload into our cloud. No TCO!
Comments on Security in
academia
• Users in academia are super smart. They remove
barriers faster than you can erect them.
• Do risk assessment and risk analysis instead of
prevention.
• Don‘t do anything ‚for security reasons‘, always qualify
with real risk numbers
• Public Clouds are MUCH MORE secure than our own
– Amazon, Microsoft, IBM etc have whole teams of security
experts – they hired our best students for this 
• It is a question of TRUST
– Regulations by countries
– Do we trust the US not to do industrial and academic
espionage, forcing their own companies to give out our
data?
Scientific Requirements
• Know your workload: Data, Privacy, Science,
Sharing aspects are tightly connected
• Lots of hidden complexity and contradicting
requirements
29
1. What Data?
• Different kinds of ‚BIG‘ data
• Volume, Variety, Velocity, Veracity
• Understanding is Knowledge is Science
– Data vs. Information and Knowledge
– What are the right questions?
– What should be protected, till when?
– How to navigate, explore, evolve
30
WHO OWNS THE DATA?
For science, proprietary data is a hindrance
2. Data Reuse
• Currently a wealth of data is not reused for
new discovery
• Lots of potential! Regulators need to be told..
• Data repositories with computing and search
capability – perfect for Cloud Model
• Do the computation where the data is –
Private, public, hybrid Cloud
31
IP on TOOLS, ease of data USE, not DATA itself.
3. Motivate to annotate
• Scientists publish what is necessary and
prescribed by the journals, not more –
mandate better annotation
• Provide more recognition for producing ´good´
datasets – Data Citation
• Check Data quality – bad quality or
data without annotation has no value
32
Creation of well annotated, sustained public
resources
4. Standard Formats
• Too many ‚Standards‘ or not used
– Instrument vendors often at fault
• Protection of data by proprietary formats
– Data is lost to research
• Do not pay for data in nonstandard
formats
– Data value is zero if unusable
33
Mandate standard formats for domain data
5. Data Sharing/Publishing
• Share in collaborative mode
• Avoid Data Loss
• Motivate and enable data publication
• Establish business model for data publication
(reward/career benefit)
• Journals adapt, see Scientific Data
http://www.nature.com/scientificdata
New role for Archives and Libraries
6. Patient Data Records
• Legal issues of data privacy
• People are not in control of their own data
• Difficult to get consent
• NSA effect – trust
Put citizens back in control
Patient Data Records
• TRUST
– Swiss Cooperative: citizen owned
• NEUTRALITY
– A simple e-Banking system for any personal health data. Same
level of security
• TRACTION
– Volume: it is free, it‘s rewarded
• IMPACT
– Request data directly, avoid legal issues
36
• It is a cooperative, not a business
• Funding by running campaigns to ask people to
participate in research & surveys
• Participants are REWARED for sharing their data or
providing new data
• Build tools on top
• Currently seeking funding
– H2020, foundations
– Projects with hospitals, clinics
37
Approach at S3IT
• Early involvement with Research Groups
– Proposal writing, partnership
– Advice on Data Management, infrastructure, standards
• Strong cooperation with Libraries
– Early involvement with publishers, archives
– Joint information to research groups on data management
plans, data citations
• Seeking contact with funding bodies and decision
makers
– Communicate business plan for Science IT ‚project
consumables‘
– Evaluation of projects based on technology cost and
feasibility
– Usage of public and each others‘ cloud resources for cash
Links
• www.s3it.uzh.ch - Science IT at UZH
• www.sybit.net - Systems Biology IT, SystemsX.ch
• www.erasysapp.eu - Systems Biology, DMMCore
project
• www.healthbank.ch - Public Cooperative being
set up for patient-owned data. Seeking funding
(H2020, pending, and other sources)

Weitere ähnliche Inhalte

Was ist angesagt?

SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010Jisc
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghEDINA, University of Edinburgh
 
10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancyHannelore Vanhaverbeke
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the libraryColleen DeLory
 
Data management (1)
Data management (1)Data management (1)
Data management (1)SM Lalon
 
Planning for Research Data Managment
Planning for Research Data ManagmentPlanning for Research Data Managment
Planning for Research Data ManagmentDaniel Crane
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...Jenny Mitcham
 
SLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research supportSLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research supportLibrary_Connect
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciencesChris Dwan
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Fiona Nielsen
 
Practical Best Practices for Data Management
Practical Best Practices for Data ManagementPractical Best Practices for Data Management
Practical Best Practices for Data ManagementUW Research Data Services
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Lauri Eloranta
 
Data presentation and transfer
Data presentation and transferData presentation and transfer
Data presentation and transferIyad Abou Rabii
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management PlanningSarah Jones
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 

Was ist angesagt? (20)

Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
 
The Ethics of Digital Preservation
The Ethics of Digital PreservationThe Ethics of Digital Preservation
The Ethics of Digital Preservation
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
 
SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of Edinburgh
 
10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
 
Data management (1)
Data management (1)Data management (1)
Data management (1)
 
Planning for Research Data Managment
Planning for Research Data ManagmentPlanning for Research Data Managment
Planning for Research Data Managment
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
SLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research supportSLIDES | 12 time-saving tips for research support
SLIDES | 12 time-saving tips for research support
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciences
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016
 
Practical Best Practices for Data Management
Practical Best Practices for Data ManagementPractical Best Practices for Data Management
Practical Best Practices for Data Management
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
 
Data presentation and transfer
Data presentation and transferData presentation and transfer
Data presentation and transfer
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management Planning
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of OxfordWriting a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
 

Andere mochten auch

Andere mochten auch (6)

Multiplexes1112
Multiplexes1112Multiplexes1112
Multiplexes1112
 
Famous person keynote
Famous person keynoteFamous person keynote
Famous person keynote
 
Science research
Science researchScience research
Science research
 
Spanish activites project
Spanish activites projectSpanish activites project
Spanish activites project
 
Introduction to social media for scientist
Introduction to social media for scientistIntroduction to social media for scientist
Introduction to social media for scientist
 
Online Presence Management
Online Presence ManagementOnline Presence Management
Online Presence Management
 

Ähnlich wie Service and Support for Science IT -Peter Kunzst, University of Zurich

Introduction to Digital Preservation
Introduction to Digital PreservationIntroduction to Digital Preservation
Introduction to Digital PreservationBill LeFurgy
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptxAkhirulAminulloh2
 
eCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design ChallengeeCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design Challengehopbeat
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management PlanKristin Briney
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...ariadnenetwork
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxwahiba ben abdessalem
 
Data Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesData Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesKathirvel Ayyaswamy
 
Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptxshalini s
 
Recovered file 1
Recovered file 1Recovered file 1
Recovered file 1siragezeynu
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxssuser1a4f0f
 
A Lifecycle Approach to Information Privacy
A Lifecycle Approach to Information PrivacyA Lifecycle Approach to Information Privacy
A Lifecycle Approach to Information PrivacyMicah Altman
 
160905 tryggve-at-eccb pursula
160905 tryggve-at-eccb pursula160905 tryggve-at-eccb pursula
160905 tryggve-at-eccb pursulaanttipursula
 
Lecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdfLecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdfahmedibrahimghnnam01
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementJamie Bisset
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020Sarah Jones
 

Ähnlich wie Service and Support for Science IT -Peter Kunzst, University of Zurich (20)

Introduction to Digital Preservation
Introduction to Digital PreservationIntroduction to Digital Preservation
Introduction to Digital Preservation
 
DBMS
DBMSDBMS
DBMS
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
eCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design ChallengeeCitizen Sensible-Data Design Challenge
eCitizen Sensible-Data Design Challenge
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
Data Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesData Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research Opportunities
 
Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptx
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Recovered file 1
Recovered file 1Recovered file 1
Recovered file 1
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
NCCT.pptx
NCCT.pptxNCCT.pptx
NCCT.pptx
 
A Lifecycle Approach to Information Privacy
A Lifecycle Approach to Information PrivacyA Lifecycle Approach to Information Privacy
A Lifecycle Approach to Information Privacy
 
160905 tryggve-at-eccb pursula
160905 tryggve-at-eccb pursula160905 tryggve-at-eccb pursula
160905 tryggve-at-eccb pursula
 
Lecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdfLecture 1-big data engineering (Introduction).pdf
Lecture 1-big data engineering (Introduction).pdf
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020
 

Mehr von Mind the Byte

Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...
Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...
Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...Mind the Byte
 
Progress towards security in the Cloud-Héctor Sánchez, Microsoft
Progress towards security in the Cloud-Héctor Sánchez, MicrosoftProgress towards security in the Cloud-Héctor Sánchez, Microsoft
Progress towards security in the Cloud-Héctor Sánchez, MicrosoftMind the Byte
 
IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...
IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...
IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...Mind the Byte
 
Shared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalut
Shared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalutShared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalut
Shared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalutMind the Byte
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computingMind the Byte
 
What can the cloud do for you?
What can the cloud do for you?What can the cloud do for you?
What can the cloud do for you?Mind the Byte
 
Intorduction to AWS and Boto
Intorduction to AWS and BotoIntorduction to AWS and Boto
Intorduction to AWS and BotoMind the Byte
 

Mehr von Mind the Byte (8)

Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...
Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...
Aleida Alcaide, European cloud partnership. Iniatives of the European Public ...
 
Progress towards security in the Cloud-Héctor Sánchez, Microsoft
Progress towards security in the Cloud-Héctor Sánchez, MicrosoftProgress towards security in the Cloud-Héctor Sánchez, Microsoft
Progress towards security in the Cloud-Héctor Sánchez, Microsoft
 
IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...
IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...
IP strategies to protect your cloud technology-Florian Michalek, Bardehele Pa...
 
Shared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalut
Shared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalutShared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalut
Shared Clinical History of Catalonia-Ignasi Garcia-Milà, Fundació TICSalut
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
 
What can the cloud do for you?
What can the cloud do for you?What can the cloud do for you?
What can the cloud do for you?
 
Intorduction to AWS and Boto
Intorduction to AWS and BotoIntorduction to AWS and Boto
Intorduction to AWS and Boto
 
Mind the Byte
Mind the ByteMind the Byte
Mind the Byte
 

Kürzlich hochgeladen

Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949ps5894268
 
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...delhimodelshub1
 
Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...
Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...
Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...narwatsonia7
 
Leading transformational change: inner and outer skills
Leading transformational change: inner and outer skillsLeading transformational change: inner and outer skills
Leading transformational change: inner and outer skillsHelenBevan4
 
Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...
Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...
Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...delhimodelshub1
 
Call Girls Uppal 7001305949 all area service COD available Any Time
Call Girls Uppal 7001305949 all area service COD available Any TimeCall Girls Uppal 7001305949 all area service COD available Any Time
Call Girls Uppal 7001305949 all area service COD available Any Timedelhimodelshub1
 
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...ggsonu500
 
Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...
Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...
Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...High Profile Call Girls Chandigarh Aarushi
 
hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...
hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...
hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...delhimodelshub1
 
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabaddelhimodelshub1
 
Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...
Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...
Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...narwatsonia7
 
Hi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbers
Hi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbersHi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbers
Hi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbersnarwatsonia7
 
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service GoaRussian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goanarwatsonia7
 
Call Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts Service
Call Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts ServiceCall Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts Service
Call Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts Servicenarwatsonia7
 

Kürzlich hochgeladen (20)

Call Girl Lucknow Gauri 🔝 8923113531 🔝 🎶 Independent Escort Service Lucknow
Call Girl Lucknow Gauri 🔝 8923113531  🔝 🎶 Independent Escort Service LucknowCall Girl Lucknow Gauri 🔝 8923113531  🔝 🎶 Independent Escort Service Lucknow
Call Girl Lucknow Gauri 🔝 8923113531 🔝 🎶 Independent Escort Service Lucknow
 
Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949
 
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
 
VIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service Lucknow
VIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service LucknowVIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service Lucknow
VIP Call Girls Lucknow Isha 🔝 9719455033 🔝 🎶 Independent Escort Service Lucknow
 
Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...
Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...
Call Girls Service Bommasandra - Call 7001305949 Rs-3500 with A/C Room Cash o...
 
Leading transformational change: inner and outer skills
Leading transformational change: inner and outer skillsLeading transformational change: inner and outer skills
Leading transformational change: inner and outer skills
 
Call Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service Dehradun
Call Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service DehradunCall Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service Dehradun
Call Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service Dehradun
 
Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...
Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...
Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...
 
College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...
College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...
College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...
 
Call Girls Uppal 7001305949 all area service COD available Any Time
Call Girls Uppal 7001305949 all area service COD available Any TimeCall Girls Uppal 7001305949 all area service COD available Any Time
Call Girls Uppal 7001305949 all area service COD available Any Time
 
Russian Call Girls Lucknow Khushi 🔝 7001305949 🔝 🎶 Independent Escort Service...
Russian Call Girls Lucknow Khushi 🔝 7001305949 🔝 🎶 Independent Escort Service...Russian Call Girls Lucknow Khushi 🔝 7001305949 🔝 🎶 Independent Escort Service...
Russian Call Girls Lucknow Khushi 🔝 7001305949 🔝 🎶 Independent Escort Service...
 
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...
 
Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...
Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...
Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...
 
hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...
hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...
hyderabad call girl.pdfRussian Call Girls in Hyderabad Amrita 9907093804 Inde...
 
Call Girl Guwahati Aashi 👉 7001305949 👈 🔝 Independent Escort Service Guwahati
Call Girl Guwahati Aashi 👉 7001305949 👈 🔝 Independent Escort Service GuwahatiCall Girl Guwahati Aashi 👉 7001305949 👈 🔝 Independent Escort Service Guwahati
Call Girl Guwahati Aashi 👉 7001305949 👈 🔝 Independent Escort Service Guwahati
 
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
 
Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...
Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...
Housewife Call Girls Nandini Layout - Phone No 7001305949 For Ultimate Sexual...
 
Hi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbers
Hi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbersHi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbers
Hi,Fi Call Girl In Marathahalli - 7001305949 with real photos and phone numbers
 
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service GoaRussian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
 
Call Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts Service
Call Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts ServiceCall Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts Service
Call Girl Service ITPL - [ Cash on Delivery ] Contact 7001305949 Escorts Service
 

Service and Support for Science IT -Peter Kunzst, University of Zurich

  • 1. Service and Support for Science IT Scientific Cloud Experiences Dr. Peter Kunszt Director S3IT
  • 2. Outline • Introduction – What is Science IT – How are we organized • UZH ScienceCloud Infrastructure and Implementation • Science Data and Security/Privacy
  • 3. Challenge : Scale Up • High Throughput Instruments – Much larger data volumes – Increased data complexity • Large Collaborations – More people – More experiments and measurements – More coverage
  • 4. Fire and forget... • Scientists do not want to be bothered with infrastructure details • IT JUST NEEDS TO WORK!
  • 5. Widening Complexity Gap: IT-Research Local IT Resources Research Labs Core Facilities Miracle SCIENCE IT
  • 6. What is Science IT ? FILL THE GAP Dedicated Support Center for Science IT • SPEED : faster time to solution • ACCESS : to infrastructure, software, expertise • ENABLE : use IT technology and software for new ideas Speed Access Enablement
  • 7.
  • 8. Supporting Science • Be a partner to research projects for Science IT • Provide services to individual researchers, groups and consortia – Consultancy for advanced usage of IT in Science – Research software development and support – Access to competitive IT infrastructure – Access to a library of tools and software – Project management and collaboration support – Training and education on the usage of infrastructure and software • Collaborate internally, nationally and internationally with partners, suppliers and other Science IT units • Maintain high level of internal expertise on topics relevant to Science IT • Advise UZH Governance on evolution of needs, assist in prioritization
  • 9. Organization Structures are Changing Org A Org B Org C OrgD Old world: Hierarchical New world: Federated http://www.fedsm.eu/
  • 10. S3IT Organization Core Team Site Team Site Team EE EE EE EE EE ... ... EE = Embedded Expert Working directly in projects or on-site in groups on specific tasks Site Teams Joint teams with other units providing local support and some global services Core Team Directorate, Office, core services, central infrastructure and consultancy, project mgmt
  • 11. Partner Interactions Core Facilities Core Facilities Core Facilities Agreements Research GroupsProjects Projects Projects Partners / Clients Research Groups Research Groups Research Groups Services Faculties Institutes Departments Faculties Institutes Departments Central IT Partners / Suppliers CSCS internal external VendorsVendorsVendorsVendors
  • 12. S3IT Core Business: Project Support • Infrastructure is important but ‚just‘ a means to an end • Science IT Support: Applications, access, integration • Data analysis • Simulations • Data Integration • Application scaling, making use of big infrastructures • Workflows, automation • Visualization • Software design and usage advice, Code Clinic • Training and education • ...
  • 13. 14 Understand the science.. .. to map Science IT services!
  • 14. Mapping Security and Privacy • Most science follows 3 stages – Conception, preparation, proposition stage – private – Project stage (3-5y) – share in group – Publication of results – open to all • Some have additional constraints (regulations) – Medicine – patient data records need consent (different per country) – Law and business – confidentiality in projects – Engineering, pharmacology, etc.. – patents
  • 15. Infrastructure • Supercomputing – Used as a scientific instrument by • theoretical physics, astrophysics, mathematics, computational chemistry, biochemistry, quantum chemistry • Continuous usage • Cluster computing – Used as a workhorse by many groups • Life science, biochem, geoscience, medicine, digital humanities, banking and finance, art history, ... • Data analysis, statistical analysis, parameter studies, etc • Non-continuous usage • Server computing – Used as interactive computers by many groups • All groups. Interactive processing, visualization, steering of computation. Commercial and open-source tools. • Daily usage, non-continuous.
  • 16. Storage Classes • Large, cheap data store for projects O(xPB) – No need to be backed up: Easy to regenerate but time-consuming • Reliable project data store O(1PB) – With secondary copy – Only addition, no changes • Working storage O(x100TB) – Active data, databases, server-side processes • Fast storage for streaming analysis O(100TB) – Fast changing data, immediate analysis, rare!
  • 17. Datacenter Consolidation OCI – S3IT ZMB BIOC MATH PHYS IMLS / Neuro Consolidate into Central Datacenter Aim: Scale and Secure!
  • 18. UZH ScienceCloud Implementation • OpenStack – based on Canonical • Deployment using Ansible • Vagrant-like system for configuration: Elasticluster (developed at UZH) • Flexible submission and workflow framework for job control: GC3pie (developed at UZH) • Database management framework openBIS for data lifecycle management (developed at ETH/SystemsX.ch)
  • 19. Business Model • Supercomputing – Investment every 4 years into the system – Research groups to find 3rd party funding • Commodity Cloud and Storage – Subscription / year : Cores, TB – Per use fee – Subsidized, not TCO – covering operations • Servers / Pets – Yearly or monthly fee – Size matters • Yearly acquisition / rollover – Easy to plan
  • 20. Experience so far: • Supercomputing needed only by few groups – Can be completely outsourced to national center, done as of 2015 • Cloud is suitable for most Science Workloads – User support scales well – Can cover very many use cases – Build dedicated boxes for exceptions, don‘t be driven by them – Flexibility is key • Must use local infrastructure for secure, data intensive and memory intensive workloads – Data locality needed for COST and (rarely) policy reasons – exception: medical data – Hybrid cloud – burst available for CPU intensive jobs – Deal with heterogeneity
  • 21. Future Cloud Strategy: HYBRID • Run sizeable local cloud infrastructure for internal workloads • Burst peak loads to public cloud providers – For selected workloads coherent with policy and cost Advantages • Plannable local infrastructure (plan for full usage) • Flexibility in scaling, quick provisioning of needed capacity
  • 22. Open Questions • Policies. What workloads can be burst to public clouds? Under what conditions – Calculations, simulations usually OK – Data analyis: depends on data (network issues being resolved) – Check compliance of cloud providers. ISO, HIPAA, etc – Adherence to swiss cantonal data protection regulations • Cost. How to buy public cloud services? – Public procurement of agreements? – How not to be bound to a single provider? – Is this necessary at all? • How do i charge my users? – For internal and for external use? – Aim: consolidate their workload into our cloud. No TCO!
  • 23. Comments on Security in academia • Users in academia are super smart. They remove barriers faster than you can erect them. • Do risk assessment and risk analysis instead of prevention. • Don‘t do anything ‚for security reasons‘, always qualify with real risk numbers • Public Clouds are MUCH MORE secure than our own – Amazon, Microsoft, IBM etc have whole teams of security experts – they hired our best students for this  • It is a question of TRUST – Regulations by countries – Do we trust the US not to do industrial and academic espionage, forcing their own companies to give out our data?
  • 24. Scientific Requirements • Know your workload: Data, Privacy, Science, Sharing aspects are tightly connected • Lots of hidden complexity and contradicting requirements 29
  • 25. 1. What Data? • Different kinds of ‚BIG‘ data • Volume, Variety, Velocity, Veracity • Understanding is Knowledge is Science – Data vs. Information and Knowledge – What are the right questions? – What should be protected, till when? – How to navigate, explore, evolve 30 WHO OWNS THE DATA? For science, proprietary data is a hindrance
  • 26. 2. Data Reuse • Currently a wealth of data is not reused for new discovery • Lots of potential! Regulators need to be told.. • Data repositories with computing and search capability – perfect for Cloud Model • Do the computation where the data is – Private, public, hybrid Cloud 31 IP on TOOLS, ease of data USE, not DATA itself.
  • 27. 3. Motivate to annotate • Scientists publish what is necessary and prescribed by the journals, not more – mandate better annotation • Provide more recognition for producing ´good´ datasets – Data Citation • Check Data quality – bad quality or data without annotation has no value 32 Creation of well annotated, sustained public resources
  • 28. 4. Standard Formats • Too many ‚Standards‘ or not used – Instrument vendors often at fault • Protection of data by proprietary formats – Data is lost to research • Do not pay for data in nonstandard formats – Data value is zero if unusable 33 Mandate standard formats for domain data
  • 29. 5. Data Sharing/Publishing • Share in collaborative mode • Avoid Data Loss • Motivate and enable data publication • Establish business model for data publication (reward/career benefit) • Journals adapt, see Scientific Data http://www.nature.com/scientificdata New role for Archives and Libraries
  • 30. 6. Patient Data Records • Legal issues of data privacy • People are not in control of their own data • Difficult to get consent • NSA effect – trust Put citizens back in control
  • 31. Patient Data Records • TRUST – Swiss Cooperative: citizen owned • NEUTRALITY – A simple e-Banking system for any personal health data. Same level of security • TRACTION – Volume: it is free, it‘s rewarded • IMPACT – Request data directly, avoid legal issues 36
  • 32. • It is a cooperative, not a business • Funding by running campaigns to ask people to participate in research & surveys • Participants are REWARED for sharing their data or providing new data • Build tools on top • Currently seeking funding – H2020, foundations – Projects with hospitals, clinics 37
  • 33. Approach at S3IT • Early involvement with Research Groups – Proposal writing, partnership – Advice on Data Management, infrastructure, standards • Strong cooperation with Libraries – Early involvement with publishers, archives – Joint information to research groups on data management plans, data citations • Seeking contact with funding bodies and decision makers – Communicate business plan for Science IT ‚project consumables‘ – Evaluation of projects based on technology cost and feasibility – Usage of public and each others‘ cloud resources for cash
  • 34. Links • www.s3it.uzh.ch - Science IT at UZH • www.sybit.net - Systems Biology IT, SystemsX.ch • www.erasysapp.eu - Systems Biology, DMMCore project • www.healthbank.ch - Public Cooperative being set up for patient-owned data. Seeking funding (H2020, pending, and other sources)