Research Infrastructures
H3ABioNet case study
Prof Nicola Mulder
H3ABioNet PI
Head of Computational Biology
University of Cape Town
Outline
• Introduction to H3Africa and H3ABioNet
• H3Africa data
• Data sharing policy
• Building infrastructure
• Computing infrastructure
• Human capacity
• Data harmonization & curation
• Facilitating data access
H3Africa: Human Heredity & Health in
Africa
• H3Africa Vision: “To facilitate an Africa-based
contemporary research approach to the study of
genomics and environmental determinants of
common diseases with the goal of improving the
health of African populations”
• Funding: NIH, Wellcome Trust/AESA
The H3Africa Consortium
• 14 Collaborative Centers
• 13 Research Projects
• 3 Pilot Biorepositories
• 8 Ethics Grants
• 4 Global Health Bioinformatics Training Programs
• Bioinformatics Network: H3ABioNet
H3ABioNet Informatics network
• H3ABioNet is a Pan-African informatics network that provides
bioinformatics infrastructure and support for the H3Africa
consortium
• Round 1: 34 partners in 14 African countries
• Round 2: 28 partners in 17 countries
• Activities:
• Infrastructure
• User support
• Research
• Training
www.h3abionet.org
H3Africa data (Phase I)
• Phenotype data (associated with genotype data)
– Demographic information
– Anthropometric data
– Disease and health related phenotype data
• Genetic variation data (human and pathogen)
– Sequence data (whole genome, exome, targeted)
• Genotyping chip array data
– ~55,000 samples to be run on an H3Africa African custom chip
• Microbiome sequence data
– Patient/sample phenotypes
– Non-human 16S rRNA sequence data for microbiome
– Non-human full genome sequence data for microbiome
– Possible human sequence contamination
• Biospecimens to be deposited at the H3Africa biorepositories
Image credits: National Human Genome Research Institute (https://www.genome.gov/imagegallery/)
Why share data?
• New era of open science
• Enables reproducible science
• Increases visibility and credibility of data generators
• Additional publications and citations
• New research questions can be asked of data
• New discoveries made of relevance to participants
• Increasing sample size
• Increases value of the data
• Funder requirement
Limits to sharing human genetic data
• Data can be stored indefinitely; biobank specimens can be
stored for up to 20 years (secondary use, rapid innovation
with ‘omics technologies)
• Blood sample collection and visits to clinics are
associated with disease and treatment, even for
healthy controls
• Ethics consent in H3Africa: some projects have
broad consent, others use tiered consent or
specific consent
• History of vulnerable populations, low education
levels and exploitation
• Anonymized, but risk of identification
Ethical considerations
• Informed consent
• Participant identification
• Stigmatisation
• Benefit sharing
Human genetic data privacy
• Age & Sex
• Country of birth
• Current residence
• Native language
• Ethno-linguistic/tribal affiliation
• Country of birth of father and mother
• Native language of father and mother
• Ethno-linguistic/tribal affiliation of
mother and father
• Height
• Weight
• Current medications
• Smoking history
• Alcohol history
• The combination of phenotype and genetic data makes it possible to
identify different populations and individuals, hence restricted access
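The re-identification risk can be illustrated with a minimal sketch. It counts how many records become unique as more of the fields listed above are combined. The records, field names and values are invented for illustration and are not H3Africa data.

```python
from collections import Counter

# Hypothetical records: field names and values are invented for illustration.
records = [
    {"age": 34, "sex": "F", "country_of_birth": "A", "native_language": "X"},
    {"age": 34, "sex": "F", "country_of_birth": "A", "native_language": "X"},
    {"age": 51, "sex": "M", "country_of_birth": "B", "native_language": "Y"},
    {"age": 51, "sex": "M", "country_of_birth": "B", "native_language": "Z"},
]


def unique_combinations(rows, fields):
    """Return the rows whose combination of `fields` occurs exactly once."""
    def key(row):
        return tuple(row[f] for f in fields)

    counts = Counter(key(row) for row in rows)
    return [row for row in rows if counts[key(row)] == 1]


# With two fields no record stands out; with four fields two records do.
few = unique_combinations(records, ["age", "sex"])
many = unique_combinations(
    records, ["age", "sex", "country_of_birth", "native_language"]
)
print(len(few), len(many))
```

A record that is unique on its combined fields is a candidate for re-identification, which is one reason to release such fields only under controlled access.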
H3Africa Data Sharing Access and
Release Policy
• Balance between ensuring adequate safeguards to protect
participants and not being a barrier for scientists to advance
research
• Maximizing the availability of research data, in a timely and
responsible manner
• Protecting the rights and privacy of human subjects who
participated in research studies
• Recognizing the scientific contribution of researchers who
generated the data
• Considering the nature and ethics of the research proposed in
establishing the timely release of data, and mechanisms of data
sharing
• Promoting deposition of genomic data in existing community data
repositories whenever possible
H3Africa DSAR policy
• For genomic and phenotype data:
• Submit to H3Africa archive
• 9 months to submit to public repository
• 12 month publication embargo
• In EGA, access is controlled by the DBAC
Data release timeline:
• Research site: data generation
• 2 months: research site, QC of genomic & phenotypic data
• 9 months: H3ABioNet, genomic & phenotypic data stored
• 12 months: EGA, genomic & phenotypic data available through DBAC with publication embargo
• Long term: EGA, genomic & phenotypic data available through DBAC without publication embargo
• Total: 23 months
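The timeline above can be turned into milestone dates with a minimal sketch. It assumes the three stages run back to back (2 + 9 + 12 = 23 months, matching the total on the slide). The start date, function names and dictionary keys are hypothetical.

```python
from datetime import date

# Stage lengths in months, taken from the timeline above.
QC_MONTHS = 2
ARCHIVE_MONTHS = 9
EMBARGO_MONTHS = 12


def add_months(start, months):
    """Shift a date forward by whole months; the day is clamped to 28 for simplicity."""
    index = start.year * 12 + (start.month - 1) + months
    return date(index // 12, index % 12 + 1, min(start.day, 28))


def milestones(data_generated):
    """Return milestone dates, assuming the stages run consecutively."""
    qc_done = add_months(data_generated, QC_MONTHS)
    ega_submission = add_months(qc_done, ARCHIVE_MONTHS)
    embargo_end = add_months(ega_submission, EMBARGO_MONTHS)
    return {
        "qc_done": qc_done,
        "ega_submission": ega_submission,
        "embargo_end": embargo_end,
    }


# Hypothetical data generation date, for illustration only.
plan = milestones(date(2018, 1, 15))
for name, when in plan.items():
    print(name, when.isoformat())
```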
Data and Biospecimen Access Committee
• Review and approve requests for data and/or biospecimens
• Biospecimens:
• For the first 3 years, access outside H3Africa only for those
collaborating in Africa
• Use info on availability in biobanks
• Data generated must be submitted to EGA
• Scientific review/funding available
• Data
• DBAC will ensure requestor has expertise and resources
• Scientific review
• Evaluation criteria
• Scientific merit
• Institutional capacity for the research
• Potential for publication or translation, e.g. new therapies
Data access agreement
• H3Africa not liable for use of data
• Only use data for agreed purpose
• Maintain data confidentiality
• Make sure data is secure
• Acknowledge source of data
• Submit annual reports
• Project put onto website
• Access is granted for 1 year
What is required for sharing data?
• Consent from participants: varying consent within a study
is difficult
• Robust data sharing model with implementation strategy
for data access, transfer, etc.
• Access agreements and MoUs
• Infrastructure for
• Data transfer
• Data storage & compute
• Training
• Data curation and harmonization
Infrastructure development & support
• Node server purchases
• Sys Admin “How to” documents
• Access to HPC, Cloud (Docker containers)
• Internet connectivity measurement: NetMap
• Data transfer: Globus Online, testing vs Aspera
• Data storage
• Training in IT, data management and general bioinformatics use
H3ABioNet combined equipment: 512 cores, 2384 GB RAM, 120 TB storage
Building human capacity for genomics
data management
• Need to train
• Bioinformaticians
• Data scientists
• Bioinformatics users
• Medical professionals
Specialised courses, shadow teams, internships
ISCB, EMBL-EBI training team
Training Approaches
Face to face Workshops
Train-the-Trainer
Internships
Live Online Training
Hackathons/Data Jamborees
Access to training materials
Harmonizing H3Africa data
• Mapping biobank data to OMIABIS ontology
• Mapping CRFs to ontologies, e.g. phenotype or disease ontology
• Mapping genomics data to Experimental Factor Ontology
• PHWG has developed a set of core phenotypes and a standard CRF
• Mapping ethics consent info to Data Use Ontology
Biorepositories
Archive & EGA
Catalogue
Making data FAIR
• Findable, Accessible, Interoperable, and Re-usable
https://www.force11.org/group/fairgroup/fairprinciples
• To be Findable: identifier, metadata, indexed
• To be Accessible: find by identifier, clear rules for
access and authentication
• To be Interoperable: standardized and cross-referenced
• To be Reusable: licensed, metadata with provenance,
standards
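As a rough illustration of the four points above, the sketch below checks a dataset metadata record for the fields each principle calls for. The field names, the grouping of fields under each principle, and the example record are assumptions made for this sketch. They are not an official FAIR schema or the H3Africa catalogue format.

```python
# Hypothetical metadata field names; the checks paraphrase the four bullet points above.
FAIR_CHECKS = {
    "findable": ["identifier", "description"],
    "accessible": ["access_rules"],
    "interoperable": ["ontology_terms"],
    "reusable": ["license", "provenance"],
}


def fair_gaps(record):
    """Return, per principle, the expected fields that are missing or empty."""
    gaps = {}
    for principle, fields in FAIR_CHECKS.items():
        missing = [f for f in fields if not record.get(f)]
        if missing:
            gaps[principle] = missing
    return gaps


example = {
    "identifier": "DATASET-0001",  # placeholder, not a real accession
    "description": "Genotype array data with core phenotypes",
    "access_rules": "Controlled access via a data access committee",
    "ontology_terms": [],  # nothing mapped yet
    "license": "",  # not yet assigned
    "provenance": "Generated at research site, quality checked, archived",
}

gaps = fair_gaps(example)
print(gaps)
```

A check like this only flags missing fields. Whether the values are adequate still needs curation.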
H3Africa Data Archive
• Assist H3Africa projects as data coordination center:
Transfer, Validate, Store, Submit to EGA
• Obtain EGA accessions for publications
• 0.5 petabytes storage size including offsite replication
• Local EGA feasibility?
Data and biospecimen catalogue
Beacons
…a simple public web service … designed
merely to accept a query of the form "Do you
have any genomes with an 'A' at position
100,735 on chromosome 3" (or similar data)
and responds with one of "Yes" or "No."
genomicsandhealth.org
• Advantages
• Locally hosted
• Minimal information (yes/no for a
given allele)
• Protection against “scraping”
https://goo.gl/Bkd0dx
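The yes/no behaviour described in the quote can be sketched in a few lines. This is a toy in-memory lookup under invented data, not the GA4GH Beacon API. The variant store and function name are hypothetical.

```python
# Hypothetical in-memory variant store: (chromosome, position, allele) tuples.
VARIANTS = {
    ("3", 100735, "A"),
    ("7", 5520, "T"),
}


def beacon_query(chromosome, position, allele):
    """Answer only "Yes" or "No"; no sample-level or frequency data is returned."""
    found = (str(chromosome), int(position), allele.upper()) in VARIANTS
    return "Yes" if found else "No"


print(beacon_query("3", 100735, "A"))
print(beacon_query("3", 100735, "G"))
```

Because the response carries a single bit per query, the host keeps the underlying genomes local. Protection against scraping would still need query limits on top of this, which the sketch does not implement.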
Summary
• H3Africa is the largest collection of human biomedical
data in Africa to date
• Human data is sensitive and needs to be shared
while protecting participants and researchers
• Need to build infrastructure for sharing:
• harmonized/curated metadata
• storage and transfer facilities
• human capacity (skills)
• Need to provide access tools: web interface, public
repositories, database
• Trying to promote open science: user groups,
sessions
Acknowledgements
The H3ABioNet Consortium
Funding: NIH Common Fund, NHGRI grants U41HG006941, U24HG006941
H3ABioNet team at CBIO:
• Sumir Panji
• Gerrit Botha
• Ayton Meintjes
• Suresh Maslamoney
• Vicky Nembaware
• Ziyaad Parker
• Kim Gurwitz
• Mamana Mbiyavanga
• Katherine Johnston
Slides: Sumir Panji, Michelle Skelton

Research Infrastructures H3ABioNet case study/Nicola Mulder

  • 1. Research Infrastructures H3ABioNet case study Prof Nicola Mulder H3ABioNet PI Head of Computational Biology University of Cape Town
  • 2. Outline • Introduction to H3Africa and H3ABioNet • H3Africa data • Data sharing policy • Building infrastructure • Computing infrastructure • Human capacity • Data harmonization & curation • Facilitating data access
  • 3. H3Africa: Human Heredity & Health in Africa • H3Africa Vision: “To facilitate an Africa-based contemporary research approach to the study of genomics and environmental determinants of common diseases with the goal of improving the health of African populations” • Funding: NIH, Wellcome Trust/AESA
  • 4. The H3Africa Consortium • 14 Collaborative Centers • 13 Research Projects • 3 Pilot Biorepositories • 8 Ethics Grants • Bioinformatics Network (H3ABioNet) • 4 Global Health Bioinformatics Training Programs
  • 5. H3ABioNet Informatics network • H3ABioNet is a Pan African Informatics Network, to provide bioinformatics infrastructure and support for the H3Africa consortium • Round 1: 34 partners in 14 African countries • Round 2: 28 partners in 17 countries • Activities: • Infrastructure • User support • Research • Training www.h3abionet.org
  • 6. H3Africa data (Phase I) • Phenotype data (associated with genotype data) – Demographic information – Anthropometric data – Disease and health related phenotype data • Genetic variation data (human and pathogen) – Sequence data (whole genome, exome, targeted) • Genotyping chip array data – ~55,000 samples to be run on an H3Africa African custom chip • Microbiome sequence data – Patient/sample phenotypes – Non-human 16S rRNA sequence data for microbiome – Non-human full genome sequence data for microbiome – Possible human sequence contamination • Biospecimens to be deposited at the H3Africa biorepositories Image credits: National Human Genome Research Institute (https://www.genome.gov/imagegallery/)
  • 7. Why share data? • New era of open science • Enables reproducible science • Increases visibility and credibility of data generators • Additional publications and citations • New research questions can be asked of data • New discoveries made of relevance to participants • Increasing sample size • Increases value of the data • Funder requirement
  • 8. Limits to sharing human genetic data • Data can be stored indefinitely, biobank specimens can be stored for up to 20 years – secondary use, rapid innovation with ‘omics technologies • Blood sample collection and visits to clinics are associated with disease and treatment – even for healthy controls • Ethics consent: in H3Africa, some projects have broad consent, some used tiered or specific consent • History of vulnerable populations, low education levels and exploitation • Anonymized, but risk of identification • Ethical considerations: informed consent, participant identification, stigmatisation, benefit sharing
  • 9. Human genetic data privacy • Age & Sex • Country of birth • Current residence • Native language • Ethno-linguistic/tribal affiliation • Country of birth of father and mother • Native language of father and mother • Ethno-linguistic/tribal affiliation of mother and father • Height • Weight • Current medications • Smoking history • Alcohol history • Combination of phenotype and genetic data makes it possible to identify different populations and individuals – restricted access Image credits: National Human Genome Research Institute (https://www.genome.gov/imagegallery/)
  • 10. H3Africa Data Sharing Access and Release Policy • Balance between ensuring adequate safeguards to protect participants and not being a barrier for scientists to advance research • Maximizing the availability of research data, in a timely and responsible manner • Protecting the rights and privacy of human subjects who participated in research studies • Recognizing the scientific contribution of researchers who generated the data • Considering the nature and ethics of the research proposed in establishing the timely release of data, and mechanisms of data sharing • Promoting deposition of genomic data in existing community data repositories whenever possible
  • 11. H3Africa DSAR policy • For genomic and phenotype data: • Submit to H3Africa archive • 9 months to submit to public repository • 12-month publication embargo • In EGA, access controlled by DBAC • Timeline (23 months): Research site: data generation • 2 months: Research site, QC of genomic & phenotypic data • 9 months: H3ABioNet, genomic & phenotypic data stored • 12 months: EGA, genomic & phenotypic data available through DBAC with publication embargo • Long term: EGA, genomic & phenotypic data available through DBAC without publication embargo
  • 12. Data and Biospecimen Access Committee • Review and approve requests for data and/or biospecimens • Biospecimens: • first 3 years only access outside H3Africa for those collaborating in Africa • Use info on availability in biobanks • Data generated must be submitted to EGA • Scientific review/funding available • Data • DBAC will ensure requestor has expertise and resources • Scientific review • Evaluation criteria • Scientific merit • Institutional capacity for the research • Potential for publication or translation, e.g. new therapies
  • 13. Data access agreement • H3Africa not liable for use of data • Only use data for agreed purpose • Maintain data confidentiality • Make sure data is secure • Acknowledge source of data • Submit annual reports • Project put onto website • Access is granted for 1 year
  • 14. What is required for sharing data? • Consent from participants – varying consent within a study is difficult • Robust data sharing model with implementation strategy for data access, transfer, etc. • Access agreements and MoUs • Infrastructure for • Data transfer • Data storage & compute • Training • Data curation and harmonization
  • 15. Infrastructure development & support • Node server purchases • Sys Admin “How to” documents • Access to HPC, Cloud (Docker containers) • Internet connectivity measurement – NetMap • Data transfer – Globus online, testing vs Aspera • Data storage • Training in IT, data management and general bioinformatics use • H3ABioNet combined equipment: 512 cores, 2384 GB RAM, 120 TB storage
  • 16. Building human capacity for genomics data management • Need to train • Bioinformaticians • Data scientists • Bioinformatics users • Medical professionals Specialised courses, shadow teams, internships ISCB EMBL-EBI training team
  • 17. Training Approaches • Face-to-face workshops • Train-the-Trainer • Internships • Live online training • Hackathons/Data Jamborees • Access to training materials
  • 19. Harmonizing H3Africa data • Mapping biobank data to OMIABIS ontology • Mapping CRFs to ontologies, e.g. phenotype or disease ontology • Mapping genomics data to Experimental Factor ontology • PHWG has developed a set of core phenotypes, standard CRF • Mapping ethics consent info to Data Use ontology
  • 20. Harmonizing H3Africa data • Mapping biobank data to OMIABIS ontology • Mapping CRFs to ontologies, e.g. phenotype or disease ontology • Mapping genomics data to Experimental Factor ontology • PHWG has developed a set of core phenotypes, standard CRF • Mapping ethics consent info to Data Use ontology • Biorepositories • Archive & EGA • Catalogue
  • 21. Making data FAIR • Findable, Accessible, Interoperable, and Re-usable https://www.force11.org/group/fairgroup/fairprinciples • To be Findable: identifier, metadata, indexed • To be Accessible: find by identifier, clear rules for access and authentication • To be Interoperable: standardized and cross-referenced • To be Reusable: licensed, metadata with provenance, standards
  • 23. H3Africa Data Archive • Assist H3Africa projects as data coordination center: Transfer, Validate, Store, Submit to EGA, Obtain EGA accessions for publications • 0.5 petabytes storage size including offsite replication • Local EGA feasibility?
  • 24. Data and biospecimen catalogue
  • 25. Beacons …a simple public web service … designed merely to accept a query of the form "Do you have any genomes with an 'A' at position 100,735 on chromosome 3" (or similar data) and responds with one of "Yes" or "No." genomicsandhealth.org • Advantages • Locally hosted • Minimal information (yes/no for a given allele) • Protection against “scraping” https://goo.gl/Bkd0dx
  • 26. Summary • H3Africa is the largest collection of human biomedical data in Africa to date • Human data is sensitive and needs to be shared while protecting participants and researchers • Need to build infrastructure for sharing: • harmonized/curated metadata • storage and transfer facilities • human capacity – skills • Need to provide access tools – web interface, public repositories, database • Trying to promote Open science – user groups, sessions
  • 27. Acknowledgements The H3ABioNet Consortium Funding: NIH Common Fund, NHGRI grant: U41HG006941, U24HG006941 H3ABioNet team at CBIO: • Sumir Panji • Gerrit Botha • Ayton Meintjes • Suresh Maslamoney • Vicky Nembaware • Ziyaad Parker • Kim Gurwitz • Mamana Mbiyavanga • Katherine Johnston Slides: Sumir Panji, Michelle Skelton
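
The Beacon idea on slide 25 (answering only "Yes" or "No" to a query about a given allele at a given position) can be illustrated with a minimal sketch. This is not the H3Africa or GA4GH implementation: the `Beacon` class, its `query` method, and the second example variant are hypothetical, and the data lives in an in-memory set rather than a real genomic store.

```python
from typing import Iterable, Set, Tuple

# Hypothetical variant key: (chromosome, position, allele)
Variant = Tuple[str, int, str]


class Beacon:
    """Toy beacon: reveals only whether an allele has been observed."""

    def __init__(self, variants: Iterable[Variant]) -> None:
        # Normalise alleles to upper case so lookups are case-insensitive.
        self._variants: Set[Variant] = {
            (chrom, pos, allele.upper()) for chrom, pos, allele in variants
        }

    def query(self, chromosome: str, position: int, allele: str) -> str:
        # Return the minimal yes/no answer; no sample or count is exposed.
        key = (chromosome, position, allele.upper())
        return "Yes" if key in self._variants else "No"


# Illustrative data only; the first entry mirrors the example on the slide.
beacon = Beacon([("3", 100735, "A"), ("7", 55249071, "T")])
print(beacon.query("3", 100735, "A"))
print(beacon.query("3", 100735, "G"))
```

Because the response carries no sample identifiers or counts, a locally hosted service of this shape exposes minimal information per query. Protection against scraping, which the slide lists as an advantage, would need extra measures such as rate limiting and is not shown here.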