SlideShare a Scribd company logo
1 of 41
Metadata analysis of germplasm
collections
The case of agINFRA
Dr. Vassilis Protonotarios
Agricultural Biotechnologist, PhD
Agro-Know Technologies, Greece
e-Conference on Germplasm Data Interoperability
Session 2: “Status of data and metadata for germplasm”
Structure of the presentation
1. The agINFRA germplasm data sources
– Chinese Crop Germplasm Information System
– Italian National Germplasm Database

2. Current status
– Mappings
– Linked Data approach

3. Conclusions
The agINFRA germplasm data sources
agINFRA germplasm data sources
• Italian Germplasm Database (CRA)
– Data available through EURISCO -> GENESYS
– Uses EURISCO set of descriptors
– Data also available through GBIF

• Chinese Crop Germplasm Information System
(CGRIS/CAAS)
– Data unavailable through aggregators
– Own schema used for description of germplasm
accessions
– Metadata exposure in CSV
agINFRA germplasm data analysis
1. Analysis of agINFRA germplasm data sources
2. Analysis of metadata schemas used
3. Identification of external schemas
– Review of existing work

4. Definition of a base schema (descriptors)
5. Mappings of various schemas to the base
one
6. Development of a linked data approach for
linking germplasm data sources
1. Chinese Crop Germplasm
Information System (CGRIS / CAASD)
Chinese Crop Germplasm
Information System (CGRIS)
• Provided by: Chinese Academy of Agricultural Sciences
• A central repository for all type of plant genetic resources
information. It consists of six subsystems:
1. The management system of the National Crop Gene Bank (NCGB),
2. The management system of the long-term storage in Qinghai,
3. The management system of National germplasm Resources
Nursery,
4. The crop characterization and evaluation database system,
5. The database system for germplasm exchange at home and
abroad and
6. The management system of the medium-term storage in Beijing.

URL: http://icgr.caas.net.cn/cgrisintroduction.html
CGRIS: Data
At present, CGRIS owns
• > 2000 MB data on 180 kinds of crops
– including food crops, fibre plants, oil crops,
vegetable, fruit tree, tea, mulberry, tobacco,
sugar, green manure crops, tropical crops etc.),

• 390,000 accessions of germplasm
CGRIS: Accessions (indicative list)

http://icgr.caas.net.cn/cgrisintroduction.html
Crop Germplasm Classification
Info on wheat varieties
Info on wheat varieties
CGRIS: Germplasm Data Query
CGRIS: Germplasm Data Query
CGRIS Metadata
• CGRIS germplasm descriptors based on own
schema
– can be seen as the de facto standard for
germplasm accession information in China.
– Based on metadata scheme standards such as
developed by IPGRI (Bioversity) and GRIN
CGRIS: Basic Descriptors
CGRIS: Wheat descriptors
CGRIS Metadata: Next steps
• A mapping to the Multi-crop Passport
Descriptors (MCPD) standard is intended
– According to CAAS subject experts such a mapping
should be rather easy to produce.
CGRIS: Exposing data
• Data stored in relational DBs
• Hosted in an SQL server
• Exposure of data as CSV files (partially in
Chinese)
CGRIS: IPR information
• The CGRIS website is public and accessible for
everybody. The information is provided free of
charge but based on copyright.
• With regards to data exchange there is no
explicit policy to follow.
• CGRIS does not have an Open Access mandate
and the members of the CGRIS network apply
their own institution policy.
2. Italian Germplasm Database (CRA)
Italian Germplasm Database
• Provided by: Italian Council for Research and
Experimentation in Agriculture
• Developed in the context of the “Plant Genetic
Resources/FAO” project in 2004
– Research Centres and Units of the CRA
– The Institute of Plant Genetics of the CNR in Bari,
– NGO “Rete Semi Rurali”
– University collections (Perugia, Potenza etc.)
URL: http://fru.entecra.it
CRA Germplasm: Data
Current status of germplasm data (CRA)
• 20,954 records from Italy are included in
EURISCO of which 17,212 from CRA
• 28,509 records for 275 plant species in the
National Inventory (in general)
– does not allow for identifying the number of CRA
germplasm records
CRA: Accessions (indicative list)

URL: http://fru.entecra.it/accessioni.php
Info on specific species
EURISCO
descriptors
CRA Metadata
• Most CRA institutional databases use the
MCPD
– however, in the records provided to the National
Inventory several fields are often not filled.

• Some CRA collections also use descriptors
defined by
– the Union for the Protection of New Varieties of
Plants (UPOV) and
– the National Register of New Varieties.

• Ensure mapping to the Multi-crop Passport
Descriptors (MCPD)/EURISCO
CRA: IPR information
• The CRA website is public and accessible for everybody. The
information is provided free of charge but based on
copyright
• The Multilateral System (MLS) of the Treaty demands free
availability of the information on the PGRFA that are under
the management and control of the Contracting Parties and
in the public domain (Treaty, Art. 11.2).
• This excludes
– germplasm accessions that are subject to IPR and
– other legally binding protection which restricts the Contracting
Party’s control over the material.
– Accessions that are not covered by IPR include old and
autochthonous varieties, crop wild relatives and other material
found in in-situ conditions, new cultivars not protected by IPR
and cultivars whose IPR have expired.
Conclusions
Current status
• First version of mappings is available
• EURISCO descriptors used as base schema
– MCPD
– Darwin Core for Genebanks
– ABCD
– CGRIS
– CRA
Mapping table
Mapping table
Development of decision trees
Development of decision trees
Linked Data
• A linked data approach will be used by
agINFRA for linking germplasm data sources
• OpenAGRIS already aggregates germplasm
data using AGROVOC
Conclusions
• Both schemas / sets of descriptors can be
mapped to the EURISCO ones
• Linked Data approach will facilitate linking of
germplasm data from CRA/CGRIS
• EURISCO descriptors to be published as linked
data
– To be used as the base of passport data

• Linking to other germplasm standards
– e.g. Darwin Core for Genebanks*
*https://code.google.com/p/darwincore-germplasm/wiki/DarwinCoreGermplasmMapping
Take home message
• The identification of common properties
between different metadata schemas will
facilitate the linked data framework
(Indicative) List of References
• agINFRA Deliverable D2.3 “Review of Content
Requirements”
• agINFRA Deliverable D5.3 “Conceptual
specification of linked agricultural data
framework”
• agINFRA Germplasm Working Group Wiki
http://wiki.aginfra.eu/index.php/Germplasm_Working_Group

• EURISCO passport descriptors
http://www.ecpgr.cgiar.org/germplasm_databases.html

• Draft Mapping of EURISCO Descriptors to ABCD
2.06 http://www.bgbm.org/TDWG/CODATA/Schema/Mappings/EURISCO-2-ABCD.pdf
Source: http://verastic.com/social/why-do-people-not-say-thank-you.html

Contact me: vprot@agroknow.gr

More Related Content

What's hot

Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureDr.Hetalkumar Panchal
 
Tools of bioinforformatics by kk
Tools of bioinforformatics by kkTools of bioinforformatics by kk
Tools of bioinforformatics by kkKAUSHAL SAHU
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsYousif A. Algabri
 
1.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.20711.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.2071RajDip Basnet
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsShikha Thakur
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionBiotech Online
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informaticsDaniela Rotariu
 
Bioinformatics Database Computer applications
Bioinformatics Database Computer applicationsBioinformatics Database Computer applications
Bioinformatics Database Computer applicationsYogi Raikwar
 
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...OECD Environment
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSMSCW Mysore
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...ExternalEvents
 
User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)Elia Brodsky
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesUniversity of Malaya
 
Data-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demoData-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demoCORBEL
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and BioinformaticsSharif Shuvo
 

What's hot (20)

Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticulture
 
Tools of bioinforformatics by kk
Tools of bioinforformatics by kkTools of bioinforformatics by kk
Tools of bioinforformatics by kk
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
1.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.20711.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.2071
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
 
Bioinformatics Database Computer applications
Bioinformatics Database Computer applicationsBioinformatics Database Computer applications
Bioinformatics Database Computer applications
 
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
 
Bioinformatics Information Sources
Bioinformatics Information SourcesBioinformatics Information Sources
Bioinformatics Information Sources
 
User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
Data-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demoData-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demo
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and Bioinformatics
 

Similar to Metadata analysis of germplasm collections

Major germplasm data sources and referatories
Major germplasm data sources and referatoriesMajor germplasm data sources and referatories
Major germplasm data sources and referatoriesVassilis Protonotarios
 
Sharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceSharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceDag Endresen
 
Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Dag Endresen
 
Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Dag Endresen
 
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...Dag Endresen
 
Genesys: Online portal to Genebank Data
Genesys: Online portal to Genebank DataGenesys: Online portal to Genebank Data
Genesys: Online portal to Genebank DataLuigi Guarino
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekData Driven Innovation
 
Global RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataGlobal RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataVassilis Protonotarios
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)Dag Endresen
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...IAALD Community
 
Harnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanksHarnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanksIAALD Community
 
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientistsRamil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientistsGigaScience, BGI Hong Kong
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Databasenist-spin
 
CIP Genebank Data Systems
CIP Genebank Data SystemsCIP Genebank Data Systems
CIP Genebank Data SystemsEdwin Rojas
 
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsGRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsCGIAR Generation Challenge Programme
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseNathan Olson
 
Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015cthanopoulos
 

Similar to Metadata analysis of germplasm collections (20)

Major germplasm data sources and referatories
Major germplasm data sources and referatoriesMajor germplasm data sources and referatories
Major germplasm data sources and referatories
 
Sharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceSharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conference
 
Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)
 
Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)
 
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
 
Genesys: Online portal to Genebank Data
Genesys: Online portal to Genebank DataGenesys: Online portal to Genebank Data
Genesys: Online portal to Genebank Data
 
01 pgr data base management
01 pgr data base management01 pgr data base management
01 pgr data base management
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
 
Global RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataGlobal RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm Data
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...
 
Harnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanksHarnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanks
 
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientistsRamil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
 
Data integration
Data integrationData integration
Data integration
 
CIP Genebank Data Systems
CIP Genebank Data SystemsCIP Genebank Data Systems
CIP Genebank Data Systems
 
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsGRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
 
Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015
 

More from Vassilis Protonotarios

Doing business with Open Data in agriculture
Doing business with Open Data in agricultureDoing business with Open Data in agriculture
Doing business with Open Data in agricultureVassilis Protonotarios
 
Legal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystemLegal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystemVassilis Protonotarios
 
Agricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDAAgricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDAVassilis Protonotarios
 
Agro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blogAgro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blogVassilis Protonotarios
 
Introduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety DataIntroduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety DataVassilis Protonotarios
 
Seeding organic agriculture courses on Moodle: the agriMoodle Case
Seeding organic agriculture courses on Moodle:  the agriMoodle CaseSeeding organic agriculture courses on Moodle:  the agriMoodle Case
Seeding organic agriculture courses on Moodle: the agriMoodle CaseVassilis Protonotarios
 
KOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyKOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyVassilis Protonotarios
 
Using language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptionsUsing language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptionsVassilis Protonotarios
 
Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...Vassilis Protonotarios
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetVassilis Protonotarios
 
AgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the WorkshopAgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the WorkshopVassilis Protonotarios
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetVassilis Protonotarios
 
Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...Vassilis Protonotarios
 
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)Vassilis Protonotarios
 
Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)Vassilis Protonotarios
 
Identifying the Training Content Needs in Vocational Education & Training Pr...
Identifying the Training Content Needs in Vocational Education  & Training Pr...Identifying the Training Content Needs in Vocational Education  & Training Pr...
Identifying the Training Content Needs in Vocational Education & Training Pr...Vassilis Protonotarios
 
Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)Vassilis Protonotarios
 

More from Vassilis Protonotarios (20)

Doing business with Open Data in agriculture
Doing business with Open Data in agricultureDoing business with Open Data in agriculture
Doing business with Open Data in agriculture
 
Legal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystemLegal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystem
 
Agricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDAAgricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDA
 
Agro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blogAgro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blog
 
Introduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety DataIntroduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety Data
 
Seeding organic agriculture courses on Moodle: the agriMoodle Case
Seeding organic agriculture courses on Moodle:  the agriMoodle CaseSeeding organic agriculture courses on Moodle:  the agriMoodle Case
Seeding organic agriculture courses on Moodle: the agriMoodle Case
 
KOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyKOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet Ontology
 
The agINFRA Germplasm Working Group
The agINFRA Germplasm Working GroupThe agINFRA Germplasm Working Group
The agINFRA Germplasm Working Group
 
Designing Data Products
Designing Data ProductsDesigning Data Products
Designing Data Products
 
Using language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptionsUsing language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptions
 
Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.Edunet
 
AgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the WorkshopAgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the Workshop
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.Edunet
 
Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...
 
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
 
Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)
 
Identifying the Training Content Needs in Vocational Education & Training Pr...
Identifying the Training Content Needs in Vocational Education  & Training Pr...Identifying the Training Content Needs in Vocational Education  & Training Pr...
Identifying the Training Content Needs in Vocational Education & Training Pr...
 
Pecha Kucha
Pecha KuchaPecha Kucha
Pecha Kucha
 
Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)
 

Recently uploaded

Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 

Recently uploaded (20)

Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

Metadata analysis of germplasm collections

  • 1. Metadata analysis of germplasm collections The case of agINFRA Dr. Vassilis Protonotarios Agricultural Biotechnologist, PhD Agro-Know Technologies, Greece e-Conference on Germplasm Data Interoperability Session 2: “Status of data and metadata for germplasm”
  • 2. Structure of the presentation 1. The agINFRA germplasm data sources – Chinese Crop Germplasm Information System – Italian National Germplasm Database 2. Current status – Mappings – Linked Data approach 3. Conclusions
  • 3. The agINFRA germplasm data sources
  • 4. agINFRA germplasm data sources • Italian Germplasm Database (CRA) – Data available through EURISCO -> GENESYS – Uses EURISCO set of descriptors – Data also available through GBIF • Chinese Crop Germplasm Information System (CGRIS/CAAS) – Data unavailable through aggregators – Own schema used for description of germplasm accessions – Metadata exposure in CSV
  • 5. agINFRA germplasm data analysis 1. Analysis of agINFRA germplasm data sources 2. Analysis of metadata schemas used 3. Identification of external schemas – Review of existing work 4. Definition of a base schema (descriptors) 5. Mappings of various schemas to the base one 6. Development of a linked data approach for linking germplasm data sources
  • 6. 1. Chinese Crop Germplasm Information System (CGRIS / CAASD)
  • 7. Chinese Crop Germplasm Information System (CGRIS) • Provided by: Chinese Academy of Agricultural Sciences • A central repository for all type of plant genetic resources information. It consists of six subsystems: 1. The management system of the National Crop Gene Bank (NCGB), 2. The management system of the long-term storage in Qinghai, 3. The management system of National germplasm Resources Nursery, 4. The crop characterization and evaluation database system, 5. The database system for germplasm exchange at home and abroad and 6. The management system of the medium-term storage in Beijing. URL: http://icgr.caas.net.cn/cgrisintroduction.html
  • 8. CGRIS: Data At present, CGRIS owns • > 2000 MB data on 180 kinds of crops – including food crops, fibre plants, oil crops, vegetable, fruit tree, tea, mulberry, tobacco, sugar, green manure crops, tropical crops etc.), • 390,000 accessions of germplasm
  • 9. CGRIS: Accessions (indicative list) http://icgr.caas.net.cn/cgrisintroduction.html
  • 11. Info on wheat varieties
  • 12. Info on wheat varieties
  • 15. CGRIS Metadata • CGRIS germplasm descriptors based on own schema – can be seen as the de facto standard for germplasm accession information in China. – Based on metadata scheme standards such as developed by IPGRI (Bioversity) and GRIN
  • 18. CGRIS Metadata: Next steps • A mapping to the Multi-crop Passport Descriptors (MCPD) standard is intended – According to CAAS subject experts such a mapping should be rather easy to produce.
  • 19. CGRIS: Exposing data • Data stored in relational DBs • Hosted in an SQL server • Exposure of data as CSV files (partially in Chinese)
  • 20. CGRIS: IPR information • The CGRIS website is public and accessible for everybody. The information is provided free of charge but based on copyright. • With regards to data exchange there is no explicit policy to follow. • CGRIS does not have an Open Access mandate and the members of the CGRIS network apply their own institution policy.
  • 21. 2. Italian Germplasm Database (CRA)
  • 22. Italian Germplasm Database • Provided by: Italian Council for Research and Experimentation in Agriculture • Developed in the context of the “Plant Genetic Resources/FAO” project in 2004 – Research Centres and Units of the CRA – The Institute of Plant Genetics of the CNR in Bari, – NGO “Rete Semi Rurali” – University collections (Perugia, Potenza etc.) URL: http://fru.entecra.it
  • 23.
  • 24. CRA Germplasm: Data Current status of germplasm data (CRA) • 20,954 records from Italy are included in EURISCO of which 17,212 from CRA • 28,509 records for 275 plant species in the National Inventory (in general) – does not allow for identifying the number of CRA germplasm records
  • 25. CRA: Accessions (indicative list) URL: http://fru.entecra.it/accessioni.php
  • 26. Info on specific species
  • 27.
  • 29. CRA Metadata • Most CRA institutional databases use the MCPD – however, in the records provided to the National Inventory several fields are often not filled. • Some CRA collections also use descriptors defined by – the Union for the Protection of New Varieties of Plants (UPOV) and – the National Register of New Varieties. • Ensure mapping to the Multi-crop Passport Descriptors (MCPD)/EURISCO
  • 30. CRA: IPR information • The CRA website is public and accessible for everybody. The information is provided free of charge but based on copyright • The Multilateral System (MLS) of the Treaty demands free availability of the information on the PGRFA that are under the management and control of the Contracting Parties and in the public domain (Treaty, Art. 11.2). • This excludes – germplasm accessions that are subject to IPR and – other legally binding protection which restricts the Contracting Party’s control over the material. – Accessions that are not covered by IPR include old and autochthonous varieties, crop wild relatives and other material found in in-situ conditions, new cultivars not protected by IPR and cultivars whose IPR have expired.
  • 32. Current status • First version of mappings is available • EURISCO descriptors used as base schema – MCPD – Darwin Core for Genebanks – ABCD – CGRIS – CRA
  • 37. Linked Data • A linked data approach will be used by agINFRA for linking germplasm data sources • OpenAGRIS already aggregates germplasm data using AGROVOC
  • 38. Conclusions • Both schemas / sets of descriptors can be mapped to the EURISCO ones • Linked Data approach will facilitate linking of germplasm data from CRA/CGRIS • EURISCO descriptors to be published as linked data – To be used as the base of passport data • Linking to other germplasm standards – e.g. Darwin Core for Genebanks* *https://code.google.com/p/darwincore-germplasm/wiki/DarwinCoreGermplasmMapping
  • 39. Take home message • The identification of common properties between different metadata schemas will facilitate the linked data framework
  • 40. (Indicative) List of References • agINFRA Deliverable D2.3 “Review of Content Requirements” • agINFRA Deliverable D5.3 “Conceptual specification of linked agricultural data framework” • agINFRA Germplasm Working Group Wiki http://wiki.aginfra.eu/index.php/Germplasm_Working_Group • EURISCO passport descriptors http://www.ecpgr.cgiar.org/germplasm_databases.html • Draft Mapping of EURISCO Descriptors to ABCD 2.06 http://www.bgbm.org/TDWG/CODATA/Schema/Mappings/EURISCO-2-ABCD.pdf