SlideShare a Scribd company logo
1 of 41
Metadata analysis of germplasm
collections
The case of agINFRA
Dr. Vassilis Protonotarios
Agricultural Biotechnologist, PhD
Agro-Know Technologies, Greece
e-Conference on Germplasm Data Interoperability
Session 2: “Status of data and metadata for germplasm”
Structure of the presentation
1. The agINFRA germplasm data sources
– Chinese Crop Germplasm Information System
– Italian National Germplasm Database

2. Current status
– Mappings
– Linked Data approach

3. Conclusions
The agINFRA germplasm data sources
agINFRA germplasm data sources
• Italian Germplasm Database (CRA)
– Data available through EURISCO -> GENESYS
– Uses EURISCO set of descriptors
– Data also available through GBIF

• Chinese Crop Germplasm Information System
(CGRIS/CAAS)
– Data unavailable through aggregators
– Own schema used for description of germplasm
accessions
– Metadata exposure in CSV
agINFRA germplasm data analysis
1. Analysis of agINFRA germplasm data sources
2. Analysis of metadata schemas used
3. Identification of external schemas
– Review of existing work

4. Definition of a base schema (descriptors)
5. Mappings of various schemas to the base
one
6. Development of a linked data approach for
linking germplasm data sources
1. Chinese Crop Germplasm
Information System (CGRIS / CAASD)
Chinese Crop Germplasm
Information System (CGRIS)
• Provided by: Chinese Academy of Agricultural Sciences
• A central repository for all type of plant genetic resources
information. It consists of six subsystems:
1. The management system of the National Crop Gene Bank (NCGB),
2. The management system of the long-term storage in Qinghai,
3. The management system of National germplasm Resources
Nursery,
4. The crop characterization and evaluation database system,
5. The database system for germplasm exchange at home and
abroad and
6. The management system of the medium-term storage in Beijing.

URL: http://icgr.caas.net.cn/cgrisintroduction.html
CGRIS: Data
At present, CGRIS owns
• > 2000 MB data on 180 kinds of crops
– including food crops, fibre plants, oil crops,
vegetable, fruit tree, tea, mulberry, tobacco,
sugar, green manure crops, tropical crops etc.),

• 390,000 accessions of germplasm
CGRIS: Accessions (indicative list)

http://icgr.caas.net.cn/cgrisintroduction.html
Crop Germplasm Classification
Info on wheat varieties
Info on wheat varieties
CGRIS: Germplasm Data Query
CGRIS: Germplasm Data Query
CGRIS Metadata
• CGRIS germplasm descriptors based on own
schema
– can be seen as the de facto standard for
germplasm accession information in China.
– Based on metadata scheme standards such as
developed by IPGRI (Bioversity) and GRIN
CGRIS: Basic Descriptors
CGRIS: Wheat descriptors
CGRIS Metadata: Next steps
• A mapping to the Multi-crop Passport
Descriptors (MCPD) standard is intended
– According to CAAS subject experts such a mapping
should be rather easy to produce.
CGRIS: Exposing data
• Data stored in relational DBs
• Hosted in an SQL server
• Exposure of data as CSV files (partially in
Chinese)
CGRIS: IPR information
• The CGRIS website is public and accessible for
everybody. The information is provided free of
charge but based on copyright.
• With regards to data exchange there is no
explicit policy to follow.
• CGRIS does not have an Open Access mandate
and the members of the CGRIS network apply
their own institution policy.
2. Italian Germplasm Database (CRA)
Italian Germplasm Database
• Provided by: Italian Council for Research and
Experimentation in Agriculture
• Developed in the context of the “Plant Genetic
Resources/FAO” project in 2004
– Research Centres and Units of the CRA
– The Institute of Plant Genetics of the CNR in Bari,
– NGO “Rete Semi Rurali”
– University collections (Perugia, Potenza etc.)
URL: http://fru.entecra.it
CRA Germplasm: Data
Current status of germplasm data (CRA)
• 20,954 records from Italy are included in
EURISCO of which 17,212 from CRA
• 28,509 records for 275 plant species in the
National Inventory (in general)
– does not allow for identifying the number of CRA
germplasm records
CRA: Accessions (indicative list)

URL: http://fru.entecra.it/accessioni.php
Info on specific species
EURISCO
descriptors
CRA Metadata
• Most CRA institutional databases use the
MCPD
– however, in the records provided to the National
Inventory several fields are often not filled.

• Some CRA collections also use descriptors
defined by
– the Union for the Protection of New Varieties of
Plants (UPOV) and
– the National Register of New Varieties.

• Ensure mapping to the Multi-crop Passport
Descriptors (MCPD)/EURISCO
CRA: IPR information
• The CRA website is public and accessible for everybody. The
information is provided free of charge but based on
copyright
• The Multilateral System (MLS) of the Treaty demands free
availability of the information on the PGRFA that are under
the management and control of the Contracting Parties and
in the public domain (Treaty, Art. 11.2).
• This excludes
– germplasm accessions that are subject to IPR and
– other legally binding protection which restricts the Contracting
Party’s control over the material.
– Accessions that are not covered by IPR include old and
autochthonous varieties, crop wild relatives and other material
found in in-situ conditions, new cultivars not protected by IPR
and cultivars whose IPR have expired.
Conclusions
Current status
• First version of mappings is available
• EURISCO descriptors used as base schema
– MCPD
– Darwin Core for Genebanks
– ABCD
– CGRIS
– CRA
Mapping table
Mapping table
Development of decision trees
Development of decision trees
Linked Data
• A linked data approach will be used by
agINFRA for linking germplasm data sources
• OpenAGRIS already aggregates germplasm
data using AGROVOC
Conclusions
• Both schemas / sets of descriptors can be
mapped to the EURISCO ones
• Linked Data approach will facilitate linking of
germplasm data from CRA/CGRIS
• EURISCO descriptors to be published as linked
data
– To be used as the base of passport data

• Linking to other germplasm standards
– e.g. Darwin Core for Genebanks*
*https://code.google.com/p/darwincore-germplasm/wiki/DarwinCoreGermplasmMapping
Take home message
• The identification of common properties
between different metadata schemas will
facilitate the linked data framework
(Indicative) List of References
• agINFRA Deliverable D2.3 “Review of Content
Requirements”
• agINFRA Deliverable D5.3 “Conceptual
specification of linked agricultural data
framework”
• agINFRA Germplasm Working Group Wiki
http://wiki.aginfra.eu/index.php/Germplasm_Working_Group

• EURISCO passport descriptors
http://www.ecpgr.cgiar.org/germplasm_databases.html

• Draft Mapping of EURISCO Descriptors to ABCD
2.06 http://www.bgbm.org/TDWG/CODATA/Schema/Mappings/EURISCO-2-ABCD.pdf
Source: http://verastic.com/social/why-do-people-not-say-thank-you.html

Contact me: vprot@agroknow.gr

More Related Content

What's hot

Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureDr.Hetalkumar Panchal
 
Tools of bioinforformatics by kk
Tools of bioinforformatics by kkTools of bioinforformatics by kk
Tools of bioinforformatics by kkKAUSHAL SAHU
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsYousif A. Algabri
 
1.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.20711.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.2071RajDip Basnet
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsShikha Thakur
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionBiotech Online
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informaticsDaniela Rotariu
 
Bioinformatics Database Computer applications
Bioinformatics Database Computer applicationsBioinformatics Database Computer applications
Bioinformatics Database Computer applicationsYogi Raikwar
 
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...OECD Environment
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSMSCW Mysore
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...ExternalEvents
 
User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)Elia Brodsky
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesUniversity of Malaya
 
Data-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demoData-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demoCORBEL
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and BioinformaticsSharif Shuvo
 

What's hot (20)

Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticulture
 
Tools of bioinforformatics by kk
Tools of bioinforformatics by kkTools of bioinforformatics by kk
Tools of bioinforformatics by kk
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
1.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.20711.bioinformatics introduction 32.03.2071
1.bioinformatics introduction 32.03.2071
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
 
Bioinformatics Database Computer applications
Bioinformatics Database Computer applicationsBioinformatics Database Computer applications
Bioinformatics Database Computer applications
 
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
Potential value of bioinformatic analysis in regulatory process - OECD Bioinf...
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
 
Bioinformatics Information Sources
Bioinformatics Information SourcesBioinformatics Information Sources
Bioinformatics Information Sources
 
User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)User-friendly bioinformatics (Monthly Informational workshop)
User-friendly bioinformatics (Monthly Informational workshop)
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
Data-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demoData-integration platform for cancer research:cBioPortal demo
Data-integration platform for cancer research:cBioPortal demo
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and Bioinformatics
 

Similar to agINFRA Germplasm metadata analysis

Major germplasm data sources and referatories
Major germplasm data sources and referatoriesMajor germplasm data sources and referatories
Major germplasm data sources and referatoriesVassilis Protonotarios
 
Sharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceSharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceDag Endresen
 
Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Dag Endresen
 
Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Dag Endresen
 
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...Dag Endresen
 
Genesys: Online portal to Genebank Data
Genesys: Online portal to Genebank DataGenesys: Online portal to Genebank Data
Genesys: Online portal to Genebank DataLuigi Guarino
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekData Driven Innovation
 
Global RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataGlobal RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataVassilis Protonotarios
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)Dag Endresen
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...IAALD Community
 
Harnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanksHarnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanksIAALD Community
 
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientistsRamil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientistsGigaScience, BGI Hong Kong
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Databasenist-spin
 
CIP Genebank Data Systems
CIP Genebank Data SystemsCIP Genebank Data Systems
CIP Genebank Data SystemsEdwin Rojas
 
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsGRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsCGIAR Generation Challenge Programme
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseNathan Olson
 
Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015cthanopoulos
 

Similar to agINFRA Germplasm metadata analysis (20)

Major germplasm data sources and referatories
Major germplasm data sources and referatoriesMajor germplasm data sources and referatories
Major germplasm data sources and referatories
 
Sharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceSharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conference
 
Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)
 
Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)
 
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
 
Genesys: Online portal to Genebank Data
Genesys: Online portal to Genebank DataGenesys: Online portal to Genebank Data
Genesys: Online portal to Genebank Data
 
01 pgr data base management
01 pgr data base management01 pgr data base management
01 pgr data base management
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
 
Global RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataGlobal RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm Data
 
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...Contribution of standards for developing networks, crop ontologies and a glob...
Contribution of standards for developing networks, crop ontologies and a glob...
 
Harnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanksHarnessing ICTs in managing Southern African genebanks
Harnessing ICTs in managing Southern African genebanks
 
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientistsRamil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
 
Data integration
Data integrationData integration
Data integration
 
CIP Genebank Data Systems
CIP Genebank Data SystemsCIP Genebank Data Systems
CIP Genebank Data Systems
 
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsGRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
 
Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015
 

More from Vassilis Protonotarios

Doing business with Open Data in agriculture
Doing business with Open Data in agricultureDoing business with Open Data in agriculture
Doing business with Open Data in agricultureVassilis Protonotarios
 
Legal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystemLegal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystemVassilis Protonotarios
 
Agricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDAAgricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDAVassilis Protonotarios
 
Agro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blogAgro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blogVassilis Protonotarios
 
Introduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety DataIntroduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety DataVassilis Protonotarios
 
Seeding organic agriculture courses on Moodle: the agriMoodle Case
Seeding organic agriculture courses on Moodle:  the agriMoodle CaseSeeding organic agriculture courses on Moodle:  the agriMoodle Case
Seeding organic agriculture courses on Moodle: the agriMoodle CaseVassilis Protonotarios
 
KOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyKOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyVassilis Protonotarios
 
Using language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptionsUsing language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptionsVassilis Protonotarios
 
Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...Vassilis Protonotarios
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetVassilis Protonotarios
 
AgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the WorkshopAgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the WorkshopVassilis Protonotarios
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetVassilis Protonotarios
 
Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...Vassilis Protonotarios
 
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)Vassilis Protonotarios
 
Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)Vassilis Protonotarios
 
Identifying the Training Content Needs in Vocational Education & Training Pr...
Identifying the Training Content Needs in Vocational Education  & Training Pr...Identifying the Training Content Needs in Vocational Education  & Training Pr...
Identifying the Training Content Needs in Vocational Education & Training Pr...Vassilis Protonotarios
 
Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)Vassilis Protonotarios
 

More from Vassilis Protonotarios (20)

Doing business with Open Data in agriculture
Doing business with Open Data in agricultureDoing business with Open Data in agriculture
Doing business with Open Data in agriculture
 
Legal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystemLegal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystem
 
Agricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDAAgricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDA
 
Agro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blogAgro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blog
 
Introduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety DataIntroduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety Data
 
Seeding organic agriculture courses on Moodle: the agriMoodle Case
Seeding organic agriculture courses on Moodle:  the agriMoodle CaseSeeding organic agriculture courses on Moodle:  the agriMoodle Case
Seeding organic agriculture courses on Moodle: the agriMoodle Case
 
KOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyKOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet Ontology
 
The agINFRA Germplasm Working Group
The agINFRA Germplasm Working GroupThe agINFRA Germplasm Working Group
The agINFRA Germplasm Working Group
 
Designing Data Products
Designing Data ProductsDesigning Data Products
Designing Data Products
 
Using language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptionsUsing language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptions
 
Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.Edunet
 
AgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the WorkshopAgEdWS12 - Introduction to the Workshop
AgEdWS12 - Introduction to the Workshop
 
Developing a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.EdunetDeveloping a network of content providers: The case of Organic.Edunet
Developing a network of content providers: The case of Organic.Edunet
 
Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...Introducing a content integration process for a federation of agricultural in...
Introducing a content integration process for a federation of agricultural in...
 
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
Organic.Edunet Web Portal - User Satisfaction Analysis (EFITA 2011)
 
Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)Designing a Training Session for Public Authorities (EFITA 2011)
Designing a Training Session for Public Authorities (EFITA 2011)
 
Identifying the Training Content Needs in Vocational Education & Training Pr...
Identifying the Training Content Needs in Vocational Education  & Training Pr...Identifying the Training Content Needs in Vocational Education  & Training Pr...
Identifying the Training Content Needs in Vocational Education & Training Pr...
 
Pecha Kucha
Pecha KuchaPecha Kucha
Pecha Kucha
 
Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)Green Education Using Open Educational Resources (OER) (SPDECE 2012)
Green Education Using Open Educational Resources (OER) (SPDECE 2012)
 

Recently uploaded

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseAnaAcapella
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 

Recently uploaded (20)

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 

agINFRA Germplasm metadata analysis

  • 1. Metadata analysis of germplasm collections The case of agINFRA Dr. Vassilis Protonotarios Agricultural Biotechnologist, PhD Agro-Know Technologies, Greece e-Conference on Germplasm Data Interoperability Session 2: “Status of data and metadata for germplasm”
  • 2. Structure of the presentation 1. The agINFRA germplasm data sources – Chinese Crop Germplasm Information System – Italian National Germplasm Database 2. Current status – Mappings – Linked Data approach 3. Conclusions
  • 3. The agINFRA germplasm data sources
  • 4. agINFRA germplasm data sources • Italian Germplasm Database (CRA) – Data available through EURISCO -> GENESYS – Uses EURISCO set of descriptors – Data also available through GBIF • Chinese Crop Germplasm Information System (CGRIS/CAAS) – Data unavailable through aggregators – Own schema used for description of germplasm accessions – Metadata exposure in CSV
  • 5. agINFRA germplasm data analysis 1. Analysis of agINFRA germplasm data sources 2. Analysis of metadata schemas used 3. Identification of external schemas – Review of existing work 4. Definition of a base schema (descriptors) 5. Mappings of various schemas to the base one 6. Development of a linked data approach for linking germplasm data sources
  • 6. 1. Chinese Crop Germplasm Information System (CGRIS / CAASD)
  • 7. Chinese Crop Germplasm Information System (CGRIS) • Provided by: Chinese Academy of Agricultural Sciences • A central repository for all type of plant genetic resources information. It consists of six subsystems: 1. The management system of the National Crop Gene Bank (NCGB), 2. The management system of the long-term storage in Qinghai, 3. The management system of National germplasm Resources Nursery, 4. The crop characterization and evaluation database system, 5. The database system for germplasm exchange at home and abroad and 6. The management system of the medium-term storage in Beijing. URL: http://icgr.caas.net.cn/cgrisintroduction.html
  • 8. CGRIS: Data At present, CGRIS owns • > 2000 MB data on 180 kinds of crops – including food crops, fibre plants, oil crops, vegetable, fruit tree, tea, mulberry, tobacco, sugar, green manure crops, tropical crops etc.), • 390,000 accessions of germplasm
  • 9. CGRIS: Accessions (indicative list) http://icgr.caas.net.cn/cgrisintroduction.html
  • 11. Info on wheat varieties
  • 12. Info on wheat varieties
  • 15. CGRIS Metadata • CGRIS germplasm descriptors based on own schema – can be seen as the de facto standard for germplasm accession information in China. – Based on metadata scheme standards such as developed by IPGRI (Bioversity) and GRIN
  • 18. CGRIS Metadata: Next steps • A mapping to the Multi-crop Passport Descriptors (MCPD) standard is intended – According to CAAS subject experts such a mapping should be rather easy to produce.
  • 19. CGRIS: Exposing data • Data stored in relational DBs • Hosted in an SQL server • Exposure of data as CSV files (partially in Chinese)
  • 20. CGRIS: IPR information • The CGRIS website is public and accessible for everybody. The information is provided free of charge but based on copyright. • With regards to data exchange there is no explicit policy to follow. • CGRIS does not have an Open Access mandate and the members of the CGRIS network apply their own institution policy.
  • 21. 2. Italian Germplasm Database (CRA)
  • 22. Italian Germplasm Database • Provided by: Italian Council for Research and Experimentation in Agriculture • Developed in the context of the “Plant Genetic Resources/FAO” project in 2004 – Research Centres and Units of the CRA – The Institute of Plant Genetics of the CNR in Bari, – NGO “Rete Semi Rurali” – University collections (Perugia, Potenza etc.) URL: http://fru.entecra.it
  • 23.
  • 24. CRA Germplasm: Data Current status of germplasm data (CRA) • 20,954 records from Italy are included in EURISCO of which 17,212 from CRA • 28,509 records for 275 plant species in the National Inventory (in general) – does not allow for identifying the number of CRA germplasm records
  • 25. CRA: Accessions (indicative list) URL: http://fru.entecra.it/accessioni.php
  • 26. Info on specific species
  • 27.
  • 29. CRA Metadata • Most CRA institutional databases use the MCPD – however, in the records provided to the National Inventory several fields are often not filled. • Some CRA collections also use descriptors defined by – the Union for the Protection of New Varieties of Plants (UPOV) and – the National Register of New Varieties. • Ensure mapping to the Multi-crop Passport Descriptors (MCPD)/EURISCO
  • 30. CRA: IPR information • The CRA website is public and accessible for everybody. The information is provided free of charge but based on copyright • The Multilateral System (MLS) of the Treaty demands free availability of the information on the PGRFA that are under the management and control of the Contracting Parties and in the public domain (Treaty, Art. 11.2). • This excludes – germplasm accessions that are subject to IPR and – other legally binding protection which restricts the Contracting Party’s control over the material. – Accessions that are not covered by IPR include old and autochthonous varieties, crop wild relatives and other material found in in-situ conditions, new cultivars not protected by IPR and cultivars whose IPR have expired.
  • 32. Current status • First version of mappings is available • EURISCO descriptors used as base schema – MCPD – Darwin Core for Genebanks – ABCD – CGRIS – CRA
  • 37. Linked Data • A linked data approach will be used by agINFRA for linking germplasm data sources • OpenAGRIS already aggregates germplasm data using AGROVOC
  • 38. Conclusions • Both schemas / sets of descriptors can be mapped to the EURISCO ones • Linked Data approach will facilitate linking of germplasm data from CRA/CGRIS • EURISCO descriptors to be published as linked data – To be used as the base of passport data • Linking to other germplasm standards – e.g. Darwin Core for Genebanks* *https://code.google.com/p/darwincore-germplasm/wiki/DarwinCoreGermplasmMapping
  • 39. Take home message • The identification of common properties between different metadata schemas will facilitate the linked data framework
  • 40. (Indicative) List of References • agINFRA Deliverable D2.3 “Review of Content Requirements” • agINFRA Deliverable D5.3 “Conceptual specification of linked agricultural data framework” • agINFRA Germplasm Working Group Wiki http://wiki.aginfra.eu/index.php/Germplasm_Working_Group • EURISCO passport descriptors http://www.ecpgr.cgiar.org/germplasm_databases.html • Draft Mapping of EURISCO Descriptors to ABCD 2.06 http://www.bgbm.org/TDWG/CODATA/Schema/Mappings/EURISCO-2-ABCD.pdf