SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
Exploitation opportunities

Pasquale Pagano (CNR)
iMarine Technical Director
pasquale.pagano@isti.cnr.it
Outline
The Infrastructure
• Heterogeneous resources as a service
• Data Bonanza
• Virtual Research Environment
• Software platform
iMarine Catalogue
• StatsCube
• GeosCube
• BiolCube
• ConnectCube
I-MARINE EXTENDED BOARD

2
Distinguishing capabilities of the iMarine e-infrastructure and its
enabling software

THE INFRASTRUCTURE

I-MARINE EXTENDED BOARD

3
Concepts
The initiative
(the visionary leadership)

The e-infrastructure
(the operational platform)

The system
(the enabling sw system)
I-MARINE EXTENDED BOARD

4
e-Infrastructure

Geographically
Distributed
Computing
Infrastructure

Service
Allocations,
Deployment,
Monitoring, and
Operation

Across
administrative
boundaries
Across private and
commercial
providers

Uniform resource
and data access

I-MARINE EXTENDED BOARD

5
Infrastructure: key characteristics
• Efficient and tailored storage technologies
• Computational environments dealing with the volume
of the data
• Elastic management of the resources, monitoring,
alerting, recovery
• Collaborative environment to support scientific
communities
• Rich portfolio of applications to perform access,
validation, enriching, processing, sharing, and mash-up
of data
I-MARINE EXTENDED BOARD

6
Infrastructure: Storage as Service
• Secure
• Fault-tolerant
• Replication

• Open source
RDBMS
• Up to 1 TB data

Virtual
Workspace

Relational
Databases

45 TB Currently Used
Spatial
Database

Large and
Active data
storage

• ISO
19115/10139
Metadata
• Catalogue

• Scalability and
high availability
• Across sites

I-MARINE EXTENDED BOARD

7
330 Cores Currently Allocated

Infrastructure: Computing as Service

Hadoop

• MapReduce

Statistical
Manager

• Analysis/clustering/modeling

R clusters

• Windows and Linux

I-MARINE EXTENDED BOARD

8
Infrastructure: Management as Service
Operation Machine readable SLAs
Machine readable monitoring, auditing, billing,
reporting, and notification
Machine readable resource/performance capabilities
description

Trust

Privacy, governance, and attribution

Security, trusted network

I-MARINE EXTENDED BOARD

9
Infrastructure: Collaborative Environment
The Social Portal offers a familiar view of what is
happening on their VREs
A single place to
• Get status and updates from applications and other users they are interested in;
• Get notifications about messages, jobs completion, new generated products, etc.

I-MARINE EXTENDED BOARD

10
Infrastructure: Collaborative Environment
The Social Portal offers a familiar view of what is
happening on their VREs
A single place to
• Manage all the portal extension.

W rk p Ms ags o atio sP e
o s ac es eNtific n ag
e

Se hiny u W rk p e
arc o r o s ac

H m So ial
oe c

I-MARINE EXTENDED BOARD

11
Infrastructure: Collaborative Environment
The Social Portal offers a familiar view of what is
happening on their VREs
A single place to
• Manage data, store and preserve them
• Share data

I-MARINE EXTENDED BOARD

12
Google Analytics iMarine portal

I-MARINE EXTENDED BOARD

13
Data Bonanza
OBIS
WoR
MS

…

Data.
FAO

Validation

WoR
DS
Private
Cloud

EuroS
tat
Sharing

iMarine
iMarine
Registries

GBIF
Enriching

Commercial
Cloud

WOA

MyOc
ean

CoL

Processing
ITIS
NCBI

IRMN
G

I-MARINE EXTENDED BOARD

14
Data Bonanza
SDMX *
- FAO CodeLists
- IRD CodeLists
- FAO Global
Aquaculture
Production
- FAO Global Capture
Production
- FAO Global
Production
- Eurostat
- …

Statistical

Biodiversity

Geospatial

DarwinCore / ISO19139
>35 M Observations (OBIS)
≈ 120 K Observed Species
(OBIS)
≈ 500 K Taxa (WoRMS)
>600 K Scientific Names
(ITIS)
>12 K Species Distribution
Maps (AquaMaps)
≈ 600 Species Extent (FAO)
… FishBase, SeaLifeBase
… CoL, GBIF

> 300
variables

ISO19139 (OGC W*S)
10 years Chemical and Physical variables in 2D space
Ice concentration and velocity, Chlorophyll, Oxygen, Nitrate, Phosphate,
Phytoplankton as carbon, Salinity, Temperature, …
On-demand Chemical and Physical variables in 3D space
Apparent Oxygen Utilization, Dissolved Oxygen, Salinity, Temperature, …
I-MARINE EXTENDED BOARD

15
Not Only Access
• Access
– Retrieval of geospatial data as
space/time-varying phenomena
– Direct fine-grained access to feature
and feature property level.
• Validation
– User-defined quality and dissemination level

• Enriching
– Generation metadata, exploitation of reference data, linking to
environmental dataset

• Processing
– Analysis and mining exploiting e.g. R, Weka and RapidMiner
statistical frameworks

• Sharing
– User-driven process to decide how other agents (human / machine)
can access information
I-MARINE EXTENDED BOARD

16
Features Clustering with StatsCube
Presence
Points
(FishBase
+
Obis)

Density Based Clustering
DBSCAN
(with outliers)

Other methods are also
available …

K-Means
X-Means
I-MARINE EXTENDED BOARD

17
Ecological Modeling with BiolCube

I-MARINE EXTENDED BOARD

18
Maps Comparison with GeosCube

MEAN=0.81
VARIANCE=0.02
NUMBER_OF_ERRORS=6691
NUMBER_OF_COMPARISONS=259200
ACCURACY=97.42
MAXIMUM_ERROR=1.0
MAXIMUM_ERROR_POINT=3005:363:1
COHENS_KAPPA=0.218
COHENS_KAPPA_CLASSIFICATION_LANDIS_KOCH=Fair
COHENS_KAPPA_CLASSIFICATION_FLEISS=Marginal
TREND=EXPANSION
RESOLUTION=0.5

FAO Eleutheronema tetradactylum
VS
AquaMaps Eleutheronema tetradactylum

I-MARINE EXTENDED BOARD

19
Not Only Access, Validation, Enriching,
Processing, Sharing
• It is always possible to save the discovered
data in various Standard formats
• It is always possible to collaborate with coworkers through a dedicated workspace.
• Mash-up data across diversity
– Accessing statistical datasets in SDMX, geo-referencing
them, describing them in ISO19139, and making them
available via OGC W*S standard protocols
– Accessing species observation datasets in DwC, analysing
their distribution trend via R, and projecting them in
geographical space
– Accessing species taxonomies in DwCA and publishing
them as reference data in SDMX
I-MARINE EXTENDED BOARD

20
Data Bonanza: a common vision
Integrate and harmonize crossdisciplinary data and information
across information systems and
workflows to support evidence-based
decision making
iMarine is implementing this vision through the
adoption of Standards, the identification of
common Methods and the implementation of
Tools which enable integration and
harmonization.
I-MARINE EXTENDED BOARD

21
Is this enough?
• An ecosystem of
participatory data eInfrastructures
• Regulated by policies
• Enabled by standards
• Promoting not only
access but mash-up of
heterogeneous data

User centric
I-MARINE EXTENDED BOARD

22
User-Centric View
User-centric view of an ecosystem of
participatory data e-Infrastructures to
• Cope with the overwhelming amount of data
and capacities
• Promote re-use of data
• Encourage sharing of resulting products
User-centric and workflow-oriented

I-MARINE EXTENDED BOARD

23
Virtual Research Environment
iMarine is user-centric and workflow-oriented thanks to
the gCube VRE technology
Virtual Research Environment (VRE) is
• a distributed and dynamically created environment
• where subset of data, services, computational, and
storage resources
• regulated by tailored policies
• are assigned to a subset of users via interfaces
• for a limited timeframe
• at little or no cost for the providers of
the participatory data e-infrastructures
L. Candela, D. Castelli, P. Pagano (2013) Virtual Research Environments: An Overview and a
Research Agenda. Data Science Journal, Vol. 12
I-MARINE EXTENDED BOARD

24
Flexible

Software Platform

Software platform to
abstract over differences
in location, protocols,
and models by

keeping failures partial and
temporary,

Storage, Discovery, Indexing,
Search, Execution, …

reacting to and recovering from
a large number of potential
issues.

I-MARINE EXTENDED BOARD

Feature-rich
Feature-rich

scaling no less than the
interfaced resources,

It turns resources and
technologies into a utility
by offering a single
registration, monitoring,
and access facilities

25
Software Platform

I-MARINE EXTENDED BOARD

26
iMarine Exploitation models
Service

Data hosting

Infrastructure

Unlimited users, Infrastructure support, helpdesk, back-up, security
Validation (records)

Workspace

Hardware

Default Processing (<1MB)

Social Tool

Community Management

Storage 1TB

Cloud Resources

Validation (Datasets)

Custom Data Resources

Custom Processing (> 1MB)

Spatial Data integration

User Management

Large and Active Storage

Unlimited VRE’s

Hour/Day

Month

Year

27
Concept map of the products

I-MARINE OFFER

I-MARINE EXTENDED BOARD

28
Application Bundles
Management and interpretation of biological and
ecological data in the environment
Complete full life-cycle data framework, from
observational data to aggregated data repositories
enriched with validation and analytical tools
Storage and interpretation of geospatial explicit
information, including WPS processing

Flexible sharing, storage, reporting, search and
retrieval, aggregation and projection facilities

I-MARINE EXTENDED BOARD

A BUNDLE is
a set of
services and
technologie
s grouped
according to
a family of
related
tasks for ac
hieving a
common
objective

29
Discussion time
Thank you
for your attention

www.i-marine.eu
I-MARINE EXTENDED BOARD

30

Weitere ähnliche Inhalte

Andere mochten auch

Christopher Columbus
Christopher ColumbusChristopher Columbus
Christopher Columbus
mattiann000
 

Andere mochten auch (6)

Klassmate it's fun to learn !
Klassmate it's fun to learn !Klassmate it's fun to learn !
Klassmate it's fun to learn !
 
Virtual classroom tour
Virtual classroom tour Virtual classroom tour
Virtual classroom tour
 
Christopher Columbus
Christopher ColumbusChristopher Columbus
Christopher Columbus
 
iMarine catalogue of services
iMarine catalogue of servicesiMarine catalogue of services
iMarine catalogue of services
 
iMarine Products and Services delivery
iMarine Products and Services deliveryiMarine Products and Services delivery
iMarine Products and Services delivery
 
Integrating Heterogeneous and Distributed Information about Marine Species th...
Integrating Heterogeneous and Distributed Information about Marine Species th...Integrating Heterogeneous and Distributed Information about Marine Species th...
Integrating Heterogeneous and Distributed Information about Marine Species th...
 

Ähnlich wie iMarine exploitation opportunities

Scale-on-Scale : Part 3 of 3 - Disaster Recovery
Scale-on-Scale : Part 3 of 3 - Disaster RecoveryScale-on-Scale : Part 3 of 3 - Disaster Recovery
Scale-on-Scale : Part 3 of 3 - Disaster Recovery
Scale Computing
 
Geo charvat enviro_grids_zk
Geo charvat enviro_grids_zkGeo charvat enviro_grids_zk
Geo charvat enviro_grids_zk
Karel Charvat
 

Ähnlich wie iMarine exploitation opportunities (20)

iMarine data e-infrastructure: Data access, harmonization, analysis, and mana...
iMarine data e-infrastructure: Data access, harmonization, analysis, and mana...iMarine data e-infrastructure: Data access, harmonization, analysis, and mana...
iMarine data e-infrastructure: Data access, harmonization, analysis, and mana...
 
Virtual Research Environments supporting tailor-made data management service...
Virtual Research Environments supporting tailor-made data management service...Virtual Research Environments supporting tailor-made data management service...
Virtual Research Environments supporting tailor-made data management service...
 
iMarine Services
iMarine ServicesiMarine Services
iMarine Services
 
Scale-on-Scale : Part 3 of 3 - Disaster Recovery
Scale-on-Scale : Part 3 of 3 - Disaster RecoveryScale-on-Scale : Part 3 of 3 - Disaster Recovery
Scale-on-Scale : Part 3 of 3 - Disaster Recovery
 
Energy Databank in Nigeria: Management ,Technology and Security
Energy Databank in Nigeria:   Management ,Technology and SecurityEnergy Databank in Nigeria:   Management ,Technology and Security
Energy Databank in Nigeria: Management ,Technology and Security
 
2016 asprs track: science, scale, and innovation: when remote sensing analys...
2016 asprs track: science, scale, and innovation:  when remote sensing analys...2016 asprs track: science, scale, and innovation:  when remote sensing analys...
2016 asprs track: science, scale, and innovation: when remote sensing analys...
 
BlueBRIDGE Presentation at Blue Growth Research & Innovation Event 2017
BlueBRIDGE Presentation at Blue Growth Research & Innovation Event 2017BlueBRIDGE Presentation at Blue Growth Research & Innovation Event 2017
BlueBRIDGE Presentation at Blue Growth Research & Innovation Event 2017
 
Cliff Denhom, Stream Restoration Inc., "Datashed Overview and Q&A"
Cliff Denhom, Stream Restoration Inc., "Datashed Overview and Q&A"Cliff Denhom, Stream Restoration Inc., "Datashed Overview and Q&A"
Cliff Denhom, Stream Restoration Inc., "Datashed Overview and Q&A"
 
Geo charvat enviro_grids_zk
Geo charvat enviro_grids_zkGeo charvat enviro_grids_zk
Geo charvat enviro_grids_zk
 
Information Systems
Information SystemsInformation Systems
Information Systems
 
Towards an e-infrastructure in agriculture?
Towards an e-infrastructure in agriculture?Towards an e-infrastructure in agriculture?
Towards an e-infrastructure in agriculture?
 
Red Hat Storage Product Overview
Red Hat Storage Product OverviewRed Hat Storage Product Overview
Red Hat Storage Product Overview
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and Rain
 
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community EditionMapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community Edition
 
Smart Water Networks
Smart Water NetworksSmart Water Networks
Smart Water Networks
 
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
 
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data Platform
 
Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal
Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC FederalKafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal
Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal
 
APAN Cloud WG (2015/3/2)
APAN Cloud WG (2015/3/2)APAN Cloud WG (2015/3/2)
APAN Cloud WG (2015/3/2)
 
Sobloo Geospatial Ecosystem
Sobloo Geospatial EcosystemSobloo Geospatial Ecosystem
Sobloo Geospatial Ecosystem
 

Mehr von iMarine283644

The vulnerable marine ecosystems (VME DB) factsheet workflow
The vulnerable marine ecosystems (VME DB) factsheet workflowThe vulnerable marine ecosystems (VME DB) factsheet workflow
The vulnerable marine ecosystems (VME DB) factsheet workflow
iMarine283644
 
The iMarine solutions in support to the ecosystem approach needs
The iMarine solutions in support to the ecosystem approach needsThe iMarine solutions in support to the ecosystem approach needs
The iMarine solutions in support to the ecosystem approach needs
iMarine283644
 
I marine achievements the story so far
I marine achievements  the story so farI marine achievements  the story so far
I marine achievements the story so far
iMarine283644
 
Cool tools and high level experts for fisheries management and knowledge
Cool tools and high level experts for fisheries management and knowledgeCool tools and high level experts for fisheries management and knowledge
Cool tools and high level experts for fisheries management and knowledge
iMarine283644
 

Mehr von iMarine283644 (12)

Discovering the impact of climate change on the marine species, Aquamaps
Discovering the impact of climate change on the marine species, AquamapsDiscovering the impact of climate change on the marine species, Aquamaps
Discovering the impact of climate change on the marine species, Aquamaps
 
How iMarine fulfils data needs in support of the Ecosystem Approach (EA)
How iMarine fulfils data needs in support of the Ecosystem Approach (EA)How iMarine fulfils data needs in support of the Ecosystem Approach (EA)
How iMarine fulfils data needs in support of the Ecosystem Approach (EA)
 
iMarine achievements: three years and beyond, D. Castelli, CNR-ISTI & iMarine...
iMarine achievements: three years and beyond, D. Castelli, CNR-ISTI & iMarine...iMarine achievements: three years and beyond, D. Castelli, CNR-ISTI & iMarine...
iMarine achievements: three years and beyond, D. Castelli, CNR-ISTI & iMarine...
 
BiOnym
BiOnymBiOnym
BiOnym
 
Chimaera
ChimaeraChimaera
Chimaera
 
The vulnerable marine ecosystems (VME DB) factsheet workflow
The vulnerable marine ecosystems (VME DB) factsheet workflowThe vulnerable marine ecosystems (VME DB) factsheet workflow
The vulnerable marine ecosystems (VME DB) factsheet workflow
 
The iMarine solutions in support to the ecosystem approach needs
The iMarine solutions in support to the ecosystem approach needsThe iMarine solutions in support to the ecosystem approach needs
The iMarine solutions in support to the ecosystem approach needs
 
I marine achievements the story so far
I marine achievements  the story so farI marine achievements  the story so far
I marine achievements the story so far
 
Providing Statistical Algorithms as-a-Service
Providing Statistical Algorithms as-a-ServiceProviding Statistical Algorithms as-a-Service
Providing Statistical Algorithms as-a-Service
 
iMarine initiative overview
iMarine initiative overviewiMarine initiative overview
iMarine initiative overview
 
Cool tools and high level experts for fisheries management and knowledge
Cool tools and high level experts for fisheries management and knowledgeCool tools and high level experts for fisheries management and knowledge
Cool tools and high level experts for fisheries management and knowledge
 
Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: Marine Knowledge 2020: Re...
Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: Marine Knowledge 2020: Re...Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: Marine Knowledge 2020: Re...
Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: Marine Knowledge 2020: Re...
 

Kürzlich hochgeladen

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Kürzlich hochgeladen (20)

Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 

iMarine exploitation opportunities

  • 1. Exploitation opportunities Pasquale Pagano (CNR) iMarine Technical Director pasquale.pagano@isti.cnr.it
  • 2. Outline The Infrastructure • Heterogeneous resources as a service • Data Bonanza • Virtual Research Environment • Software platform iMarine Catalogue • StatsCube • GeosCube • BiolCube • ConnectCube I-MARINE EXTENDED BOARD 2
  • 3. Distinguishing capabilities of the iMarine e-infrastructure and its enabling software THE INFRASTRUCTURE I-MARINE EXTENDED BOARD 3
  • 4. Concepts The initiative (the visionary leadership) The e-infrastructure (the operational platform) The system (the enabling sw system) I-MARINE EXTENDED BOARD 4
  • 6. Infrastructure: key characteristics • Efficient and tailored storage technologies • Computational environments dealing with the volume of the data • Elastic management of the resources, monitoring, alerting, recovery • Collaborative environment to support scientific communities • Rich portfolio of applications to perform access, validation, enriching, processing, sharing, and mash-up of data I-MARINE EXTENDED BOARD 6
  • 7. Infrastructure: Storage as Service • Secure • Fault-tolerant • Replication • Open source RDBMS • Up to 1 TB data Virtual Workspace Relational Databases 45 TB Currently Used Spatial Database Large and Active data storage • ISO 19115/10139 Metadata • Catalogue • Scalability and high availability • Across sites I-MARINE EXTENDED BOARD 7
  • 8. 330 Cores Currently Allocated Infrastructure: Computing as Service Hadoop • MapReduce Statistical Manager • Analysis/clustering/modeling R clusters • Windows and Linux I-MARINE EXTENDED BOARD 8
  • 9. Infrastructure: Management as Service Operation Machine readable SLAs Machine readable monitoring, auditing, billing, reporting, and notification Machine readable resource/performance capabilities description Trust Privacy, governance, and attribution Security, trusted network I-MARINE EXTENDED BOARD 9
  • 10. Infrastructure: Collaborative Environment The Social Portal offers a familiar view of what is happening on their VREs A single place to • Get status and updates from applications and other users they are interested in; • Get notifications about messages, jobs completion, new generated products, etc. I-MARINE EXTENDED BOARD 10
  • 11. Infrastructure: Collaborative Environment The Social Portal offers a familiar view of what is happening on their VREs A single place to • Manage all the portal extension. W rk p Ms ags o atio sP e o s ac es eNtific n ag e Se hiny u W rk p e arc o r o s ac H m So ial oe c I-MARINE EXTENDED BOARD 11
  • 12. Infrastructure: Collaborative Environment The Social Portal offers a familiar view of what is happening on their VREs A single place to • Manage data, store and preserve them • Share data I-MARINE EXTENDED BOARD 12
  • 13. Google Analytics iMarine portal I-MARINE EXTENDED BOARD 13
  • 15. Data Bonanza SDMX * - FAO CodeLists - IRD CodeLists - FAO Global Aquaculture Production - FAO Global Capture Production - FAO Global Production - Eurostat - … Statistical Biodiversity Geospatial DarwinCore / ISO19139 >35 M Observations (OBIS) ≈ 120 K Observed Species (OBIS) ≈ 500 K Taxa (WoRMS) >600 K Scientific Names (ITIS) >12 K Species Distribution Maps (AquaMaps) ≈ 600 Species Extent (FAO) … FishBase, SeaLifeBase … CoL, GBIF > 300 variables ISO19139 (OGC W*S) 10 years Chemical and Physical variables in 2D space Ice concentration and velocity, Chlorophyll, Oxygen, Nitrate, Phosphate, Phytoplankton as carbon, Salinity, Temperature, … On-demand Chemical and Physical variables in 3D space Apparent Oxygen Utilization, Dissolved Oxygen, Salinity, Temperature, … I-MARINE EXTENDED BOARD 15
  • 16. Not Only Access • Access – Retrieval of geospatial data as space/time-varying phenomena – Direct fine-grained access to feature and feature property level. • Validation – User-defined quality and dissemination level • Enriching – Generation metadata, exploitation of reference data, linking to environmental dataset • Processing – Analysis and mining exploiting e.g. R, Weka and RapidMiner statistical frameworks • Sharing – User-driven process to decide how other agents (human / machine) can access information I-MARINE EXTENDED BOARD 16
  • 17. Features Clustering with StatsCube Presence Points (FishBase + Obis) Density Based Clustering DBSCAN (with outliers) Other methods are also available … K-Means X-Means I-MARINE EXTENDED BOARD 17
  • 18. Ecological Modeling with BiolCube I-MARINE EXTENDED BOARD 18
  • 19. Maps Comparison with GeosCube MEAN=0.81 VARIANCE=0.02 NUMBER_OF_ERRORS=6691 NUMBER_OF_COMPARISONS=259200 ACCURACY=97.42 MAXIMUM_ERROR=1.0 MAXIMUM_ERROR_POINT=3005:363:1 COHENS_KAPPA=0.218 COHENS_KAPPA_CLASSIFICATION_LANDIS_KOCH=Fair COHENS_KAPPA_CLASSIFICATION_FLEISS=Marginal TREND=EXPANSION RESOLUTION=0.5 FAO Eleutheronema tetradactylum VS AquaMaps Eleutheronema tetradactylum I-MARINE EXTENDED BOARD 19
  • 20. Not Only Access, Validation, Enriching, Processing, Sharing • It is always possible to save the discovered data in various Standard formats • It is always possible to collaborate with coworkers through a dedicated workspace. • Mash-up data across diversity – Accessing statistical datasets in SDMX, geo-referencing them, describing them in ISO19139, and making them available via OGC W*S standard protocols – Accessing species observation datasets in DwC, analysing their distribution trend via R, and projecting them in geographical space – Accessing species taxonomies in DwCA and publishing them as reference data in SDMX I-MARINE EXTENDED BOARD 20
  • 21. Data Bonanza: a common vision Integrate and harmonize crossdisciplinary data and information across information systems and workflows to support evidence-based decision making iMarine is implementing this vision through the adoption of Standards, the identification of common Methods and the implementation of Tools which enable integration and harmonization. I-MARINE EXTENDED BOARD 21
  • 22. Is this enough? • An ecosystem of participatory data eInfrastructures • Regulated by policies • Enabled by standards • Promoting not only access but mash-up of heterogeneous data User centric I-MARINE EXTENDED BOARD 22
  • 23. User-Centric View User-centric view of an ecosystem of participatory data e-Infrastructures to • Cope with the overwhelming amount of data and capacities • Promote re-use of data • Encourage sharing of resulting products User-centric and workflow-oriented I-MARINE EXTENDED BOARD 23
  • 24. Virtual Research Environment iMarine is user-centric and workflow-oriented thanks to the gCube VRE technology Virtual Research Environment (VRE) is • a distributed and dynamically created environment • where subset of data, services, computational, and storage resources • regulated by tailored policies • are assigned to a subset of users via interfaces • for a limited timeframe • at little or no cost for the providers of the participatory data e-infrastructures L. Candela, D. Castelli, P. Pagano (2013) Virtual Research Environments: An Overview and a Research Agenda. Data Science Journal, Vol. 12 I-MARINE EXTENDED BOARD 24
  • 25. Flexible Software Platform Software platform to abstract over differences in location, protocols, and models by keeping failures partial and temporary, Storage, Discovery, Indexing, Search, Execution, … reacting to and recovering from a large number of potential issues. I-MARINE EXTENDED BOARD Feature-rich Feature-rich scaling no less than the interfaced resources, It turns resources and technologies into a utility by offering a single registration, monitoring, and access facilities 25
  • 27. iMarine Exploitation models Service Data hosting Infrastructure Unlimited users, Infrastructure support, helpdesk, back-up, security Validation (records) Workspace Hardware Default Processing (<1MB) Social Tool Community Management Storage 1TB Cloud Resources Validation (Datasets) Custom Data Resources Custom Processing (> 1MB) Spatial Data integration User Management Large and Active Storage Unlimited VRE’s Hour/Day Month Year 27
  • 28. Concept map of the products I-MARINE OFFER I-MARINE EXTENDED BOARD 28
  • 29. Application Bundles Management and interpretation of biological and ecological data in the environment Complete full life-cycle data framework, from observational data to aggregated data repositories enriched with validation and analytical tools Storage and interpretation of geospatial explicit information, including WPS processing Flexible sharing, storage, reporting, search and retrieval, aggregation and projection facilities I-MARINE EXTENDED BOARD A BUNDLE is a set of services and technologie s grouped according to a family of related tasks for ac hieving a common objective 29
  • 30. Discussion time Thank you for your attention www.i-marine.eu I-MARINE EXTENDED BOARD 30