BigDataEurope aims to maximize the societal value of big data by addressing challenges in health, food security, energy, transport, climate action, and secure societies. It develops a big data aggregator platform to integrate diverse data sources and apply analytics to help solve societal problems. The platform follows the Lambda architecture to handle both batch and real-time processing while retaining semantic meaning from data. Current activities include workshops and interest groups in specific societal challenge domains.
3. Big Data in Marketing
24-sept.-15www.big-data-europe.eu
4. Big Data in Intelligence
24-sept.-15www.big-data-europe.eu
5. BigDataEurope aims to help
maximizing the societal value of Big
Data
Health, demographic change and wellbeing;
Food security, sustainable agriculture and forestry,
marine and maritime and inland water research, and the
Bioeconomy;
Secure, clean and efficient energy;
Smart, green and integrated transport;
Climate action, environment, resource efficiency and
raw materials;
Europe in a changing world - inclusive, innovative and
reflective societies;
Secure societies - protecting freedom and security of
Europe and its citizens.
24-sept.-15www.big-data-europe.eu
6. The three Big Data „V“ –
Variety is often neglected
Quelle: Gesellschaft für Informatik
10. BigDataEurope Rationale
Show societal value of Big Data
Lower barrrier for using big data technologies
o Required effort and resources
o Limited data science skills
o Lack of Generic Architectures, components
Help establishing cross-
lingual/organizational/domain Data Value
Chains
o Multiple Data Sources
o Required: Integration, Harmonisation
24-sept.-15www.big-data-europe.eu
12. Orthogonal Dimensions of Big Data
Ecosystems
Generic Big Data Enabling Technologies
Data Value Chain
Data Generation
& Acquisition
Data Analysis &
Processing
Data Storage &
Curation
Data
Visualization &
Usage
Data-driven
Services
SocietalChallenges
DomainSpecificDataAssets&Technology
Healthcare
Food Security
Energy
Intelligent Transport
Climate & Environment
Inclusive & Reflective Societies
Secure Societies
20. Blueprint of the Data Aggregator
Platform
Follows typical Lambda Architecture
Integrated on top of existing Big Data distribution
+ Semantic Layer (Retaining Semantics using LD
Batch Layer
Speed Layer
Data Storage
Real-time data &
Transactions …
Batch View
Real-time
View
messagepassing
message passing
Applications & Showcases
Real-time dashboards
Domain-specific BDE apps
Big Data Analytics
In-stream Mining
BDEPlatform&
Intelligence
Input data
Stream
Spatial
Social
Statistical
Temporal
Transactiona
l
Imagery
21. BDE Platform based on BigTop
Packaging Smoke testing Virtualization
Package RPMs and DEBs,
so that you can manage
and maintain your own
cluster.
Integrated smoke testing
framework
Vagrant recipes, raw
images, and docker recipes
for deploying BigData
infrastructures from zero.
24-sept.-15www.big-data-europe.eu
+ Semantic Layer - Retaining Semantics using
Linked Data
22. Data Aggregator Platform
Challenges
Ingest semantic (RDF) and non-semantic
(CSV, JSON, XML, …) data
o Integrate various mapping techniques (R2RML,
CSV on the Web, JSON-LD)
preserve semantics, provenance and
metadata in Big Data processing chains
o Preserve URI/IRIs
o Preserve triples
Exploit semantics for aggregations
24-sept.-15www.big-data-europe.eu
23. Current Activities – Year#1
2015 BDE Societal Workshops (7) Planned
o Schedule on Website
7 W3C Interest Groups set up: Please Join!
o SC1: HEALTH https://www.w3.org/community/bde-health/join
o SC2: FOOD & AGRICULTURE https://www.w3.org/community/bde-food/
o SC3: ENERGY https://www.w3.org/community/bde-energy/
o SC4: TRANSPORT https://www.w3.org/community/bde-transport/
o SC5: CLIMATE & ENVIRONMENT https://www.w3.org/community/bde-climate/
o SC6: SOCIETIES https://www.w3.org/community/bde-societies/
o SC7: SECURITY https://www.w3.org/community/bde-secure-societies/
www.big-data-europe.eu
Quelle 4. Bild: http://images.zeit.de/mobilitaet/2014-05/auto-autonom/auto-autonom-540x304.jpg
Prescriptive analytics not only anticipates what will happen and when it will happen, but also why it will happen. Further, prescriptive analytics suggests decision options on how to take advantage of a future opportunity or mitigate a future risk and shows the implication of each decision option. Prescriptive analytics can continually take in new data to re-predict and re-prescribe, thus automatically improving prediction accuracy and prescribing better decision options. Prescriptive analytics ingests hybrid data, a combination of structured (numbers, categories) and unstructured data (videos, images, sounds, texts), and business rules to predict what lies ahead and to prescribe how to take advantage of this predicted future without compromising other priorities.[8]
Two clearly defined coordination and support measures:
Coordination: Engaging with a diverse range of stakeholder groups from SCs
Collecting requirements for the ICT infrastructure needed by data-intensive science practitioners
covering all aspects of publishing and consuming semantically interoperable, large-scale data and knowledge assets
Support:
meets requirements
minimises disruption to current workflows,
maximises the opportunities to take advantage of the latest European RTD developments
BigDataEurope will implement and apply two main instruments to successfully realize these measures:
Build Societal Big Data Interest/Community Groups in the W3C interest group scheme & involving a large number of stakeholders from the Horizon 2020 societal challenges as well as technical Big Data experts;
Design, integrate and deploy a cloud-deployment-ready Big Data aggregator platform comprising key open-source Big Data technologies for real-time and batch processing, such as Hadoop, Cassandra and Storm.
Allein die heute verfügbaren Open Source Technologien für Big Data passen kaum auf eine Folie und erfordern Architekturverständnis mit Blick auf die Use Cases (Folie Technologie-Stack aus Technologie-Recherche)
Ggfs. SMILA erwähnen, welches THESEUS-Wurzeln hat (heute Empolis)
Die populärsten Projekte sind hier hadoop (batch) & storm (real-time)