SlideShare a Scribd company logo
1 of 23
Download to read offline
ARCHITECTURE OF THE
BDE PILOT FOR SECURE
SOCIETIES
3rd BDE Hangout “Big Data in Secure Societies”5 Dicember 2016
George Papadakis, University of Athens
Postdoctoral Researcher
Pilot Architecture
7-déc.-16www.big-data-europe.eu
Event Detection Workflow
7-déc.-16www.big-data-europe.eu
News
Crawler
…
Event
Detector
Lookup
Service
ED Workflow: News Crawler
 Runs periodically
 Monitored sources:
o Reuters news feeds (RSS)
o Selected Twitter accounts
o Keyword-based search
 Possible to cover more sources if needed
7-déc.-16www.big-data-europe.eu
ED Workflow: Cassandra
 Scalable, noSQL distributed database
 Input Scenario I:
o Individual news items from News Crawler
o Conforms with privacy regulations
 Input Scenario II:
o Events identified by Event Detector
 Input Scenario III:
o Queries about the stored news items and events
7-déc.-16www.big-data-europe.eu
ED Workflow: Event Detector
 Runs periodically, parallel execution based on Spark
 Input:
o News items
 Output:
o Events
 Every event is associated with meta-data: date & location
 Algorithm based on
7-déc.-16www.big-data-europe.eu
Event Detector Algorithm
Two steps:
1. Identify events
o Compare pairs of news items
o If similarity > threshold → related pair
o Form clusters based on related pairs
o If cluster has support > threshold → event
2. Enrich events
o Compare individual items with events
o If similarity > threshold → attached to event
7-déc.-16www.big-data-europe.eu
ED Workflow: Lookup service
 Based on Apache Lucene for fuzzy queries
 Based on the GAMD dataset
o more than 180,000 location names
 Input:
o Query including an extracted location name
 Output:
o The corresponding geocordinates
7-déc.-16www.big-data-europe.eu
Change Detection Workflow
7-déc.-16www.big-data-europe.eu
Image
Aggregator
Change
Detector
CD Workflow: Image Aggregator
 Rest service called by GUI & Event Detection
 Input (manual or automatic):
o Bounding box of the area of interest (WKT)
o The time of interest
o A past time, before an event of interest took place
 Output:
o a set of satellite images downloaded from ESA’s SciHub.
o Subset operator
7-déc.-16www.big-data-europe.eu
Automatic call of the CD workflow
 Best-effort service
 Based on a queue
o Maximum capacity: 1,000 events
o Maximum waiting time: 1 week
 Input:
o Event meta-data
 Output:
o Areas with detected changes & corresponding satellite images
7-déc.-16www.big-data-europe.eu
CD Workflow: HDFS
 Input:
o Two satellite images in zip format, each occupying few GBs.
 Output:
o Distribute parts of every image to the available cluster nodes to facilitate
their efficient processing.
7-déc.-16www.big-data-europe.eu
CD Workflow: Change Detector
 Parallelizes the change detection algorithm using Spark.
 Input:
o Two satellite images depicting the same geolocation.
 Output:
o A set of the areas with differences between the two snapshots.
7-déc.-16www.big-data-europe.eu
Change Detector Algorithm
 Three steps:
1. Preprocessing to align the given images
 Coregistration (4 successive operators) or
 Terrain Correction (1 operator)
2. Main algorithm to perform the actual comparison
3. DBScan for clustering together pixels with changes
 Two parallelization strategies:
1. Tile-centric approach (subset operator)
2. Image-centric approach (baseline approach)
7-déc.-16www.big-data-europe.eu
Common Workflow
7-déc.-16www.big-data-europe.eu
GeoTriples
Strabon
User
Interface
Common workflow: GeoTriples
 Converts geospatial data into RDF.
 Input Scenario I:
o Areas of change from Change Detector
 Input Scenario II:
o Event summaries from Event Detector
 Output:
o RDF statements
7-déc.-16www.big-data-europe.eu
Common workflow: Strabon
 Scalable & efficient spatiotemporal RDF store.
 Input Scenario I:
o Data coming from GeoTriples
 Input Scenario II:
o SPARQL queries such as:
 Get N latest event summaries from location X.
 Get event summaries with keyword Y.
 Output:
o Answers to the received queries.
7-déc.-16www.big-data-europe.eu
Common Workflow: SemaGrow
 Federates Cassandra and Strabon.
 Input:
o Queries from GUI about events or locations with changes.
 Output:
o Answers to the received queries.
7-déc.-16www.big-data-europe.eu
Common Workflow: Sextant - A
 Web application implementing the GUI.
 Input for Change Detection:
o Area selected by user through the interactive map
o Time interval (optional)
o User info
 Output:
o Calls Image Aggregator
o Progress messages
7-déc.-16www.big-data-europe.eu
Common Workflow: Sextant - B
 Input for Event Detection (at least one of the following):
o Keyword
o Location name or coordinates
o Time
 Output:
o Latest relevant event summaries & corresponding news items.
7-déc.-16www.big-data-europe.eu
Common Workflow: Sextant - C
 Cybersecurity
o User registration
 Pilot credentials (encrypted)
SciHub credentials (encrypted)
 Type of user (classified, unclassified)
 Requires administration approval
o Authorization
7-déc.-16www.big-data-europe.eu
Common Workflow: Sextant - D
 Twitter keyword search
o Retrieves tweets on the fly
o Input:
Hashtag (e.g., #bdeSC7)
 Mention (e.g., @bigDataEurope)
 Keyword(s)
o Output:
 Latest posts from Twitter Public Stream
7-déc.-16www.big-data-europe.eu
Thank you!
Questions?
Links
 Strabon: http://strabon.di.uoa.gr
 GeoTriples: https://github.com/LinkedEOData/GeoTriples
 Sextant: http://sextant.di.uoa.gr
7-déc.-16www.big-data-europe.eu

More Related Content

What's hot

Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
BigData_Europe
 

What's hot (20)

SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
 
SC7 Hangout 2: Technical update on the pilot development for secure societies
SC7 Hangout 2: Technical update on the pilot development for secure societiesSC7 Hangout 2: Technical update on the pilot development for secure societies
SC7 Hangout 2: Technical update on the pilot development for secure societies
 
SC1 Workshop 2 General Introduction to BDE
SC1 Workshop 2 General Introduction to BDESC1 Workshop 2 General Introduction to BDE
SC1 Workshop 2 General Introduction to BDE
 
Big Data Europe Transport Pilot case, Luigi Selmi
Big Data Europe Transport Pilot case, Luigi SelmiBig Data Europe Transport Pilot case, Luigi Selmi
Big Data Europe Transport Pilot case, Luigi Selmi
 
SC7 Hangout 3: Community Building Activities for Big Data in Secure Societies
SC7 Hangout 3: Community Building Activities for Big Data in Secure SocietiesSC7 Hangout 3: Community Building Activities for Big Data in Secure Societies
SC7 Hangout 3: Community Building Activities for Big Data in Secure Societies
 
SC1 Workshop 2 Pilot instantiations
SC1 Workshop 2 Pilot instantiationsSC1 Workshop 2 Pilot instantiations
SC1 Workshop 2 Pilot instantiations
 
SC1 Workshop 2 Technical overview
SC1 Workshop 2 Technical overviewSC1 Workshop 2 Technical overview
SC1 Workshop 2 Technical overview
 
SC7 Webinar 4 04/05/2017 SatCen Presentation "The Secure Societies Community ...
SC7 Webinar 4 04/05/2017 SatCen Presentation "The Secure Societies Community ...SC7 Webinar 4 04/05/2017 SatCen Presentation "The Secure Societies Community ...
SC7 Webinar 4 04/05/2017 SatCen Presentation "The Secure Societies Community ...
 
SC7 Hangout 2: Remote Sensing Data Exploitation in the secure societies pilot
SC7 Hangout 2: Remote Sensing Data Exploitation in the secure societies pilotSC7 Hangout 2: Remote Sensing Data Exploitation in the secure societies pilot
SC7 Hangout 2: Remote Sensing Data Exploitation in the secure societies pilot
 
Platform introduction & Summary
Platform introduction & SummaryPlatform introduction & Summary
Platform introduction & Summary
 
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
 
BDE SC6-ws-05/12/2016 technology part - SWC
BDE SC6-ws-05/12/2016 technology part - SWCBDE SC6-ws-05/12/2016 technology part - SWC
BDE SC6-ws-05/12/2016 technology part - SWC
 
Big data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilotsBig data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilots
 
BDE SC6.2 Workshop-05/12/16 - CESSDA
BDE SC6.2 Workshop-05/12/16 - CESSDABDE SC6.2 Workshop-05/12/16 - CESSDA
BDE SC6.2 Workshop-05/12/16 - CESSDA
 
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
 
Bde euro proworkshop
Bde euro proworkshopBde euro proworkshop
Bde euro proworkshop
 
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
 
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
 BDE SC3.3 Workshop -  BDE review: Scope and Opportunities BDE SC3.3 Workshop -  BDE review: Scope and Opportunities
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
 
Josep Maria Salanova - Introduction to BDE+SC4
Josep Maria Salanova - Introduction to BDE+SC4Josep Maria Salanova - Introduction to BDE+SC4
Josep Maria Salanova - Introduction to BDE+SC4
 
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVABDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
 

Viewers also liked

Viewers also liked (19)

Big data Europe the transport pilot in Thessaloniki - Josep Maria Salanova
Big data Europe the transport pilot in Thessaloniki - Josep Maria SalanovaBig data Europe the transport pilot in Thessaloniki - Josep Maria Salanova
Big data Europe the transport pilot in Thessaloniki - Josep Maria Salanova
 
Big data and the transport societal challenge - Maxime Flament
Big data and the transport societal challenge - Maxime FlamentBig data and the transport societal challenge - Maxime Flament
Big data and the transport societal challenge - Maxime Flament
 
Big data and transport - where can it take us? Paul Kompfner
Big data and transport - where can it take us? Paul KompfnerBig data and transport - where can it take us? Paul Kompfner
Big data and transport - where can it take us? Paul Kompfner
 
BigDataEurope - Big Data & Secure Societies
BigDataEurope - Big Data & Secure SocietiesBigDataEurope - Big Data & Secure Societies
BigDataEurope - Big Data & Secure Societies
 
BioASQ and BDE in SC1.1
BioASQ and BDE in SC1.1BioASQ and BDE in SC1.1
BioASQ and BDE in SC1.1
 
SC7 Hangout 2: Community Building activities for Big Data in Secure Societies
SC7 Hangout 2: Community Building activities for Big Data in Secure SocietiesSC7 Hangout 2: Community Building activities for Big Data in Secure Societies
SC7 Hangout 2: Community Building activities for Big Data in Secure Societies
 
Bde sc3 2nd_workshop_2016_10_04_p06_bde_pilot
Bde sc3 2nd_workshop_2016_10_04_p06_bde_pilotBde sc3 2nd_workshop_2016_10_04_p06_bde_pilot
Bde sc3 2nd_workshop_2016_10_04_p06_bde_pilot
 
SC7 Workshop 2: Demo of the BigDataEurope pilot for Secure Societies
SC7 Workshop 2: Demo of the BigDataEurope pilot for Secure SocietiesSC7 Workshop 2: Demo of the BigDataEurope pilot for Secure Societies
SC7 Workshop 2: Demo of the BigDataEurope pilot for Secure Societies
 
SC7 Workshop 2: Big Data Technologies and Scenarios
SC7 Workshop 2: Big Data Technologies and ScenariosSC7 Workshop 2: Big Data Technologies and Scenarios
SC7 Workshop 2: Big Data Technologies and Scenarios
 
1st BDE SC5 pilot: rationale, components and reusability
1st BDE SC5 pilot: rationale, components and reusability1st BDE SC5 pilot: rationale, components and reusability
1st BDE SC5 pilot: rationale, components and reusability
 
The physics background of the BDE SC5 pilot cases
The physics background of the BDE SC5 pilot casesThe physics background of the BDE SC5 pilot cases
The physics background of the BDE SC5 pilot cases
 
Bde sc3 2nd_workshop_2016_10_04_p01_bde_introduction
Bde sc3 2nd_workshop_2016_10_04_p01_bde_introductionBde sc3 2nd_workshop_2016_10_04_p01_bde_introduction
Bde sc3 2nd_workshop_2016_10_04_p01_bde_introduction
 
Bde sc3 2nd_workshop_2016_10_04_p07_laustsen_jens
Bde sc3 2nd_workshop_2016_10_04_p07_laustsen_jensBde sc3 2nd_workshop_2016_10_04_p07_laustsen_jens
Bde sc3 2nd_workshop_2016_10_04_p07_laustsen_jens
 
SC7 Workshop 2: Big Data Challenges in building GEOSS
SC7 Workshop 2: Big Data Challenges in building GEOSSSC7 Workshop 2: Big Data Challenges in building GEOSS
SC7 Workshop 2: Big Data Challenges in building GEOSS
 
Bde sc3 2nd_workshop_2016_10_04_p05_bde_system_monitoring
Bde sc3 2nd_workshop_2016_10_04_p05_bde_system_monitoringBde sc3 2nd_workshop_2016_10_04_p05_bde_system_monitoring
Bde sc3 2nd_workshop_2016_10_04_p05_bde_system_monitoring
 
SC7 Workshop 2: Space-based applications and Big Data
SC7 Workshop 2: Space-based applications and Big DataSC7 Workshop 2: Space-based applications and Big Data
SC7 Workshop 2: Space-based applications and Big Data
 
SC7 Workshop 2: The BigDataEurope project
SC7 Workshop 2: The BigDataEurope projectSC7 Workshop 2: The BigDataEurope project
SC7 Workshop 2: The BigDataEurope project
 
SC7 Workshop 2: Big Data pilots for Secure Societies
SC7 Workshop 2: Big Data pilots for Secure SocietiesSC7 Workshop 2: Big Data pilots for Secure Societies
SC7 Workshop 2: Big Data pilots for Secure Societies
 
SC7 Workshop 2: Big Data and Secure Societies
SC7 Workshop 2: Big Data and Secure SocietiesSC7 Workshop 2: Big Data and Secure Societies
SC7 Workshop 2: Big Data and Secure Societies
 

Similar to SC7 Hangout 3: Architecture of the BDE Pilot for Secure Societies

Similar to SC7 Hangout 3: Architecture of the BDE Pilot for Secure Societies (20)

SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
 
NASA Web World Wind: welcome to the new era of virtual globes
NASA Web World Wind: welcome to the new era of virtual globes NASA Web World Wind: welcome to the new era of virtual globes
NASA Web World Wind: welcome to the new era of virtual globes
 
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
 
SC7 Workshop 3: The BDE pilot for secure societies
SC7 Workshop 3: The BDE pilot for secure societiesSC7 Workshop 3: The BDE pilot for secure societies
SC7 Workshop 3: The BDE pilot for secure societies
 
ICWE2017 BigDataEurope
ICWE2017 BigDataEuropeICWE2017 BigDataEurope
ICWE2017 BigDataEurope
 
Visualizing WSJT-X received signals with spotviz.info
Visualizing WSJT-X received signals with spotviz.infoVisualizing WSJT-X received signals with spotviz.info
Visualizing WSJT-X received signals with spotviz.info
 
Data science apps: beyond notebooks
Data science apps: beyond notebooksData science apps: beyond notebooks
Data science apps: beyond notebooks
 
EventShop ISG talk 140213
EventShop ISG talk 140213EventShop ISG talk 140213
EventShop ISG talk 140213
 
BigDataEurope @BDVA Summit2016 2: Societal Pilots
BigDataEurope @BDVA Summit2016 2: Societal PilotsBigDataEurope @BDVA Summit2016 2: Societal Pilots
BigDataEurope @BDVA Summit2016 2: Societal Pilots
 
Data Science Apps: Beyond Notebooks - Natalino Busa - Codemotion Amsterdam 2017
Data Science Apps: Beyond Notebooks - Natalino Busa - Codemotion Amsterdam 2017Data Science Apps: Beyond Notebooks - Natalino Busa - Codemotion Amsterdam 2017
Data Science Apps: Beyond Notebooks - Natalino Busa - Codemotion Amsterdam 2017
 
WSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product OverviewWSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product Overview
 
BETTER Session, Excercise 1 @ EO Joint Big Data Hackathon
BETTER Session, Excercise 1 @ EO Joint Big Data HackathonBETTER Session, Excercise 1 @ EO Joint Big Data Hackathon
BETTER Session, Excercise 1 @ EO Joint Big Data Hackathon
 
Serp4 iot20.ppt
Serp4 iot20.pptSerp4 iot20.ppt
Serp4 iot20.ppt
 
SC4 Workshop 2: Soren Auer BDE project Overview
SC4 Workshop 2: Soren Auer BDE project OverviewSC4 Workshop 2: Soren Auer BDE project Overview
SC4 Workshop 2: Soren Auer BDE project Overview
 
Big Data Seervices in Danaos Use Case
Big Data Seervices in Danaos Use CaseBig Data Seervices in Danaos Use Case
Big Data Seervices in Danaos Use Case
 
Spark Streaming - Meetup Data Analysis
Spark Streaming - Meetup Data AnalysisSpark Streaming - Meetup Data Analysis
Spark Streaming - Meetup Data Analysis
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process Scenario
 
Portfolio
PortfolioPortfolio
Portfolio
 
WSO2 Big Data Platform and Applications
WSO2 Big Data Platform and ApplicationsWSO2 Big Data Platform and Applications
WSO2 Big Data Platform and Applications
 
Cotap Tech Talks: Roderic Campbell, Minority Report UX with an Apple Watch
Cotap Tech Talks: Roderic Campbell, Minority Report UX with an Apple WatchCotap Tech Talks: Roderic Campbell, Minority Report UX with an Apple Watch
Cotap Tech Talks: Roderic Campbell, Minority Report UX with an Apple Watch
 

More from BigData_Europe

More from BigData_Europe (20)

Luigi Selmi - The Big Data Integrator Platform
Luigi Selmi - The Big Data Integrator PlatformLuigi Selmi - The Big Data Integrator Platform
Luigi Selmi - The Big Data Integrator Platform
 
Rajendra Akerkar - LeMO Project
Rajendra Akerkar - LeMO ProjectRajendra Akerkar - LeMO Project
Rajendra Akerkar - LeMO Project
 
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
 
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
 
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 
BDE SC3.3 Workshop - Agenda
 BDE SC3.3 Workshop - Agenda BDE SC3.3 Workshop - Agenda
BDE SC3.3 Workshop - Agenda
 
BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re...
 BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re... BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re...
BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re...
 
BDE SC3.3 Workshop - Data management in WT testing and monitoring
 BDE SC3.3 Workshop - Data management in WT testing and monitoring  BDE SC3.3 Workshop - Data management in WT testing and monitoring
BDE SC3.3 Workshop - Data management in WT testing and monitoring
 
BDE SC3.3 Workshop - Big Data in Wind Turbine Condition Monitoring
 BDE SC3.3 Workshop -  Big Data in Wind Turbine Condition Monitoring BDE SC3.3 Workshop -  Big Data in Wind Turbine Condition Monitoring
BDE SC3.3 Workshop - Big Data in Wind Turbine Condition Monitoring
 
BDE SC3.3 Workshop - BDE Platform: Technical overview
 BDE SC3.3 Workshop -  BDE Platform: Technical overview BDE SC3.3 Workshop -  BDE Platform: Technical overview
BDE SC3.3 Workshop - BDE Platform: Technical overview
 
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
 
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
 BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics  BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
 
Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...
Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...
Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...
 
BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)
BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)
BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)
 
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
 
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
BDE SC1 Workshop 3 - MIDAS (Michaela Black)BDE SC1 Workshop 3 - MIDAS (Michaela Black)
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
 
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
 
BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)
BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)
BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

SC7 Hangout 3: Architecture of the BDE Pilot for Secure Societies

  • 1. ARCHITECTURE OF THE BDE PILOT FOR SECURE SOCIETIES 3rd BDE Hangout “Big Data in Secure Societies”5 Dicember 2016 George Papadakis, University of Athens Postdoctoral Researcher
  • 4. ED Workflow: News Crawler  Runs periodically  Monitored sources: o Reuters news feeds (RSS) o Selected Twitter accounts o Keyword-based search  Possible to cover more sources if needed 7-déc.-16www.big-data-europe.eu
  • 5. ED Workflow: Cassandra  Scalable, noSQL distributed database  Input Scenario I: o Individual news items from News Crawler o Conforms with privacy regulations  Input Scenario II: o Events identified by Event Detector  Input Scenario III: o Queries about the stored news items and events 7-déc.-16www.big-data-europe.eu
  • 6. ED Workflow: Event Detector  Runs periodically, parallel execution based on Spark  Input: o News items  Output: o Events  Every event is associated with meta-data: date & location  Algorithm based on 7-déc.-16www.big-data-europe.eu
  • 7. Event Detector Algorithm Two steps: 1. Identify events o Compare pairs of news items o If similarity > threshold → related pair o Form clusters based on related pairs o If cluster has support > threshold → event 2. Enrich events o Compare individual items with events o If similarity > threshold → attached to event 7-déc.-16www.big-data-europe.eu
  • 8. ED Workflow: Lookup service  Based on Apache Lucene for fuzzy queries  Based on the GAMD dataset o more than 180,000 location names  Input: o Query including an extracted location name  Output: o The corresponding geocordinates 7-déc.-16www.big-data-europe.eu
  • 10. CD Workflow: Image Aggregator  Rest service called by GUI & Event Detection  Input (manual or automatic): o Bounding box of the area of interest (WKT) o The time of interest o A past time, before an event of interest took place  Output: o a set of satellite images downloaded from ESA’s SciHub. o Subset operator 7-déc.-16www.big-data-europe.eu
  • 11. Automatic call of the CD workflow  Best-effort service  Based on a queue o Maximum capacity: 1,000 events o Maximum waiting time: 1 week  Input: o Event meta-data  Output: o Areas with detected changes & corresponding satellite images 7-déc.-16www.big-data-europe.eu
  • 12. CD Workflow: HDFS  Input: o Two satellite images in zip format, each occupying few GBs.  Output: o Distribute parts of every image to the available cluster nodes to facilitate their efficient processing. 7-déc.-16www.big-data-europe.eu
  • 13. CD Workflow: Change Detector  Parallelizes the change detection algorithm using Spark.  Input: o Two satellite images depicting the same geolocation.  Output: o A set of the areas with differences between the two snapshots. 7-déc.-16www.big-data-europe.eu
  • 14. Change Detector Algorithm  Three steps: 1. Preprocessing to align the given images  Coregistration (4 successive operators) or  Terrain Correction (1 operator) 2. Main algorithm to perform the actual comparison 3. DBScan for clustering together pixels with changes  Two parallelization strategies: 1. Tile-centric approach (subset operator) 2. Image-centric approach (baseline approach) 7-déc.-16www.big-data-europe.eu
  • 16. Common workflow: GeoTriples  Converts geospatial data into RDF.  Input Scenario I: o Areas of change from Change Detector  Input Scenario II: o Event summaries from Event Detector  Output: o RDF statements 7-déc.-16www.big-data-europe.eu
  • 17. Common workflow: Strabon  Scalable & efficient spatiotemporal RDF store.  Input Scenario I: o Data coming from GeoTriples  Input Scenario II: o SPARQL queries such as:  Get N latest event summaries from location X.  Get event summaries with keyword Y.  Output: o Answers to the received queries. 7-déc.-16www.big-data-europe.eu
  • 18. Common Workflow: SemaGrow  Federates Cassandra and Strabon.  Input: o Queries from GUI about events or locations with changes.  Output: o Answers to the received queries. 7-déc.-16www.big-data-europe.eu
  • 19. Common Workflow: Sextant - A  Web application implementing the GUI.  Input for Change Detection: o Area selected by user through the interactive map o Time interval (optional) o User info  Output: o Calls Image Aggregator o Progress messages 7-déc.-16www.big-data-europe.eu
  • 20. Common Workflow: Sextant - B  Input for Event Detection (at least one of the following): o Keyword o Location name or coordinates o Time  Output: o Latest relevant event summaries & corresponding news items. 7-déc.-16www.big-data-europe.eu
  • 21. Common Workflow: Sextant - C  Cybersecurity o User registration  Pilot credentials (encrypted) SciHub credentials (encrypted)  Type of user (classified, unclassified)  Requires administration approval o Authorization 7-déc.-16www.big-data-europe.eu
  • 22. Common Workflow: Sextant - D  Twitter keyword search o Retrieves tweets on the fly o Input: Hashtag (e.g., #bdeSC7)  Mention (e.g., @bigDataEurope)  Keyword(s) o Output:  Latest posts from Twitter Public Stream 7-déc.-16www.big-data-europe.eu
  • 23. Thank you! Questions? Links  Strabon: http://strabon.di.uoa.gr  GeoTriples: https://github.com/LinkedEOData/GeoTriples  Sextant: http://sextant.di.uoa.gr 7-déc.-16www.big-data-europe.eu