SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Downloaden Sie, um offline zu lesen
Big Data and Geospatial with HPCC Systems®
Powered by LexisNexis Risk Solutions
Ignacio Calvo
Greg McRandal
10/05/2016
Concepts in Geospatial
How to use them with HPCC
Use cases
@HPCCSystems
An approach to applying statistical
analysis and other analytic techniques
to data which has a geographical or
spatial aspect
Definition
Origin of Geospatial
John Snow’s original map (1854),
using GIS to save lives. This map
was used to determine that
Cholera was water-borne
Need to know :
• Format
• Projection / coordinate system
Understanding the data
Formats : Vector vs Raster
Vector Raster
Projections are used to represent the world in ways
we can process
•The Earth is round and maps are flat
•Physical Maps
•Computer Maps
What is a projection?
Have I seen projections before?
•Peter vs Mercator vs Winkel tripel
•GPS (latitude/longitude)
•Google Maps
Two different projections representing the same place.
Projections
WGS84
•Latitude and longitude
•Our best approximation of the world
•Not always the best for a specific region
•Not technically a projection
Projections to know about
Mercator
•Many different ones, choose one based on your location
•Reduces the area it covers to a simple Cartesian plane
•Good near the central axis, bad far away from it :
• Web Mercator covers the whole world – good near equator, gets worse as you travel north or
south
• Irish National Grid – very good for Ireland, awful anywhere else.
Lies, damned lies, statistics… and maps!
*https://twitter.com/flashboy/status/641221733509373952
Lies, damned lies, statistics… and maps!
Projection Woes:
A straight line in Mercator is
not a straight line in WGS84
Four points converted
to WGS84
Where the lines
should be
Don’t re-project polygons!
This “solution” is only good
enough for visuals, not for
maths.
Lies, damned lies, statistics… and maps!
Lies, damned lies, statistics… and maps!
Visuals don’t agree with maths: Wind and Hail.
Web Mercator WGS84
Number one bug in Geospatial
*http://twcc.fr
Number one bug in Geospatial
Latitude
Longitude
X
Y
LatY LonX
Now I understand my data, what’s next?
Data Ingest Index Query
Bringing Geospatial into HPCC
GOAL
Bring our geospatial processes
into the realm of Big Data
STEPS
Spatial filtering of vector geometries
Spatial operations using vector geometries
Spatial reference projection and transformation
Reading of compressed geo-raster files
Big Data
Extend HPCC and ECL to support the following main
capabilities :
STEPS
Big Data
Integration of open source libraries
Ingesting Vector Data
It’s a CSV file.
Id Name Geometry Projection Value
1 Alice’s
place
POINT (53.78925462 -6.08354321) 4326* €5,973,000
2 Bob’s place POINT (-34.78925462 7.08354321) 4326 €872,000
3 Celine’s
place
POINT (102.78925462 -6.08354321) 4326 €9,324,000
* WGS84 (Lat/Lon)
3.
Peril tag
2.
Geocode address
1.
Policy data
Data ready to
ingest
Ingesting Vector Data
It’s a GML / XML file.
3.
Process and index
2.
Parse XPATH
1.
Shape data
Data ready to
query
Ingesting Vector Data
It’s a GML / XML file.
3.
Process and index
2.
Parse XPATH
1.
Shape data
Data ready to
query
Ingesting Vector Data
It’s a GML / XML file.
3.
Process and index
2.
Parse XPATH
1.
Shape data
Data ready to
query
Indexing vector data
• Outline Box: Biggest rectangle
• Boxes contain boxes
• Bottom box in the tree contains actual
geometries
• Here, 3 levels pictured
• Boxes can overlap (entries are only in one)
Querying vector data
Searching an R-Tree: e.g. Finding all buildings (points) inside a flood zone (polygon)
Does the query polygon overlap our box?
Return empty list
Search our boxes’
children
Is it a leaf node?
Return all nodes
for verification
Y
N
Y
N
Ingesting Raster Data
It’s a raster / TIFF file. Bitmap image
3.
Process and index
2.
Tile and spray
1.
Raster data
Data ready to
query
Ingesting Raster Data
3.
Process and index
2.
Tile and spray
1.
Raster data
Data ready to
query
Tiling divides raster images into
small manageable areas of known
dimensions.
These tiles have their own
metadata:
• Bounding box
• Grid position
Ingesting Raster Data
3.
Process and index
2.
Tile and spray
1.
Raster data
Data ready to
query
1. Figure out which grid position the
geometry needs
2. Extract the required pixel
3. Interrogate the pixel for its value
4. Interpret its value
5. Return to user
Ingesting Raster Data
It’s a raster / TIFF file. Bitmap image
3.
Process and index
2.
Tile and spray
1.
Raster data
Data ready to
query
Ingesting Raster Data
It’s a raster / TIFF file.
3.
Process and index
2.
Tile and spray
1.
Raster data
Data ready to
query
Bringing it all together
*Andrew Farrell
In pursuit of perils : Geo-spatial risk analysis through HPCC Systems
https://hpccsystems.com/resources/blog/afarrell/pursuit-perils-geo-spatial-risk-analysis-
through-hpcc-systems
Add even more value
Add even more value
Why Geospatial with HPCC?
• Efficient parallel processing
• Ability to import libraries from different languages
• Good coverage of functions and spatial predicates
• Fast ingestion
• Support for different formats
• Sub-second queries
hpccsystems.com

Weitere ähnliche Inhalte

Was ist angesagt?

4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data miningKrish_ver2
 
Spatial analysis and Analysis Tools ( GIS )
Spatial analysis and Analysis Tools ( GIS )Spatial analysis and Analysis Tools ( GIS )
Spatial analysis and Analysis Tools ( GIS )designQube
 
Improvement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data ConflationImprovement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data ConflationBeniamino Murgante
 
3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, Jatiluhur3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, JatiluhurHartanto Sanjaya
 
Spatial data analysis
Spatial data analysisSpatial data analysis
Spatial data analysisJohan Blomme
 
Spatial vs non spatial
Spatial vs non spatialSpatial vs non spatial
Spatial vs non spatialSumant Diwakar
 
Vector data model
Vector data model Vector data model
Vector data model Pramoda Raj
 
ePOM - Intro to Ocean Data Science - Raster and Vector Data Formats
ePOM - Intro to Ocean Data Science - Raster and Vector Data FormatsePOM - Intro to Ocean Data Science - Raster and Vector Data Formats
ePOM - Intro to Ocean Data Science - Raster and Vector Data FormatsGiuseppe Masetti
 
Geographical information system unit 5
Geographical information  system unit 5Geographical information  system unit 5
Geographical information system unit 5WE-IT TUTORIALS
 
Conversion of Existing Data
Conversion of Existing DataConversion of Existing Data
Conversion of Existing DataTilok Chetri
 
LIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3DLIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3DPrakher Hajela Saxena
 
MapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy ResourcesMapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy ResourcesPrakher Hajela Saxena
 
3D Analyst - Watershed Lorelindu
3D Analyst - Watershed Lorelindu3D Analyst - Watershed Lorelindu
3D Analyst - Watershed LorelinduHartanto Sanjaya
 

Was ist angesagt? (20)

GIS data structure
GIS data structureGIS data structure
GIS data structure
 
Spatial databases
Spatial databasesSpatial databases
Spatial databases
 
3D Analyst - Lab
3D Analyst - Lab3D Analyst - Lab
3D Analyst - Lab
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data mining
 
Spatial analysis and Analysis Tools ( GIS )
Spatial analysis and Analysis Tools ( GIS )Spatial analysis and Analysis Tools ( GIS )
Spatial analysis and Analysis Tools ( GIS )
 
Spatial Data Model
Spatial Data ModelSpatial Data Model
Spatial Data Model
 
Improvement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data ConflationImprovement of Spatial Data Quality Using the Data Conflation
Improvement of Spatial Data Quality Using the Data Conflation
 
Iccsa stankuteha180611
Iccsa stankuteha180611Iccsa stankuteha180611
Iccsa stankuteha180611
 
3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, Jatiluhur3D Analyst - Lake, Jatiluhur
3D Analyst - Lake, Jatiluhur
 
Spatial data analysis
Spatial data analysisSpatial data analysis
Spatial data analysis
 
Spatial vs non spatial
Spatial vs non spatialSpatial vs non spatial
Spatial vs non spatial
 
Vector data model
Vector data model Vector data model
Vector data model
 
GIS Modeling
GIS ModelingGIS Modeling
GIS Modeling
 
ePOM - Intro to Ocean Data Science - Raster and Vector Data Formats
ePOM - Intro to Ocean Data Science - Raster and Vector Data FormatsePOM - Intro to Ocean Data Science - Raster and Vector Data Formats
ePOM - Intro to Ocean Data Science - Raster and Vector Data Formats
 
Geographical information system unit 5
Geographical information  system unit 5Geographical information  system unit 5
Geographical information system unit 5
 
Conversion of Existing Data
Conversion of Existing DataConversion of Existing Data
Conversion of Existing Data
 
LIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3DLIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3D
 
MapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy ResourcesMapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy Resources
 
3D Analyst - Watershed Lorelindu
3D Analyst - Watershed Lorelindu3D Analyst - Watershed Lorelindu
3D Analyst - Watershed Lorelindu
 
Spatial Database Systems
Spatial Database SystemsSpatial Database Systems
Spatial Database Systems
 

Andere mochten auch

2016 HPCC Systems Poster Presentation Competition
2016 HPCC Systems Poster Presentation Competition2016 HPCC Systems Poster Presentation Competition
2016 HPCC Systems Poster Presentation CompetitionHPCC Systems
 
Farm Management System - Delivering a Precision Agriculture Solution
Farm Management System - Delivering a Precision Agriculture SolutionFarm Management System - Delivering a Precision Agriculture Solution
Farm Management System - Delivering a Precision Agriculture SolutionHPCC Systems
 
Enabling Aviation Analytics through HPCC Systems
Enabling Aviation Analytics through HPCC SystemsEnabling Aviation Analytics through HPCC Systems
Enabling Aviation Analytics through HPCC SystemsHPCC Systems
 
Introduction to the Open Source HPCC Systems Platform by Arjuna Chala
Introduction to the Open Source HPCC Systems Platform by Arjuna ChalaIntroduction to the Open Source HPCC Systems Platform by Arjuna Chala
Introduction to the Open Source HPCC Systems Platform by Arjuna ChalaHPCC Systems
 
HPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the WorldHPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the WorldHPCC Systems
 
HUG Ireland Event - HPCC Presentation Slides
HUG Ireland Event - HPCC Presentation SlidesHUG Ireland Event - HPCC Presentation Slides
HUG Ireland Event - HPCC Presentation SlidesJohn Mulhall
 
Proagrica - Big Data to Feed the World
Proagrica - Big Data to Feed the WorldProagrica - Big Data to Feed the World
Proagrica - Big Data to Feed the WorldHPCC Systems
 
Big Data - Hadoop and MapReduce - Aditya Garg
Big Data - Hadoop and MapReduce - Aditya GargBig Data - Hadoop and MapReduce - Aditya Garg
Big Data - Hadoop and MapReduce - Aditya GargAgile Testing Alliance
 
Poultry farm management system
Poultry farm management systemPoultry farm management system
Poultry farm management systembimoljit
 
The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...IBM Analytics
 
Big-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunitiesBig-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunities台灣資料科學年會
 
Webinar 2013 11-21-sebillo
Webinar 2013 11-21-sebilloWebinar 2013 11-21-sebillo
Webinar 2013 11-21-sebillosmespire
 
LR Каталог продукции 2012
LR Каталог продукции 2012LR Каталог продукции 2012
LR Каталог продукции 2012t575ae
 
LR Прайс лист 08.2012
LR Прайс лист 08.2012LR Прайс лист 08.2012
LR Прайс лист 08.2012t575ae
 
Two Days Training on Advocacy at Lahore 8 - 9 December 2016
Two Days Training on Advocacy at Lahore 8 - 9 December 2016Two Days Training on Advocacy at Lahore 8 - 9 December 2016
Two Days Training on Advocacy at Lahore 8 - 9 December 2016sultantareen1976
 

Andere mochten auch (20)

2016 HPCC Systems Poster Presentation Competition
2016 HPCC Systems Poster Presentation Competition2016 HPCC Systems Poster Presentation Competition
2016 HPCC Systems Poster Presentation Competition
 
Farm Management System - Delivering a Precision Agriculture Solution
Farm Management System - Delivering a Precision Agriculture SolutionFarm Management System - Delivering a Precision Agriculture Solution
Farm Management System - Delivering a Precision Agriculture Solution
 
Enabling Aviation Analytics through HPCC Systems
Enabling Aviation Analytics through HPCC SystemsEnabling Aviation Analytics through HPCC Systems
Enabling Aviation Analytics through HPCC Systems
 
Introduction to the Open Source HPCC Systems Platform by Arjuna Chala
Introduction to the Open Source HPCC Systems Platform by Arjuna ChalaIntroduction to the Open Source HPCC Systems Platform by Arjuna Chala
Introduction to the Open Source HPCC Systems Platform by Arjuna Chala
 
HPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the WorldHPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the World
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
HPCC Presentation
HPCC PresentationHPCC Presentation
HPCC Presentation
 
HUG Ireland Event - HPCC Presentation Slides
HUG Ireland Event - HPCC Presentation SlidesHUG Ireland Event - HPCC Presentation Slides
HUG Ireland Event - HPCC Presentation Slides
 
Proagrica - Big Data to Feed the World
Proagrica - Big Data to Feed the WorldProagrica - Big Data to Feed the World
Proagrica - Big Data to Feed the World
 
Big Data Ready Enterprise
Big Data Ready Enterprise Big Data Ready Enterprise
Big Data Ready Enterprise
 
Big Data - Hadoop and MapReduce - Aditya Garg
Big Data - Hadoop and MapReduce - Aditya GargBig Data - Hadoop and MapReduce - Aditya Garg
Big Data - Hadoop and MapReduce - Aditya Garg
 
Poultry farm management system
Poultry farm management systemPoultry farm management system
Poultry farm management system
 
The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...
 
Big-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunitiesBig-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunities
 
Big Data: Issues and Challenges
Big Data: Issues and ChallengesBig Data: Issues and Challenges
Big Data: Issues and Challenges
 
Webinar 2013 11-21-sebillo
Webinar 2013 11-21-sebilloWebinar 2013 11-21-sebillo
Webinar 2013 11-21-sebillo
 
LR Каталог продукции 2012
LR Каталог продукции 2012LR Каталог продукции 2012
LR Каталог продукции 2012
 
MY NAME IS DUBIAN MARIN - UNAD
MY NAME IS DUBIAN MARIN - UNADMY NAME IS DUBIAN MARIN - UNAD
MY NAME IS DUBIAN MARIN - UNAD
 
LR Прайс лист 08.2012
LR Прайс лист 08.2012LR Прайс лист 08.2012
LR Прайс лист 08.2012
 
Two Days Training on Advocacy at Lahore 8 - 9 December 2016
Two Days Training on Advocacy at Lahore 8 - 9 December 2016Two Days Training on Advocacy at Lahore 8 - 9 December 2016
Two Days Training on Advocacy at Lahore 8 - 9 December 2016
 

Ähnlich wie Big Data and Geospatial with HPCC Systems

What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)John Lanser
 
GIS Analysis For Site Remediation
GIS Analysis For Site RemediationGIS Analysis For Site Remediation
GIS Analysis For Site RemediationJoseph Luchette
 
THE NATURE AND SOURCE OF GEOGRAPHIC DATA
THE NATURE AND SOURCE OF GEOGRAPHIC DATATHE NATURE AND SOURCE OF GEOGRAPHIC DATA
THE NATURE AND SOURCE OF GEOGRAPHIC DATANadia Aziz
 
Getting started with GIS
Getting started with GISGetting started with GIS
Getting started with GISEsri India
 
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSAPIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSAFloridaTLaoaten
 
Geographic Information System unit 1
Geographic Information System   unit 1Geographic Information System   unit 1
Geographic Information System unit 1sridevi5983
 
Data models in geographical information system(GIS)
Data models in geographical information system(GIS)Data models in geographical information system(GIS)
Data models in geographical information system(GIS)Pramoda Raj
 
geographic information system pdf
geographic information system pdfgeographic information system pdf
geographic information system pdfRolan Ben Lorono
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsMason Porter
 
Geographic information system(GIS) and its applications in agriculture
Geographic information system(GIS) and its applications in agricultureGeographic information system(GIS) and its applications in agriculture
Geographic information system(GIS) and its applications in agricultureKiranmai nalla
 
Spatial Data Science with R
Spatial Data Science with RSpatial Data Science with R
Spatial Data Science with Ramsantac
 
Scattered gis handbook
Scattered gis handbookScattered gis handbook
Scattered gis handbookWaleed Liaqat
 
Intro to GIS and Remote Sensing
Intro to GIS and Remote SensingIntro to GIS and Remote Sensing
Intro to GIS and Remote SensingJohn Reiser
 
Vector data model
Vector data modelVector data model
Vector data modelPramoda Raj
 

Ähnlich wie Big Data and Geospatial with HPCC Systems (20)

What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)What is Geography Information Systems (GIS)
What is Geography Information Systems (GIS)
 
GIS Analysis For Site Remediation
GIS Analysis For Site RemediationGIS Analysis For Site Remediation
GIS Analysis For Site Remediation
 
THE NATURE AND SOURCE OF GEOGRAPHIC DATA
THE NATURE AND SOURCE OF GEOGRAPHIC DATATHE NATURE AND SOURCE OF GEOGRAPHIC DATA
THE NATURE AND SOURCE OF GEOGRAPHIC DATA
 
Getting started with GIS
Getting started with GISGetting started with GIS
Getting started with GIS
 
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSAPIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
 
Fundamentals of GIS
Fundamentals of GISFundamentals of GIS
Fundamentals of GIS
 
Geographic Information System unit 1
Geographic Information System   unit 1Geographic Information System   unit 1
Geographic Information System unit 1
 
Data models in geographical information system(GIS)
Data models in geographical information system(GIS)Data models in geographical information system(GIS)
Data models in geographical information system(GIS)
 
geographic information system pdf
geographic information system pdfgeographic information system pdf
geographic information system pdf
 
GIS_Intro_March_2014
GIS_Intro_March_2014GIS_Intro_March_2014
GIS_Intro_March_2014
 
Info Grafix
Info GrafixInfo Grafix
Info Grafix
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial Systems
 
Geographic information system(GIS) and its applications in agriculture
Geographic information system(GIS) and its applications in agricultureGeographic information system(GIS) and its applications in agriculture
Geographic information system(GIS) and its applications in agriculture
 
Final ies
Final iesFinal ies
Final ies
 
GIS_FDP_Final.pdf
GIS_FDP_Final.pdfGIS_FDP_Final.pdf
GIS_FDP_Final.pdf
 
Spatial Data Science with R
Spatial Data Science with RSpatial Data Science with R
Spatial Data Science with R
 
Exploratory Spatial Analytics (ESA)
Exploratory Spatial Analytics (ESA)Exploratory Spatial Analytics (ESA)
Exploratory Spatial Analytics (ESA)
 
Scattered gis handbook
Scattered gis handbookScattered gis handbook
Scattered gis handbook
 
Intro to GIS and Remote Sensing
Intro to GIS and Remote SensingIntro to GIS and Remote Sensing
Intro to GIS and Remote Sensing
 
Vector data model
Vector data modelVector data model
Vector data model
 

Mehr von HPCC Systems

Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...HPCC Systems
 
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsHPCC Systems
 
Towards Trustable AI for Complex Systems
Towards Trustable AI for Complex SystemsTowards Trustable AI for Complex Systems
Towards Trustable AI for Complex SystemsHPCC Systems
 
Closing / Adjourn
Closing / Adjourn Closing / Adjourn
Closing / Adjourn HPCC Systems
 
Community Website: Virtual Ribbon Cutting
Community Website: Virtual Ribbon CuttingCommunity Website: Virtual Ribbon Cutting
Community Website: Virtual Ribbon CuttingHPCC Systems
 
Release Cycle Changes
Release Cycle ChangesRelease Cycle Changes
Release Cycle ChangesHPCC Systems
 
Geohashing with Uber’s H3 Geospatial Index
Geohashing with Uber’s H3 Geospatial Index Geohashing with Uber’s H3 Geospatial Index
Geohashing with Uber’s H3 Geospatial Index HPCC Systems
 
Advancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningAdvancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningHPCC Systems
 
Expanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network CapabilitiesExpanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network CapabilitiesHPCC Systems
 
Leveraging Intra-Node Parallelization in HPCC Systems
Leveraging Intra-Node Parallelization in HPCC SystemsLeveraging Intra-Node Parallelization in HPCC Systems
Leveraging Intra-Node Parallelization in HPCC SystemsHPCC Systems
 
DataPatterns - Profiling in ECL Watch
DataPatterns - Profiling in ECL Watch DataPatterns - Profiling in ECL Watch
DataPatterns - Profiling in ECL Watch HPCC Systems
 
Leveraging the Spark-HPCC Ecosystem
Leveraging the Spark-HPCC Ecosystem Leveraging the Spark-HPCC Ecosystem
Leveraging the Spark-HPCC Ecosystem HPCC Systems
 
Work Unit Analysis Tool
Work Unit Analysis ToolWork Unit Analysis Tool
Work Unit Analysis ToolHPCC Systems
 
Community Award Ceremony
Community Award Ceremony Community Award Ceremony
Community Award Ceremony HPCC Systems
 
Dapper Tool - A Bundle to Make your ECL Neater
Dapper Tool - A Bundle to Make your ECL NeaterDapper Tool - A Bundle to Make your ECL Neater
Dapper Tool - A Bundle to Make your ECL NeaterHPCC Systems
 
A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...
A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...
A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...HPCC Systems
 
Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...
Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...
Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...HPCC Systems
 

Mehr von HPCC Systems (20)

Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...
 
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
 
Towards Trustable AI for Complex Systems
Towards Trustable AI for Complex SystemsTowards Trustable AI for Complex Systems
Towards Trustable AI for Complex Systems
 
Welcome
WelcomeWelcome
Welcome
 
Closing / Adjourn
Closing / Adjourn Closing / Adjourn
Closing / Adjourn
 
Community Website: Virtual Ribbon Cutting
Community Website: Virtual Ribbon CuttingCommunity Website: Virtual Ribbon Cutting
Community Website: Virtual Ribbon Cutting
 
Path to 8.0
Path to 8.0 Path to 8.0
Path to 8.0
 
Release Cycle Changes
Release Cycle ChangesRelease Cycle Changes
Release Cycle Changes
 
Geohashing with Uber’s H3 Geospatial Index
Geohashing with Uber’s H3 Geospatial Index Geohashing with Uber’s H3 Geospatial Index
Geohashing with Uber’s H3 Geospatial Index
 
Advancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningAdvancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine Learning
 
Docker Support
Docker Support Docker Support
Docker Support
 
Expanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network CapabilitiesExpanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network Capabilities
 
Leveraging Intra-Node Parallelization in HPCC Systems
Leveraging Intra-Node Parallelization in HPCC SystemsLeveraging Intra-Node Parallelization in HPCC Systems
Leveraging Intra-Node Parallelization in HPCC Systems
 
DataPatterns - Profiling in ECL Watch
DataPatterns - Profiling in ECL Watch DataPatterns - Profiling in ECL Watch
DataPatterns - Profiling in ECL Watch
 
Leveraging the Spark-HPCC Ecosystem
Leveraging the Spark-HPCC Ecosystem Leveraging the Spark-HPCC Ecosystem
Leveraging the Spark-HPCC Ecosystem
 
Work Unit Analysis Tool
Work Unit Analysis ToolWork Unit Analysis Tool
Work Unit Analysis Tool
 
Community Award Ceremony
Community Award Ceremony Community Award Ceremony
Community Award Ceremony
 
Dapper Tool - A Bundle to Make your ECL Neater
Dapper Tool - A Bundle to Make your ECL NeaterDapper Tool - A Bundle to Make your ECL Neater
Dapper Tool - A Bundle to Make your ECL Neater
 
A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...
A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...
A Success Story of Challenging the Status Quo: Gadget Girls and the Inclusion...
 
Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...
Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...
Beyond the Spectrum – Creating an Environment of Diversity and Empowerment wi...
 

Kürzlich hochgeladen

Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...amitlee9823
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 

Kürzlich hochgeladen (20)

Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 

Big Data and Geospatial with HPCC Systems

  • 1. Big Data and Geospatial with HPCC Systems® Powered by LexisNexis Risk Solutions Ignacio Calvo Greg McRandal 10/05/2016
  • 2. Concepts in Geospatial How to use them with HPCC Use cases @HPCCSystems
  • 3. An approach to applying statistical analysis and other analytic techniques to data which has a geographical or spatial aspect Definition
  • 4.
  • 5. Origin of Geospatial John Snow’s original map (1854), using GIS to save lives. This map was used to determine that Cholera was water-borne
  • 6. Need to know : • Format • Projection / coordinate system Understanding the data
  • 7. Formats : Vector vs Raster Vector Raster
  • 8. Projections are used to represent the world in ways we can process •The Earth is round and maps are flat •Physical Maps •Computer Maps What is a projection? Have I seen projections before? •Peter vs Mercator vs Winkel tripel •GPS (latitude/longitude) •Google Maps
  • 9. Two different projections representing the same place. Projections
  • 10. WGS84 •Latitude and longitude •Our best approximation of the world •Not always the best for a specific region •Not technically a projection Projections to know about Mercator •Many different ones, choose one based on your location •Reduces the area it covers to a simple Cartesian plane •Good near the central axis, bad far away from it : • Web Mercator covers the whole world – good near equator, gets worse as you travel north or south • Irish National Grid – very good for Ireland, awful anywhere else.
  • 11. Lies, damned lies, statistics… and maps! *https://twitter.com/flashboy/status/641221733509373952
  • 12. Lies, damned lies, statistics… and maps! Projection Woes: A straight line in Mercator is not a straight line in WGS84 Four points converted to WGS84 Where the lines should be Don’t re-project polygons! This “solution” is only good enough for visuals, not for maths.
  • 13. Lies, damned lies, statistics… and maps!
  • 14. Lies, damned lies, statistics… and maps! Visuals don’t agree with maths: Wind and Hail. Web Mercator WGS84
  • 15. Number one bug in Geospatial *http://twcc.fr
  • 16. Number one bug in Geospatial Latitude Longitude X Y LatY LonX
  • 17. Now I understand my data, what’s next? Data Ingest Index Query
  • 18. Bringing Geospatial into HPCC GOAL Bring our geospatial processes into the realm of Big Data
  • 19. STEPS Spatial filtering of vector geometries Spatial operations using vector geometries Spatial reference projection and transformation Reading of compressed geo-raster files Big Data Extend HPCC and ECL to support the following main capabilities :
  • 20. STEPS Big Data Integration of open source libraries
  • 21. Ingesting Vector Data It’s a CSV file. Id Name Geometry Projection Value 1 Alice’s place POINT (53.78925462 -6.08354321) 4326* €5,973,000 2 Bob’s place POINT (-34.78925462 7.08354321) 4326 €872,000 3 Celine’s place POINT (102.78925462 -6.08354321) 4326 €9,324,000 * WGS84 (Lat/Lon) 3. Peril tag 2. Geocode address 1. Policy data Data ready to ingest
  • 22. Ingesting Vector Data It’s a GML / XML file. 3. Process and index 2. Parse XPATH 1. Shape data Data ready to query
  • 23. Ingesting Vector Data It’s a GML / XML file. 3. Process and index 2. Parse XPATH 1. Shape data Data ready to query
  • 24. Ingesting Vector Data It’s a GML / XML file. 3. Process and index 2. Parse XPATH 1. Shape data Data ready to query
  • 25. Indexing vector data • Outline Box: Biggest rectangle • Boxes contain boxes • Bottom box in the tree contains actual geometries • Here, 3 levels pictured • Boxes can overlap (entries are only in one)
  • 26. Querying vector data Searching an R-Tree: e.g. Finding all buildings (points) inside a flood zone (polygon) Does the query polygon overlap our box? Return empty list Search our boxes’ children Is it a leaf node? Return all nodes for verification Y N Y N
  • 27. Ingesting Raster Data It’s a raster / TIFF file. Bitmap image 3. Process and index 2. Tile and spray 1. Raster data Data ready to query
  • 28. Ingesting Raster Data 3. Process and index 2. Tile and spray 1. Raster data Data ready to query Tiling divides raster images into small manageable areas of known dimensions. These tiles have their own metadata: • Bounding box • Grid position
  • 29. Ingesting Raster Data 3. Process and index 2. Tile and spray 1. Raster data Data ready to query 1. Figure out which grid position the geometry needs 2. Extract the required pixel 3. Interrogate the pixel for its value 4. Interpret its value 5. Return to user
  • 30. Ingesting Raster Data It’s a raster / TIFF file. Bitmap image 3. Process and index 2. Tile and spray 1. Raster data Data ready to query
  • 31. Ingesting Raster Data It’s a raster / TIFF file. 3. Process and index 2. Tile and spray 1. Raster data Data ready to query
  • 32. Bringing it all together *Andrew Farrell In pursuit of perils : Geo-spatial risk analysis through HPCC Systems https://hpccsystems.com/resources/blog/afarrell/pursuit-perils-geo-spatial-risk-analysis- through-hpcc-systems
  • 33. Add even more value
  • 34. Add even more value
  • 35. Why Geospatial with HPCC? • Efficient parallel processing • Ability to import libraries from different languages • Good coverage of functions and spatial predicates • Fast ingestion • Support for different formats • Sub-second queries
  • 36.