SlideShare ist ein Scribd-Unternehmen logo
1 von 20
NoizCrowd:
A Crowd-Based Data Gathering and
Management System for Noise Level Data
Mariusz Wisniewski, Gianluca
Demartini, Apostolos Malatras, and
Philippe CudrĂŠ-Mauroux
University of Fribourg, Switzerland
Motivation - Big Data
• Large dataset are necessary to enable analytics
and support decision making
– Meteorological station / car traffic
• Set up a large-scale sensing infrastructure is
costly and time-consuming
• Create a large amount of valuable data
– Crowdsourcing
– Data generation models
– Smartphones as sensors
– Big Data analytics
Gianluca Demartini 2
NoizCrowd
• A crowd-sensing approach to big data generation
using commodity sensors
• Crowd-source noise level in a geo region
• Noise propagation models to generate data
• Array data management techniques to scale
• Results accessible via a visual interface
• Support decisions (e.g., where to live)
Gianluca Demartini 3
Outline
• Related approaches
• NoizCrowd Architecture Overview
– Data Gathering
– Storage
– Modeling
– Export and Visualization
• Data Models
• Performance Evaluation
Gianluca Demartini 4
Related Work
• Participatory Sensing vs Sensor Networks
– Low cost / High cost
– Mobile phones / Sensors
– Distributed / Centralized management
– Privacy, data quality
• Applications: Environment, vehicle routing
Gianluca Demartini 5
Related Work
• Noise Mapping Apps
– NoiseTube: opensource, widespread usage
– NoiseMap: control over data
– SoundSense: machine learning to classify sounds
• NoizCrowd
– Data in RDF linkable to other datasets
(linkeddata.org)
– Scalable storage: generate data by interpolation
Gianluca Demartini 6
NoizCrowd Architecture
Gianluca Demartini 7
Data Gathering
• By means of Crowd-sourcing
– GPS: location
– Microphone: noise level
– Internet connection: send data to server
• Microphone Calibration
– Sound level meter
– Sharing conversion table for smartphone models
Gianluca Demartini 8
Data Storage
• App sends median and peak dB values over
few seconds
• Spatio-temporal data: non-relational storage
system (SciDB)
– Durable storage
– Retrieve data to build models
– Export data for visualization
• Multi-dimensional array (space and time)
• Distributed storage
Gianluca Demartini 9
Noise Modeling
• Data from crowd is noisy and skewed/sparse
• Raw data is not shown to the end users
• Models to deal with
– Overlapping data
– Missing data
Gianluca Demartini 10
Data Export and Visualization
• From SciDB data is
– converted to RDF
– stored in dipLODocus[RDF]
– Available via SPARQL
• Visualization
– Overlay noise level on a map
– Additional chart for time evolution
Gianluca Demartini 11
Gianluca Demartini 12
Data Models
• Spatial Interpolation
– In the same time interval, data from different
locations
– Need to be computational simple (large volume)
– Bi-dimensional range queries in space (SciDB)
– K-nearest neighbor interpolation
– Computed in parallel
Gianluca Demartini
Data Models
• Temporal interpolation
– Short ranges (minutes) like spatial interp. in 3D
– Long ranges, look for patterns and infer
• E.g., every Monday at 11am we have 50dB and we miss
a Monday measurement
• E.g., same measurement (50dB) in same area 2h ago
and now
Gianluca Demartini 14
Noise Propagation Models
• We adopt an existing model that takes into
account:
– Sound power
– Distance from source
– Directivity
– Atmospheric absorption
– Excess attenuation (we use meteo conditions)
• Difficult to measure with smartphone
• Constant in a given region (and use GPS info)
Gianluca Demartini 15
Materialization of Models
• Data from models
– Is computationally expensive to generate
– May be a lot since we can cover any region
• We do late materialization
– At query time
– Only for the specific request
– Cached and indexed for future requests
– Incremental updates of views, if possible
Gianluca Demartini 16
Performance Evaluation (1)
• 30 outdoor deployments
– 2,3,4 smartphones
– Multiple noise sources
– Urban setting, flat area of 50x50 meters
• Professional-grade noise level meter as gold
standard measurement
• 85% of interpolated data +-6dB error
• 63% of interpolated data +-4dB error
Gianluca Demartini 17
Performance Evaluation (2)
• Sound propagation and source location
• 3 smartphones, 100dB source
Gianluca Demartini 18
Performance Evaluation (3)
• Sound level of source error
– 16% with 3 measurements
– 10% with 4 measurements
– 9% with 5 measurements
• Source location
– 3m error on average
Gianluca Demartini 19
NoizCrowd - Conclusions
• Large scale data is key for decision making
• Crowd-source noise level data using mobiles
– Scale-out using an array backend
– Generate missing data and visualize
• Next steps
– Android app
– Data recording as background feature
– Additional materialization strategies
http://exascale.info
Gianluca Demartini 20

Weitere ähnliche Inhalte

Was ist angesagt?

Wi-Fi based indoor positioning
Wi-Fi based indoor positioningWi-Fi based indoor positioning
Wi-Fi based indoor positioningSherwin Rodrigues
 
3D remote sensing of mines
3D remote sensing of mines 3D remote sensing of mines
3D remote sensing of mines CLIC Innovation Ltd
 
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSravan Puttagunta
 
Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2Stephan Linzner
 
Indoor Positioning Systems
Indoor Positioning SystemsIndoor Positioning Systems
Indoor Positioning SystemsProjectENhANCE
 
Indoor Tracking System
Indoor Tracking SystemIndoor Tracking System
Indoor Tracking SystemSherwin Rodrigues
 
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...YutaFunada
 
Precision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location SystemsPrecision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location SystemsPeter Batty
 
Lecture 6 geolocation
Lecture 6 geolocationLecture 6 geolocation
Lecture 6 geolocationmoduledesign
 
GET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living LabGET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living LabCecilia Rocha
 
IoT Applications based on LoRaWan
IoT Applications based on LoRaWanIoT Applications based on LoRaWan
IoT Applications based on LoRaWanDaniel Koller
 

Was ist angesagt? (13)

Wi-Fi based indoor positioning
Wi-Fi based indoor positioningWi-Fi based indoor positioning
Wi-Fi based indoor positioning
 
3D remote sensing of mines
3D remote sensing of mines 3D remote sensing of mines
3D remote sensing of mines
 
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
 
Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2
 
Indoor Positioning Systems
Indoor Positioning SystemsIndoor Positioning Systems
Indoor Positioning Systems
 
Indoor Tracking System
Indoor Tracking SystemIndoor Tracking System
Indoor Tracking System
 
Geolocalisation
GeolocalisationGeolocalisation
Geolocalisation
 
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
 
Precision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location SystemsPrecision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location Systems
 
Lecture 6 geolocation
Lecture 6 geolocationLecture 6 geolocation
Lecture 6 geolocation
 
Indoor navigation system
Indoor navigation systemIndoor navigation system
Indoor navigation system
 
GET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living LabGET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living Lab
 
IoT Applications based on LoRaWan
IoT Applications based on LoRaWanIoT Applications based on LoRaWan
IoT Applications based on LoRaWan
 

Ähnlich wie NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data

Automation of National Noise Model
Automation of National Noise Model Automation of National Noise Model
Automation of National Noise Model Safe Software
 
Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015Arpan Pal
 
Advancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D DataAdvancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D DataMerrick & Company
 
20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona ai20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona aiAlbert Pujol Torras
 
(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018Brett Johnson
 
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Gloria Re Calegari
 
Urban senseoverview201507
Urban senseoverview201507Urban senseoverview201507
Urban senseoverview201507Ana Aguiar
 
Singapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationSingapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationMonica Moran
 
PathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed DataPathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed DataUniversity of Geneva
 
Will camera technology become an ITS sensor
Will camera technology become an ITS sensorWill camera technology become an ITS sensor
Will camera technology become an ITS sensorAllied Vision
 
Geo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality PerspectiveGeo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality PerspectiveAntonio Camara
 
Ambient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasAmbient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasFulvio Corno
 
matdid473708.pdf
matdid473708.pdfmatdid473708.pdf
matdid473708.pdfssuser3b7a36
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningAlbert Pujol Torras
 
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...Daniela Mazza
 
On Crowd-sensing back-end
On Crowd-sensing back-endOn Crowd-sensing back-end
On Crowd-sensing back-endColdbeans Software
 
Crowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadoopCrowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadooplucenerevolution
 

Ähnlich wie NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data (20)

Automation of National Noise Model
Automation of National Noise Model Automation of National Noise Model
Automation of National Noise Model
 
Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015
 
Advancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D DataAdvancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D Data
 
20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona ai20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona ai
 
(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018
 
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
 
Urban senseoverview201507
Urban senseoverview201507Urban senseoverview201507
Urban senseoverview201507
 
Singapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationSingapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart Nation
 
PathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed DataPathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed Data
 
Intro
IntroIntro
Intro
 
Will camera technology become an ITS sensor
Will camera technology become an ITS sensorWill camera technology become an ITS sensor
Will camera technology become an ITS sensor
 
Geo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality PerspectiveGeo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality Perspective
 
Ambient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasAmbient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application Areas
 
matdid473708.pdf
matdid473708.pdfmatdid473708.pdf
matdid473708.pdf
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learning
 
PPT-DC.pptx
PPT-DC.pptxPPT-DC.pptx
PPT-DC.pptx
 
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
 
On Crowd-sensing back-end
On Crowd-sensing back-endOn Crowd-sensing back-end
On Crowd-sensing back-end
 
DGterzo
DGterzoDGterzo
DGterzo
 
Crowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadoopCrowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadoop
 

Mehr von eXascale Infolab

Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictionBeyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictioneXascale Infolab
 
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...eXascale Infolab
 
Representation Learning on Complex Graphs
Representation Learning on Complex GraphsRepresentation Learning on Complex Graphs
Representation Learning on Complex GraphseXascale Infolab
 
A force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory mapA force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory mapeXascale Infolab
 
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...eXascale Infolab
 
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...eXascale Infolab
 
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data OceansDependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data OceanseXascale Infolab
 
Crowd scheduling www2016
Crowd scheduling www2016Crowd scheduling www2016
Crowd scheduling www2016eXascale Infolab
 
SANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutionSANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutioneXascale Infolab
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataEfficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataeXascale Infolab
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data ManagementeXascale Infolab
 
SSSW 2015 Sense Making
SSSW 2015 Sense MakingSSSW 2015 Sense Making
SSSW 2015 Sense MakingeXascale Infolab
 
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked DataLDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked DataeXascale Infolab
 
Executing Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web DataExecuting Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web DataeXascale Infolab
 
The Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task CrowdsourcingThe Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task CrowdsourcingeXascale Infolab
 
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...eXascale Infolab
 
CIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition rankingCIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition rankingeXascale Infolab
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big DataeXascale Infolab
 

Mehr von eXascale Infolab (20)

Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictionBeyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
 
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
 
Representation Learning on Complex Graphs
Representation Learning on Complex GraphsRepresentation Learning on Complex Graphs
Representation Learning on Complex Graphs
 
A force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory mapA force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory map
 
Cikm 2018
Cikm 2018Cikm 2018
Cikm 2018
 
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
 
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
 
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data OceansDependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
 
Crowd scheduling www2016
Crowd scheduling www2016Crowd scheduling www2016
Crowd scheduling www2016
 
SANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutionSANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference Resolution
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataEfficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked Data
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data Management
 
SSSW 2015 Sense Making
SSSW 2015 Sense MakingSSSW 2015 Sense Making
SSSW 2015 Sense Making
 
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked DataLDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
 
Executing Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web DataExecuting Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web Data
 
The Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task CrowdsourcingThe Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task Crowdsourcing
 
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
 
CIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition rankingCIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition ranking
 
OLTP-Bench
OLTP-BenchOLTP-Bench
OLTP-Bench
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big Data
 

KĂźrzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 

KĂźrzlich hochgeladen (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data

  • 1. NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data Mariusz Wisniewski, Gianluca Demartini, Apostolos Malatras, and Philippe CudrĂŠ-Mauroux University of Fribourg, Switzerland
  • 2. Motivation - Big Data • Large dataset are necessary to enable analytics and support decision making – Meteorological station / car traffic • Set up a large-scale sensing infrastructure is costly and time-consuming • Create a large amount of valuable data – Crowdsourcing – Data generation models – Smartphones as sensors – Big Data analytics Gianluca Demartini 2
  • 3. NoizCrowd • A crowd-sensing approach to big data generation using commodity sensors • Crowd-source noise level in a geo region • Noise propagation models to generate data • Array data management techniques to scale • Results accessible via a visual interface • Support decisions (e.g., where to live) Gianluca Demartini 3
  • 4. Outline • Related approaches • NoizCrowd Architecture Overview – Data Gathering – Storage – Modeling – Export and Visualization • Data Models • Performance Evaluation Gianluca Demartini 4
  • 5. Related Work • Participatory Sensing vs Sensor Networks – Low cost / High cost – Mobile phones / Sensors – Distributed / Centralized management – Privacy, data quality • Applications: Environment, vehicle routing Gianluca Demartini 5
  • 6. Related Work • Noise Mapping Apps – NoiseTube: opensource, widespread usage – NoiseMap: control over data – SoundSense: machine learning to classify sounds • NoizCrowd – Data in RDF linkable to other datasets (linkeddata.org) – Scalable storage: generate data by interpolation Gianluca Demartini 6
  • 8. Data Gathering • By means of Crowd-sourcing – GPS: location – Microphone: noise level – Internet connection: send data to server • Microphone Calibration – Sound level meter – Sharing conversion table for smartphone models Gianluca Demartini 8
  • 9. Data Storage • App sends median and peak dB values over few seconds • Spatio-temporal data: non-relational storage system (SciDB) – Durable storage – Retrieve data to build models – Export data for visualization • Multi-dimensional array (space and time) • Distributed storage Gianluca Demartini 9
  • 10. Noise Modeling • Data from crowd is noisy and skewed/sparse • Raw data is not shown to the end users • Models to deal with – Overlapping data – Missing data Gianluca Demartini 10
  • 11. Data Export and Visualization • From SciDB data is – converted to RDF – stored in dipLODocus[RDF] – Available via SPARQL • Visualization – Overlay noise level on a map – Additional chart for time evolution Gianluca Demartini 11
  • 13. Data Models • Spatial Interpolation – In the same time interval, data from different locations – Need to be computational simple (large volume) – Bi-dimensional range queries in space (SciDB) – K-nearest neighbor interpolation – Computed in parallel Gianluca Demartini
  • 14. Data Models • Temporal interpolation – Short ranges (minutes) like spatial interp. in 3D – Long ranges, look for patterns and infer • E.g., every Monday at 11am we have 50dB and we miss a Monday measurement • E.g., same measurement (50dB) in same area 2h ago and now Gianluca Demartini 14
  • 15. Noise Propagation Models • We adopt an existing model that takes into account: – Sound power – Distance from source – Directivity – Atmospheric absorption – Excess attenuation (we use meteo conditions) • Difficult to measure with smartphone • Constant in a given region (and use GPS info) Gianluca Demartini 15
  • 16. Materialization of Models • Data from models – Is computationally expensive to generate – May be a lot since we can cover any region • We do late materialization – At query time – Only for the specific request – Cached and indexed for future requests – Incremental updates of views, if possible Gianluca Demartini 16
  • 17. Performance Evaluation (1) • 30 outdoor deployments – 2,3,4 smartphones – Multiple noise sources – Urban setting, flat area of 50x50 meters • Professional-grade noise level meter as gold standard measurement • 85% of interpolated data +-6dB error • 63% of interpolated data +-4dB error Gianluca Demartini 17
  • 18. Performance Evaluation (2) • Sound propagation and source location • 3 smartphones, 100dB source Gianluca Demartini 18
  • 19. Performance Evaluation (3) • Sound level of source error – 16% with 3 measurements – 10% with 4 measurements – 9% with 5 measurements • Source location – 3m error on average Gianluca Demartini 19
  • 20. NoizCrowd - Conclusions • Large scale data is key for decision making • Crowd-source noise level data using mobiles – Scale-out using an array backend – Generate missing data and visualize • Next steps – Android app – Data recording as background feature – Additional materialization strategies http://exascale.info Gianluca Demartini 20