1) The document discusses using NoSQL databases like Elasticsearch to store and analyze large amounts of traffic data in order to better manage road infrastructure and traffic planning in Norway.
2) Elasticsearch allows BEKK to index traffic event data in real-time, generate reports on vehicle counts, speeds and categories, and provide up-to-date traffic information for safety, routing and planning purposes.
3) BEKK is happy with Elasticsearch's performance and capabilities for aggregations, but notes the Java API can sometimes be complex and emphasizes the importance of ensuring fitness-for-purpose and continuous upgrades.
1. BEATING THE TRAFFIC JAM
USING NOSQL
NoSQL Matters
Kristoffer Dyrkorn, BEKK
22/11/14
2. BEKK Public Roads Administration
Norwegian consulting firm
Private and public sector enterprises
Strategy, technology, digital services
370 employees
Responsible for state and county roads
Planning, construction, operation
7500 employees
Spending: € 7 Billion (2013)
BACKGROUND
3. Population
(mill)
Area
(km2)
Roads
(1000 km)
Germany 81 357 644
Spain 46 500 683
Norway 5 385 95
CONTEXT
9. ROADS ARE INFRASTRUCTURE
BUILDING AND MAINTAINING ROADS IS EXPENSIVE
PROPER PLANNING DEPENDS ON TRAFFIC ANALYSIS
VEHICLE COUNTS & WEIGHTS DECIDE
PRECISE REPORTS ARE NEEDED
16. SYSTEM GOALS & REQUIREMENTS
EASE OF INSTALLATION AND VERIFICATION OF ROADSIDE EQUIPMENT
INCREASED DATA QUALITY
INCREASED DATA AVAILABILITY
ALL EVENTS MUST BE KEPT (NO PRE-AGGREGATION)
MINIMAL LATENCY
AD-HOC REPORTING
SCALABILITY
ROBUSTNESS
17.
18.
19.
20. A TRAFFIC EVENT
Voltage
signature
OPC-UA
event
Bulked
data
Sensor Data logger Application Storage
21. Sensors
System status
SenSsenosrosr s System
Other
backends
GUI
Vehicle info Reports
DATA FLOW
22. SOLUTION ARCHITECTURE
HTML5 GUI
(HTTP, JSON)
Application logic
Support libraries
Java VM
OS
Traffic
events
Reports
(CSV, SOAP)
Elasticsearch
Java VM
OS
Request/
response
Data logger
N data loggers M application servers K storage servers
24. HOW WE USE ELASTICSEARCH
BULK INDEXING, JAVA API
DATA IS INDEXED, STORED, NOT ANALYZED
TEMPORAL SHARDING
SPATIAL SHARDING
DATA CENTER-AWARE REPLICATION
NO SPECIAL OPTIMIZATIONS!
RAM, CPU, DISK
25. REPORTING
FOR A GIVEN TIME INTERVAL, PROVIDE:
• TOTAL VEHICLE COUNT AND AVERAGE SPEED,
• THE 85 AND 95 PERCENTILE SPEEDS,
• IN EACH OF 5 LENGTH CATEGORIES: THE VEHICLE COUNT AND AVERAGE SPEED,
• IN EACH OF 12 SPEED CATEGORIES: THE VEHICLE COUNT,
...AND ALL OF THIS FOR
• EACH TRAFFIC LANE AT A MEASURE POINT,
• EACH MEASURE POINT IN A REGION
28. SYSTEM VALUE
REPORTING:
• MORE COST-EFFICIENT ROAD MAINTENANCE
REAL TIME:
• ROUTING OF EMERGENCY VEHICLES
• GENERAL TRAFFIC INFORMATION TO THE PUBLIC
• ROUTE PLANNING ON HOLIDAYS
• ROUTE PLANNING FOR PARCEL SERVICES
29. EXPERIENCES USING ELASTICSEARCH
ENSURE FITNESS-TO-PURPOSE
UPGRADE CONTINUOUSLY
REVISE THE RUN-TIME ENVIRONMENT CONTINUOUSLY
THE AGGREGATIONS MODULE IS FANTASTIC
USE TOOLING (WE LIKE KOPF)
MONITOR THE RESOURCE UTILIZATION
THE JAVA API IS SOMETIMES COMPLEX
WE ARE HAPPY!