SlideShare ist ein Scribd-Unternehmen logo
1 von 24
This project has received funding from the European Union’s Horizon 2020
research and innovation program under grant agreement No. 688191.
@tobiaslindener
lindener@kth.se
Approximate Standing
Queries on Apache Flink
Overview
1. Introduction
2. Background
3. Design & Implementation
4. Results
04.09.2018 2
It is better to use a crude approximation and know the truth, plus or minus
10 percent, than demand an exact solution and know nothing at all
In Arthur Bloch, The Complete Murphy's Law: A Definitive Collection (1991), 126
04.09.2018 3
Unbounded Stream
04.09.2018 4
ID_A
P_A
Key Count
A 1
Time
Infinite
Growth
Unbounded Stream
04.09.2018 5
ID_A
P_A
ID_B
P_A
Key Count
A 1
B 1
Time
Infinite
Growth
Unbounded Stream
04.09.2018 6
ID_A
P_A
ID_C
P_A
ID_B
P_A
Key Count
A 1
B 1
C 1
Time
Infinite
Growth
Unbounded Stream
04.09.2018 7
ID_A
P_A
ID_C
P_A
ID_D
P_A
ID_B
P_A
Key Count
A 1
B 1
C 1
D 1
Time
Infinite
Growth
Unbounded Stream
04.09.2018 8
ID_A
P_A
ID_C
P_A
ID_D
P_A
ID_C
P_A
ID_B
P_A
Key Count
A 1
B 1
C 2
D 1
Time
Infinite
Growth
Unbounded Stream
04.09.2018 9
ID_A
P_A
ID_C
P_A
ID_D
P_A
ID_C
P_A
ID_B
P_A
Key Count
A 1
B 1
C 2
D 1
Time
Infinite
Growth
Approximation Algorithms
Use-cases
• Identify heavy hitters (Count)
• Cardinality Estimation (Count Distinct)
Algorithms
• Frequent Item Estimation
• HyperLogLog
• Quantile Estimation
04.09.2018 10
Processing
04.09.2018 11
Flink Architecture (Apache Software Foundation, 2018)
Approximate Queries
Processing
04.09.2018 12
Flink Architecture (Apache Software Foundation, 2018)
Approximate
Queries
Approximate Query API (1)
04.09.2018 13
Approximate Query API (2)
04.09.2018 14
Sketch Capacity
04.09.2018 15
Estimate Deviation - Frequent Items
WIKITRACE DATASET TOP 6000 URL AMAZON DATASET TOP 1000 REVIEWER
04.09.2018 16
Estimate Deviation - HyperLogLog
04.09.2018 17
Native
Implementation
• Potentially increased efficiency
• Reduced overhead
• Stateful processing
Integration with
Table API
• CQL with Approximate Queries
• Support for Count Distinct
Queryable State • Reduced data handling
Future Work
04.09.2018 18
Queryable State
04.09.2018 19
Stream Elements Query Results
Sketch
Function
Time
Queryable State
04.09.2018 20
Stream Elements Query Results
Sketch
Function
Time
Query Results
REST API
Conclusion
CHALLENGES
• Efficient Grouping (HLL)
• Stateful Native Implementation
• Skewed Datasets
LEARNINGS
• Importance of Data
Distribution
• Performance Advantages
• Algorithm Parameters
04.09.2018 21
Team
04.09.2018 22
Tobias Lindener
Consultant @ Netlight
Theodore Vasiloudis
Researcher @ RISE
Paris Carbone
PhD candidate @ KTH
Slides
bit.ly/2LULyoZ
References & Links
• https://github.com/tlindener/ApproximateQueries
• https://datasketches.github.io/
• Daniel Anderson, Pryce Bevan, Kevin Lang, Edo Liberty, Lee
Rhodes, Justin Thaler. A High-Performance Algorithm for
Identifying Frequent Items in Data Streams.
• Kevin Lang, Back to the Future: an Even More Nearly Optimal
Cardinality Estimation Algorithm.
04.09.2018 23
Evaluation Environment
▪ WikiTrace Dataset (9 GB)
▪ Address (6,708,723 distinct urls)
▪ Amazon Rating Dataset (3 GB)
▪ ProductId (9,874,210 distinct
items)
▪ ReviewerId (21,176,521 distinct
users)
▪ Ryzen 1600 (6C/12T)
▪ 16GB RAM
▪ Ubuntu 18.04
▪ OpenJDK 8
▪ JVM tuned for max 10 GB heap
▪ Flink 1.4.2
▪ Standalone mode
▪ Evaluation through python scripts
04.09.2018 24

Weitere ähnliche Inhalte

Was ist angesagt?

Just add Imagination
Just add ImaginationJust add Imagination
Just add ImaginationKNIMESlides
 
KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...
KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...
KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...KNIMESlides
 
AWS IoT Day - Using AWS IoT Core
AWS IoT Day - Using AWS IoT CoreAWS IoT Day - Using AWS IoT Core
AWS IoT Day - Using AWS IoT CoreAmazon Web Services
 
Beginners guide to weather and climate data
Beginners guide to weather and climate dataBeginners guide to weather and climate data
Beginners guide to weather and climate dataMargriet Groenendijk
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareKNIMESlides
 
Sharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerSharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerKNIMESlides
 
From Raw Data to Deployment
From Raw Data to DeploymentFrom Raw Data to Deployment
From Raw Data to DeploymentKNIMESlides
 
Get Your Aircraft Spare Parts Inventory Management Off the Ground
Get Your Aircraft Spare Parts Inventory Management Off the GroundGet Your Aircraft Spare Parts Inventory Management Off the Ground
Get Your Aircraft Spare Parts Inventory Management Off the GroundPTC
 
Stack for high performance in data-intensive operations
Stack for high performance in data-intensive operationsStack for high performance in data-intensive operations
Stack for high performance in data-intensive operationsOnedot
 
REF compliance - what Jisc is doing
REF compliance - what Jisc is doingREF compliance - what Jisc is doing
REF compliance - what Jisc is doingRepository Fringe
 
ODSC Europe: Weather and Climate Data: Not Just for Meteorologists
ODSC Europe: Weather and Climate Data: Not Just for MeteorologistsODSC Europe: Weather and Climate Data: Not Just for Meteorologists
ODSC Europe: Weather and Climate Data: Not Just for MeteorologistsMargriet Groenendijk
 
Efficiency analyzer for wallblower iiot_quest
Efficiency analyzer for wallblower iiot_questEfficiency analyzer for wallblower iiot_quest
Efficiency analyzer for wallblower iiot_questUma Santharam
 
Introduction to the IBM Watson Data Platform
Introduction to the IBM Watson Data PlatformIntroduction to the IBM Watson Data Platform
Introduction to the IBM Watson Data PlatformMargriet Groenendijk
 
Acoustic io t rail monitoring.pptx
Acoustic io t rail monitoring.pptxAcoustic io t rail monitoring.pptx
Acoustic io t rail monitoring.pptxIBM Sverige
 
GI2016 ppt charvat senslog api as tools for collection of big vgi data
GI2016 ppt charvat senslog api as tools for collection of big vgi dataGI2016 ppt charvat senslog api as tools for collection of big vgi data
GI2016 ppt charvat senslog api as tools for collection of big vgi dataIGN Vorstand
 
DSD-INT 2018 Impact of flooding on critical infrastructures - Mulder
DSD-INT 2018 Impact of flooding on critical infrastructures - MulderDSD-INT 2018 Impact of flooding on critical infrastructures - Mulder
DSD-INT 2018 Impact of flooding on critical infrastructures - MulderDeltares
 
Smart orchestrator for pipeline processing chain applied to space data cwin18...
Smart orchestrator for pipeline processing chain applied to space data cwin18...Smart orchestrator for pipeline processing chain applied to space data cwin18...
Smart orchestrator for pipeline processing chain applied to space data cwin18...Capgemini
 
Ground Penetrating Radar (GPR) | Military | UXO
Ground Penetrating Radar (GPR) | Military | UXOGround Penetrating Radar (GPR) | Military | UXO
Ground Penetrating Radar (GPR) | Military | UXOSensors & Software Inc.
 
re:Inventから見えたDeepRacer Leagueで勝つための心構え
re:Inventから見えたDeepRacer Leagueで勝つための心構えre:Inventから見えたDeepRacer Leagueで勝つための心構え
re:Inventから見えたDeepRacer Leagueで勝つための心構えTakatoshi Ohmura
 

Was ist angesagt? (20)

Just add Imagination
Just add ImaginationJust add Imagination
Just add Imagination
 
KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...
KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...
KNIME Data Science Learnathon: From Raw Data To Deployment - Paris - November...
 
AWS IoT Day - Using AWS IoT Core
AWS IoT Day - Using AWS IoT CoreAWS IoT Day - Using AWS IoT Core
AWS IoT Day - Using AWS IoT Core
 
Beginners guide to weather and climate data
Beginners guide to weather and climate dataBeginners guide to weather and climate data
Beginners guide to weather and climate data
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME Software
 
Sharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerSharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME Server
 
From Raw Data to Deployment
From Raw Data to DeploymentFrom Raw Data to Deployment
From Raw Data to Deployment
 
Get Your Aircraft Spare Parts Inventory Management Off the Ground
Get Your Aircraft Spare Parts Inventory Management Off the GroundGet Your Aircraft Spare Parts Inventory Management Off the Ground
Get Your Aircraft Spare Parts Inventory Management Off the Ground
 
Stack for high performance in data-intensive operations
Stack for high performance in data-intensive operationsStack for high performance in data-intensive operations
Stack for high performance in data-intensive operations
 
REF compliance - what Jisc is doing
REF compliance - what Jisc is doingREF compliance - what Jisc is doing
REF compliance - what Jisc is doing
 
ODSC Europe: Weather and Climate Data: Not Just for Meteorologists
ODSC Europe: Weather and Climate Data: Not Just for MeteorologistsODSC Europe: Weather and Climate Data: Not Just for Meteorologists
ODSC Europe: Weather and Climate Data: Not Just for Meteorologists
 
Efficiency analyzer for wallblower iiot_quest
Efficiency analyzer for wallblower iiot_questEfficiency analyzer for wallblower iiot_quest
Efficiency analyzer for wallblower iiot_quest
 
Introduction to the IBM Watson Data Platform
Introduction to the IBM Watson Data PlatformIntroduction to the IBM Watson Data Platform
Introduction to the IBM Watson Data Platform
 
Acoustic io t rail monitoring.pptx
Acoustic io t rail monitoring.pptxAcoustic io t rail monitoring.pptx
Acoustic io t rail monitoring.pptx
 
GI2016 ppt charvat senslog api as tools for collection of big vgi data
GI2016 ppt charvat senslog api as tools for collection of big vgi dataGI2016 ppt charvat senslog api as tools for collection of big vgi data
GI2016 ppt charvat senslog api as tools for collection of big vgi data
 
DSD-INT 2018 Impact of flooding on critical infrastructures - Mulder
DSD-INT 2018 Impact of flooding on critical infrastructures - MulderDSD-INT 2018 Impact of flooding on critical infrastructures - Mulder
DSD-INT 2018 Impact of flooding on critical infrastructures - Mulder
 
Smart orchestrator for pipeline processing chain applied to space data cwin18...
Smart orchestrator for pipeline processing chain applied to space data cwin18...Smart orchestrator for pipeline processing chain applied to space data cwin18...
Smart orchestrator for pipeline processing chain applied to space data cwin18...
 
Ground Penetrating Radar (GPR) | Military | UXO
Ground Penetrating Radar (GPR) | Military | UXOGround Penetrating Radar (GPR) | Military | UXO
Ground Penetrating Radar (GPR) | Military | UXO
 
re:Inventから見えたDeepRacer Leagueで勝つための心構え
re:Inventから見えたDeepRacer Leagueで勝つための心構えre:Inventから見えたDeepRacer Leagueで勝つための心構え
re:Inventから見えたDeepRacer Leagueで勝つための心構え
 
Integrating vert.x v2
Integrating vert.x v2Integrating vert.x v2
Integrating vert.x v2
 

Ähnlich wie Flink Forward Berlin 2018: Tobias Lindener - "Approximate standing queries on Stream Processing"

Network of Networks Share Group Spring Update
Network of Networks Share Group Spring UpdateNetwork of Networks Share Group Spring Update
Network of Networks Share Group Spring UpdateLora Cecere
 
Integrating, exposing and managing distributed data with RESTful APIs and op...
Integrating, exposing and managing distributed data with RESTful APIs and op...Integrating, exposing and managing distributed data with RESTful APIs and op...
Integrating, exposing and managing distributed data with RESTful APIs and op...3scale
 
State of enterprise data science
State of enterprise data scienceState of enterprise data science
State of enterprise data scienceYan Xu
 
Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0
Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0
Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0Mirko Kleiner
 
K8s & cloud native past, present and future
K8s & cloud native past, present and futureK8s & cloud native past, present and future
K8s & cloud native past, present and futureCheryl Hung
 
digitalization in oil & gas at an inflection point
digitalization in oil & gas at an inflection pointdigitalization in oil & gas at an inflection point
digitalization in oil & gas at an inflection pointJoseph Elkhoury
 
K8s & cloud native past, present and future
K8s & cloud native past, present and futureK8s & cloud native past, present and future
K8s & cloud native past, present and futureCheryl Hung
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH ModernizationTrivadis
 
Jax London 2018: "Testing Microservices from Development to Production"
Jax London 2018: "Testing Microservices from Development to Production"Jax London 2018: "Testing Microservices from Development to Production"
Jax London 2018: "Testing Microservices from Development to Production"Daniel Bryant
 
IoT Market in Canada
IoT Market in CanadaIoT Market in Canada
IoT Market in CanadaNikhil Mittal
 
Artificial Intelligence and the Cognitive Revolution – the next frontier?
Artificial Intelligence and the Cognitive Revolution – the next frontier?Artificial Intelligence and the Cognitive Revolution – the next frontier?
Artificial Intelligence and the Cognitive Revolution – the next frontier?Level
 
Early adopter group and closing of webinar - João Fernandes (CERN)
Early adopter group and closing of webinar - João Fernandes (CERN)Early adopter group and closing of webinar - João Fernandes (CERN)
Early adopter group and closing of webinar - João Fernandes (CERN)Helix Nebula The Science Cloud
 
Produktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jProduktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jNeo4j
 
Technip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matterTechnip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matterNeo4j
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANHong-Linh Truong
 
INTERFACE, by apidays - The Evolution of Data Movement.pdf
INTERFACE, by apidays - The Evolution of Data Movement.pdfINTERFACE, by apidays - The Evolution of Data Movement.pdf
INTERFACE, by apidays - The Evolution of Data Movement.pdfapidays
 
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...Repository Power: How Repositories can support Open Access Mandates (OR2015 O...
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...OpenAIRE
 
Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalMartin Kaltenböck
 
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...Alok Singh
 

Ähnlich wie Flink Forward Berlin 2018: Tobias Lindener - "Approximate standing queries on Stream Processing" (20)

Network of Networks Share Group Spring Update
Network of Networks Share Group Spring UpdateNetwork of Networks Share Group Spring Update
Network of Networks Share Group Spring Update
 
Integrating, exposing and managing distributed data with RESTful APIs and op...
Integrating, exposing and managing distributed data with RESTful APIs and op...Integrating, exposing and managing distributed data with RESTful APIs and op...
Integrating, exposing and managing distributed data with RESTful APIs and op...
 
State of enterprise data science
State of enterprise data scienceState of enterprise data science
State of enterprise data science
 
Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0
Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0
Cips Meetup Auckland-Mirko-Kleiner-lean-agile-procurement-201904-1.0
 
K8s & cloud native past, present and future
K8s & cloud native past, present and futureK8s & cloud native past, present and future
K8s & cloud native past, present and future
 
digitalization in oil & gas at an inflection point
digitalization in oil & gas at an inflection pointdigitalization in oil & gas at an inflection point
digitalization in oil & gas at an inflection point
 
K8s & cloud native past, present and future
K8s & cloud native past, present and futureK8s & cloud native past, present and future
K8s & cloud native past, present and future
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH Modernization
 
Jax London 2018: "Testing Microservices from Development to Production"
Jax London 2018: "Testing Microservices from Development to Production"Jax London 2018: "Testing Microservices from Development to Production"
Jax London 2018: "Testing Microservices from Development to Production"
 
IoT Market in Canada
IoT Market in CanadaIoT Market in Canada
IoT Market in Canada
 
Artificial Intelligence and the Cognitive Revolution – the next frontier?
Artificial Intelligence and the Cognitive Revolution – the next frontier?Artificial Intelligence and the Cognitive Revolution – the next frontier?
Artificial Intelligence and the Cognitive Revolution – the next frontier?
 
Early adopter group and closing of webinar - João Fernandes (CERN)
Early adopter group and closing of webinar - João Fernandes (CERN)Early adopter group and closing of webinar - João Fernandes (CERN)
Early adopter group and closing of webinar - João Fernandes (CERN)
 
Produktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jProduktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4j
 
Technip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matterTechnip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matter
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
 
INTERFACE, by apidays - The Evolution of Data Movement.pdf
INTERFACE, by apidays - The Evolution of Data Movement.pdfINTERFACE, by apidays - The Evolution of Data Movement.pdf
INTERFACE, by apidays - The Evolution of Data Movement.pdf
 
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...Repository Power: How Repositories can support Open Access Mandates (OR2015 O...
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...
 
Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance Professional
 
Flink Meetup Septmeber 2017 2018
Flink Meetup Septmeber 2017 2018Flink Meetup Septmeber 2017 2018
Flink Meetup Septmeber 2017 2018
 
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
 

Mehr von Flink Forward

Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Flink Forward
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkFlink Forward
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...Flink Forward
 
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Flink Forward
 
Introducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorIntroducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorFlink Forward
 
Autoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeFlink Forward
 
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Flink Forward
 
One sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkOne sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkFlink Forward
 
Tuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxTuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxFlink Forward
 
Flink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink Forward
 
Apache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraApache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraFlink Forward
 
Where is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkWhere is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkFlink Forward
 
Using the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentUsing the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentFlink Forward
 
The Current State of Table API in 2022
The Current State of Table API in 2022The Current State of Table API in 2022
The Current State of Table API in 2022Flink Forward
 
Flink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink Forward
 
Dynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsDynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsFlink Forward
 
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotExactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotFlink Forward
 
Processing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesProcessing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesFlink Forward
 
Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Flink Forward
 
Batch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergBatch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergFlink Forward
 

Mehr von Flink Forward (20)

Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in Flink
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
 
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
 
Introducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorIntroducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes Operator
 
Autoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive Mode
 
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
 
One sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkOne sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async Sink
 
Tuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxTuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptx
 
Flink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at Pinterest
 
Apache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraApache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native Era
 
Where is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkWhere is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in Flink
 
Using the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentUsing the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production Deployment
 
The Current State of Table API in 2022
The Current State of Table API in 2022The Current State of Table API in 2022
The Current State of Table API in 2022
 
Flink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink SQL on Pulsar made easy
Flink SQL on Pulsar made easy
 
Dynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsDynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data Alerts
 
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotExactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
 
Processing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesProcessing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial Services
 
Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...
 
Batch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergBatch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & Iceberg
 

Kürzlich hochgeladen

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 

Kürzlich hochgeladen (20)

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 

Flink Forward Berlin 2018: Tobias Lindener - "Approximate standing queries on Stream Processing"