SlideShare ist ein Scribd-Unternehmen logo
1 von 25
© 2021 Google LLC. All rights reserved.
Hybrid Streaming Analytics
for Apache Kafka Users
Firat Tekiner (ftekiner@google.com)
EMEA Data Analytics Practice Lead
© 2021 Google LLC. All rights reserved.
On-premises or Other Cloud
Hybrid Kafka Reference Architecture
Dataflow
BigQuery
Cloud
Storage
Data
Studio
Cloud
Functions
AI
Platform
Bigtable
Confluent Replicator
KSQL
App App DataStore
MySQL HDFS Teradata,
Netezza
Mainframe
App App
© 2021 Google LLC. All rights reserved.
Business is transforming
Businesses have to anticipate and
act on risks and opportunities faster
than ever before
The data and events needed for
analysis are increasing in velocity,
volume, and type
Companies that are able to quickly identify and capitalize on insights within this
changing landscape have a strategic advantage.
© 2021 Google LLC. All rights reserved.
Why Enterprises
choose Google Cloud
for Streaming
Analytics
Serverless Architecture
Robust ingestion services
Unified batch and stream processing
Comprehensive set of analysis tools
Flexibility for users
© 2021 Google LLC. All rights reserved.
Serverless data analytics
From infrastructure to platform for insights
Performance tuning
Monitoring
Reliability
Deployment &
configuration
Utilization
improvements
The traditional data analytics platform
Analysis and insights
Resource provisioning
Handling growing scale
Analysis and
insights
The serverless data
analytics model
© 2021 Google LLC. All rights reserved.
Right-time Action
Dashboard
Visualize and share anomalous events in
your data.
Alerts
Manage by exception through condition-
based notifications.
Actions
Automatically trigger workflows in other
systems using conditions.
1
2
3
Looker
Blocks
© 2021 Google LLC. All rights reserved.
Comprehensive set of analysis tools
BigQuery
Cloud Data
Warehouse
Easy setup
Directly integrated with
streaming Dataflow and
Confluent Cloud
Real time
Fast insights and action
powered by BigQuery’s
Streaming API
Intelligent
Built-in ML for out-of-the-
box predictive insights
Cloud AI
Platform
AI & ML Tools
Plug-and-play
Easily experiment and
collaborate with Google’s
AI Hub
Building blocks
Tools for sight, language,
conversation, and
structured data
Fast deployment
Code-based AI platform
quickly moves ML ideas
to deployment
Tensorflow
Extended (TFX)
© 2021 Google LLC. All rights reserved.
Improve the customer experience with Real-time AI
TFX uses Dataflow and Apache Beam as the distributed data processing engine to enable
several aspects of the ML life cycle, all supported with CI/CD for ML through Kubeflow pipelines.
Predictive
Analytics
Fraud
Detection
Real-time
Personalization
More!
Proprietary + Confidential
© 2021 Google LLC. All rights reserved.
Data Analytics
& Management
Google
Cloud
Smart
Analytics
& AI
Prebuilt
ML APIs
Foundation
AI Platform
AutoML
AI Solutions
Language Conversation
Horizontal solutions
Structured Data
Language
Frameworks Compute
Contact
Center AI
Ingestion and Processing Storage and Analytics
Orchestration
Notebooks
Industry solutions
Data
Labeling
Training Prediction Continuous
evaluation
Explainability Pipelines
Compute
Engine
Cloud TPU
Cloud GPU Cloud
scheduler
Cloud
Composer
Instrumentation
Cloud Build Container
Registry
Cloud
Pub/Sub
Cloud
Dataflow
Cloud
Dataproc
Data
Fusion
Cloud
Storage BigQuery
Cloud
Bigtable
Cloud SQL
Data
Catalog
Data
Studio
Data Science and Machine Learning
Sight
Sight
Vision Video Translate Natural
Language
Tables
Video
Intelligence
Vision Natural
Language
Translate Speech-to-Text Text-to-Speech
Document AI
Dialogflow Talent Solution
Recommendation AI
© 2021 Google LLC. All rights reserved.
Flexibility for users
Apache Beam
Open-source,
unified model and
set of SDKs for
defining and
executing data
processing
Open source programming
model
Serves as the SDK for
creating Cloud Dataflow jobs;
community development
increases flexibility
Choose your language
Java, Python, Scala, and GO are available;
join DA Spotlight
for news on languages
Portability
Program in Beam, and gain the ability to
move between
Spark, Flink, Dataflow, and more
Dataflow
Simplified stream and
batch data processing
Batch and Stream
Reduce complexity and reuse code
by driving batch and stream
workloads from the same tool
Reliable and consistent processing
Exactly once processing with built-in
support for fault-tolerant execution
Simplified operations & management
Performance, scaling, availability,
security, and compliance
handled automatically
Integrated
Integration with Kafka/Confluent Cloud,
the Google Data Analytics suite,
and GCP broadly
Unified stream and batch
processing
© 2021 Google LLC. All rights reserved.
Ingest Transform Analyze
Ingest and distribute
data reliably
Fast, correct computations
quickly and simply
Machine learning &
data warehouse
Cloud Dataflow
Cloud ML
Pub/Sub BigQuery
Dataflow
Flexible stream analytics with OSS
KSQL
© 2021 Google LLC. All rights reserved.
Google Cloud has an
end-to-end, fully-
managed Stream
Analytics offering
Pub/Sub
(Messaging)
Confluent Kafka
(Messaging)*
BigQuery Streaming
API
IoT Core
Collect
Data Catalog (Metadata Management) & Composer (Workflow Orchestration)
Dataflow
(Beam Streaming)
Dataproc
(Spark Streaming and Flink)
Dataform
Kubernetes
Process
BigQuery
Bigtable
AI Platform + TFX
Integration
Databases (e.g.
Cloud SQL, Spanner)
Store and Analyze
Looker
Apigee
Firebase
Activate
Cloud Functions
* Partner Solution
© 2021 Google LLC. All rights reserved.
A platform for all users and intents throughout the data lifecycle
Fine-grained
access control
Cloud IAM
Metadata
management
Data Catalog
Always
encrypted
Data at rest and
in transit
Redact sensitive
data
Cloud DLP
Security Admin
Protecting data
Messaging
PubSub
Data Processing
Dataflow
Data Apps
Looker
(LookML)
OSS Engines
Dataproc
(Spark, Flink)
Developer
Intelligent apps
DW & DB
BigQuery ,
BigTable
Data processing
(OSS) pipelines
Dataproc
(Spark, Presto, Flink)
Data Processing
(Native) pipelines
Dataflow
Orchestration
Composer
Data engineer
Get clean, useful data
Messaging
PubSub or
Confluent Kafka
CDW
BigQuery
CDW &
Orchestration
BigQuery
Visual data
Integration
Data Fusion
ML in SQL
BigQuery ML
Data models,
catalog
Looker, Data
Catalog
Data analyst
Query and analyze
Ingestion
BigQuery
Streaming &
DTS
Governed BI
Looker
CDW in a
Spreadsheet
Connected
Sheets
Natural Language
Query
Data QnA
Business User
Insights Everywhere
Data models,
catalog
Looker, Data
Catalog
CDW
BigQuery
Portable
notebooks
AI Platform
Notebooks
Simplified ML
BigQuery ML &
Auto ML
Collaboration
Feature Store,
AI Platform
Pipelines
Spark
Dataproc
Data scientist
Models that work
CDW
BigQuery
Secure data
sharing
BigQuery
© 2021 Google LLC. All rights reserved.
Real-time Analytics GCP Approach
Event Collect Process Store and Analyze Activate
BigQuery Looker
Event stream / Integration
Pub/sub Dataflow
IoT Core
Analytics
Low Latency,
Time Series Bigtable
Apigee
Firebase
Apigee
Firebase
Monetization
Cloud Logging
...
Templates
AI Platform
Continuous
Intelligence
Edge Manager
for ML
ML at the Edge
App Activation
© 2021 Google LLC. All rights reserved.
Real-time Analytics GCP Simplified Approach
Event Collect Process Store and Analyze Activate
BigQuery
Looker
Streaming API
ELT
(Dataform)
Materialized
Views
BQML
BI Engine
Data Studio
Apigee
Connected
Sheets
Event stream / Integration
© 2021 Google LLC. All rights reserved.
Real-time Analytics Open and Partner Approach
Event Collect Process Store and Analyze Activate
Dataproc
Streaming
BigQuery
3rd Party BI and
activation tools
...
...
© 2021 Google LLC. All rights reserved.
Options
Hybrid
● Accessing Kafka on-prem directly from GCP
● Kafka replication (on-prem to GCE or Confluent Cloud’s GCP marketplace offering)
Lift and Shift
● Confluent Cloud’s fully managed Kafka (Marketplace offering)
– Connectors available to BigQuery, Cloud Storage, Pub/Sub, MongoDB Atlas, etc
– Clustering, SLAs, etc
● Self-managing Kafka on GCE
GCP Integration
● Pre-Built Dataflow Flex
● Kafka to BigQuery template
● Using Kafka Connect
● To push to Google BigQuery. Supported by Confluent and WePay
● To push to Google Cloud Pub/Sub. Supported by Google
● Fivetran, Confluent ...
How do we deploy Kafka or integrate it with the rest of the GCP
stack?
© 2021 Google LLC. All rights reserved.
On-prem
Hybrid: Access Kafka on-prem from GCP
Gateway
Google Cloud
Interconnect
& VPN
Gateway
Kafka
Cluster
Analysis
Cloud Dataflow
Analysis
Compute Engine
Analysis
Cloud Dataproc
© 2021 Google LLC. All rights reserved.
On-prem
Hybrid: Replicate Kafka on-prem to GCP
Gateway
Google Cloud
Interconnect
& VPN
Gateway
Kafka
Cluster
Kafka
Self Managed
Cluster
Compute
Engine
Analysis
Cloud
Dataflow
Analysis
Compute
Engine
Kafka Connect
Kafka Connect
Replicator
Analysis
Cloud
Dataproc
© 2021 Google LLC. All rights reserved.
On-prem
Lift and Shift: Confluent Cloud’s Kafka on GCP
Analysis
Cloud
Dataflow
Analysis
Compute
Engine
Analysis
Cloud
Dataproc
Confluent Cloud
Managed by Confluent
Kafka Cluster
Customer Project
Internet
Private network
© 2021 Google LLC. All rights reserved.
On-prem
Lift and Shift: Self-managing Kafka on GCP
Gateway
Google Cloud
Interconnect
& VPN
Gateway
Kafka
Self Managed
Cluster
Compute
Engine
Analysis
Cloud
Dataflow
Analysis
Compute
Engine
Analysis
Cloud
Dataproc
© 2021 Google LLC. All rights reserved.
GCP Integration: Using Dataflow Template
Kafka to BQ
Dataflow Template
Table
BigQuery
Kafka
Compute Engine
© 2021 Google LLC. All rights reserved.
On-prem
GCP Integration: Using Kafka Connect
Gateway
Google Cloud
Interconnect
& VPN
Gateway
Analysis
Cloud
Dataflow
Kafka Connect
Cloud Pub/Sub
Connector
Kafka Topic
Cloud Pub/Sub
Kafka Topic Dest.
BigQuery
Kafka Connect
BigQuery Connector
Internet
Private Network
Supported by Google
Supported by Confluent and WePay
Analysis
Cloud
BigQuery
© 2021 Google LLC. All rights reserved.
Comparing it to Google Cloud Pub/Sub
Self-managed Kafka
● Open source
● Set up your own auth to protect your Kafka
● You must provision and plan for load isolation
● You must support it
● You must infer costs based on variety of capacity
and availability patterns, buy components (rather
than pay for usage): CPU, disk, network
● You must design and maintain your own replication
and backup setup
● Can be used as a system of record, messages re-
read from beginning — new subscribers can read
from start (depending on retention policy)
● Order guarantees within a partition
● Large platform of streaming tools — KSQL, Schema
Registry, Connectors to/from data sources
Cloud Pub/Sub
● GCP only; however, the API can be emulated on a
Kafka server on-prem
● GCP IAM integration
● 24-hour on-call support, SLAs from Google, and
integrated monitoring with Stackdriver
● Transparent replication and backups for high
availability and durability
● Predictable bandwidth-based billing
● Global presence: Pub/Sub is already deployed in all
GCP data centers for consistent latency and high
availability. Today, only global is possible.
● Single service: You only worry about managing
topics and subscribers, rather than clusters
● At least once delivery
Thank you

Weitere ähnliche Inhalte

Was ist angesagt?

Don't Cross the Streams! (or do, we got you)
Don't Cross the Streams! (or do, we got you)Don't Cross the Streams! (or do, we got you)
Don't Cross the Streams! (or do, we got you)Caito Scherr
 
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...HostedbyConfluent
 
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...HostedbyConfluent
 
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...confluent
 
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...HostedbyConfluent
 
Should we manage events like APIs? | Kim Clark, IBM
Should we manage events like APIs? | Kim Clark, IBMShould we manage events like APIs? | Kim Clark, IBM
Should we manage events like APIs? | Kim Clark, IBMHostedbyConfluent
 
Kafka summit apac session
Kafka summit apac sessionKafka summit apac session
Kafka summit apac sessionChristina Lin
 
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache KafkaTop 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache KafkaKai Wähner
 
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...HostedbyConfluent
 
Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...
Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...
Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...confluent
 
A Solution for Leveraging Kafka to Provide End-to-End ACID Transactions
A Solution for Leveraging Kafka to Provide End-to-End ACID TransactionsA Solution for Leveraging Kafka to Provide End-to-End ACID Transactions
A Solution for Leveraging Kafka to Provide End-to-End ACID Transactionsconfluent
 
Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...
Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...
Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...HostedbyConfluent
 
EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...
EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...
EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...HostedbyConfluent
 
How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...
How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...
How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...HostedbyConfluent
 
Kubernetes Apache Kafka
Kubernetes Apache KafkaKubernetes Apache Kafka
Kubernetes Apache Kafkaconfluent
 
Digital Transformation: Highly Resilient Streaming Architecture and Strategies
Digital Transformation: Highly Resilient Streaming Architecture and StrategiesDigital Transformation: Highly Resilient Streaming Architecture and Strategies
Digital Transformation: Highly Resilient Streaming Architecture and StrategiesHostedbyConfluent
 
Stream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, Jupiter
Stream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, JupiterStream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, Jupiter
Stream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, JupiterHostedbyConfluent
 
Toward Hybrid Cloud Serverless Transparency with Lithops Framework
Toward Hybrid Cloud Serverless Transparency with Lithops FrameworkToward Hybrid Cloud Serverless Transparency with Lithops Framework
Toward Hybrid Cloud Serverless Transparency with Lithops FrameworkLibbySchulze
 
Stream processing and managing real-time data
Stream processing and managing real-time dataStream processing and managing real-time data
Stream processing and managing real-time dataAmazon Web Services
 

Was ist angesagt? (20)

Don't Cross the Streams! (or do, we got you)
Don't Cross the Streams! (or do, we got you)Don't Cross the Streams! (or do, we got you)
Don't Cross the Streams! (or do, we got you)
 
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
 
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
Building a Codeless Log Pipeline w/ Confluent Sink Connector | Pollyanna Vale...
 
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...
 
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
 
Should we manage events like APIs? | Kim Clark, IBM
Should we manage events like APIs? | Kim Clark, IBMShould we manage events like APIs? | Kim Clark, IBM
Should we manage events like APIs? | Kim Clark, IBM
 
Kafka summit apac session
Kafka summit apac sessionKafka summit apac session
Kafka summit apac session
 
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache KafkaTop 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
 
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...
 
Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...
Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...
Reacting to an Event-Driven World (Kate Stanley & Grace Jansen, IBM) Kafka Su...
 
A Solution for Leveraging Kafka to Provide End-to-End ACID Transactions
A Solution for Leveraging Kafka to Provide End-to-End ACID TransactionsA Solution for Leveraging Kafka to Provide End-to-End ACID Transactions
A Solution for Leveraging Kafka to Provide End-to-End ACID Transactions
 
Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...
Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...
Kafka for connected vehicle research | Pavle Bujanovic, Federal Highway Admin...
 
EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...
EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...
EDA Governance Model: a multicloud approach based on GitOps | Alejandro Alija...
 
How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...
How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...
How we eased out security journey with OAuth (Goodbye Kerberos!) | Paul Makka...
 
Kubernetes Apache Kafka
Kubernetes Apache KafkaKubernetes Apache Kafka
Kubernetes Apache Kafka
 
Digital Transformation: Highly Resilient Streaming Architecture and Strategies
Digital Transformation: Highly Resilient Streaming Architecture and StrategiesDigital Transformation: Highly Resilient Streaming Architecture and Strategies
Digital Transformation: Highly Resilient Streaming Architecture and Strategies
 
Stream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, Jupiter
Stream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, JupiterStream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, Jupiter
Stream Processing with Kafka and KSQL in Jupiter | Namit Mahuvakar, Jupiter
 
Toward Hybrid Cloud Serverless Transparency with Lithops Framework
Toward Hybrid Cloud Serverless Transparency with Lithops FrameworkToward Hybrid Cloud Serverless Transparency with Lithops Framework
Toward Hybrid Cloud Serverless Transparency with Lithops Framework
 
Anypoint Data Graphs
Anypoint Data GraphsAnypoint Data Graphs
Anypoint Data Graphs
 
Stream processing and managing real-time data
Stream processing and managing real-time dataStream processing and managing real-time data
Stream processing and managing real-time data
 

Ähnlich wie Hybrid Streaming Analytics for Apache Kafka Users | Firat Tekiner, Google

Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...HostedbyConfluent
 
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...HostedbyConfluent
 
Gimel and PayPal Notebooks @ TDWI Leadership Summit Orlando
Gimel and PayPal Notebooks @ TDWI Leadership Summit OrlandoGimel and PayPal Notebooks @ TDWI Leadership Summit Orlando
Gimel and PayPal Notebooks @ TDWI Leadership Summit OrlandoRomit Mehta
 
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...HostedbyConfluent
 
Hybrid data lake on google cloud with alluxio and dataproc
Hybrid data lake on google cloud  with alluxio and dataprocHybrid data lake on google cloud  with alluxio and dataproc
Hybrid data lake on google cloud with alluxio and dataprocAlluxio, Inc.
 
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Alluxio, Inc.
 
GCP for Apache Kafka® Users: Stream Ingestion and Processing
GCP for Apache Kafka® Users: Stream Ingestion and ProcessingGCP for Apache Kafka® Users: Stream Ingestion and Processing
GCP for Apache Kafka® Users: Stream Ingestion and Processingconfluent
 
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdf
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdfData & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdf
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdfChris Bingham
 
Introduction to GCP Data Flow Presentation
Introduction to GCP Data Flow PresentationIntroduction to GCP Data Flow Presentation
Introduction to GCP Data Flow PresentationKnoldus Inc.
 
Introduction to GCP DataFlow Presentation
Introduction to GCP DataFlow PresentationIntroduction to GCP DataFlow Presentation
Introduction to GCP DataFlow PresentationKnoldus Inc.
 
Big Data and ML on Google Cloud
Big Data and ML on Google CloudBig Data and ML on Google Cloud
Big Data and ML on Google CloudWlodek Bielski
 
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...Abhinav Joshi
 
Building what's next with google cloud's powerful infrastructure
Building what's next with google cloud's powerful infrastructureBuilding what's next with google cloud's powerful infrastructure
Building what's next with google cloud's powerful infrastructureMediaAgility
 
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...Timothy Spann
 
A fresh look at Google’s Cloud by Mandy Waite
A fresh look at Google’s Cloud by Mandy Waite A fresh look at Google’s Cloud by Mandy Waite
A fresh look at Google’s Cloud by Mandy Waite Codemotion
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann
 
Confluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with ReplyConfluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with Replyconfluent
 
Cloud is the new normal - Red Hat Forum Bangalore 2015
Cloud is the new normal - Red Hat Forum Bangalore 2015Cloud is the new normal - Red Hat Forum Bangalore 2015
Cloud is the new normal - Red Hat Forum Bangalore 2015Red Hat India Pvt. Ltd.
 
Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...HostedbyConfluent
 
ACdP Fiware.pdf
ACdP Fiware.pdfACdP Fiware.pdf
ACdP Fiware.pdfMASSAL3
 

Ähnlich wie Hybrid Streaming Analytics for Apache Kafka Users | Firat Tekiner, Google (20)

Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
 
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
 
Gimel and PayPal Notebooks @ TDWI Leadership Summit Orlando
Gimel and PayPal Notebooks @ TDWI Leadership Summit OrlandoGimel and PayPal Notebooks @ TDWI Leadership Summit Orlando
Gimel and PayPal Notebooks @ TDWI Leadership Summit Orlando
 
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
 
Hybrid data lake on google cloud with alluxio and dataproc
Hybrid data lake on google cloud  with alluxio and dataprocHybrid data lake on google cloud  with alluxio and dataproc
Hybrid data lake on google cloud with alluxio and dataproc
 
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
 
GCP for Apache Kafka® Users: Stream Ingestion and Processing
GCP for Apache Kafka® Users: Stream Ingestion and ProcessingGCP for Apache Kafka® Users: Stream Ingestion and Processing
GCP for Apache Kafka® Users: Stream Ingestion and Processing
 
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdf
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdfData & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdf
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023].pdf
 
Introduction to GCP Data Flow Presentation
Introduction to GCP Data Flow PresentationIntroduction to GCP Data Flow Presentation
Introduction to GCP Data Flow Presentation
 
Introduction to GCP DataFlow Presentation
Introduction to GCP DataFlow PresentationIntroduction to GCP DataFlow Presentation
Introduction to GCP DataFlow Presentation
 
Big Data and ML on Google Cloud
Big Data and ML on Google CloudBig Data and ML on Google Cloud
Big Data and ML on Google Cloud
 
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
 
Building what's next with google cloud's powerful infrastructure
Building what's next with google cloud's powerful infrastructureBuilding what's next with google cloud's powerful infrastructure
Building what's next with google cloud's powerful infrastructure
 
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
Live Demo Jam Expands: The Leading-Edge Streaming Data Platform with NiFi, Ka...
 
A fresh look at Google’s Cloud by Mandy Waite
A fresh look at Google’s Cloud by Mandy Waite A fresh look at Google’s Cloud by Mandy Waite
A fresh look at Google’s Cloud by Mandy Waite
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
Confluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with ReplyConfluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with Reply
 
Cloud is the new normal - Red Hat Forum Bangalore 2015
Cloud is the new normal - Red Hat Forum Bangalore 2015Cloud is the new normal - Red Hat Forum Bangalore 2015
Cloud is the new normal - Red Hat Forum Bangalore 2015
 
Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...
 
ACdP Fiware.pdf
ACdP Fiware.pdfACdP Fiware.pdf
ACdP Fiware.pdf
 

Mehr von HostedbyConfluent

Renaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit LondonRenaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit LondonHostedbyConfluent
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at TrendyolEvolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at TrendyolHostedbyConfluent
 
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking TechniquesEnsuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking TechniquesHostedbyConfluent
 
Exactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and KafkaExactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and KafkaHostedbyConfluent
 
Fish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit LondonFish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit LondonHostedbyConfluent
 
Tiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit LondonTiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit LondonHostedbyConfluent
 
Building a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And WhyBuilding a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And WhyHostedbyConfluent
 
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...HostedbyConfluent
 
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...HostedbyConfluent
 
Navigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka ClustersNavigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka ClustersHostedbyConfluent
 
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data PlatformApache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data PlatformHostedbyConfluent
 
Explaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy PubExplaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy PubHostedbyConfluent
 
TL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit LondonTL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit LondonHostedbyConfluent
 
A Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSLA Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSLHostedbyConfluent
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing PerformanceMastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing PerformanceHostedbyConfluent
 
Data Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and BeyondData Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and BeyondHostedbyConfluent
 
Code-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink AppsCode-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink AppsHostedbyConfluent
 
Debezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC EcosystemDebezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC EcosystemHostedbyConfluent
 
Beyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local DisksBeyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local DisksHostedbyConfluent
 

Mehr von HostedbyConfluent (20)

Renaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit LondonRenaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit London
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at TrendyolEvolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at Trendyol
 
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking TechniquesEnsuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
 
Exactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and KafkaExactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and Kafka
 
Fish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit LondonFish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit London
 
Tiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit LondonTiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit London
 
Building a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And WhyBuilding a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And Why
 
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
 
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
 
Navigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka ClustersNavigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka Clusters
 
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data PlatformApache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
 
Explaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy PubExplaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy Pub
 
TL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit LondonTL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit London
 
A Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSLA Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSL
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing PerformanceMastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
 
Data Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and BeyondData Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and Beyond
 
Code-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink AppsCode-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink Apps
 
Debezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC EcosystemDebezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC Ecosystem
 
Beyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local DisksBeyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local Disks
 

Kürzlich hochgeladen

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 

Hybrid Streaming Analytics for Apache Kafka Users | Firat Tekiner, Google

  • 1. © 2021 Google LLC. All rights reserved. Hybrid Streaming Analytics for Apache Kafka Users Firat Tekiner (ftekiner@google.com) EMEA Data Analytics Practice Lead
  • 2. © 2021 Google LLC. All rights reserved. On-premises or Other Cloud Hybrid Kafka Reference Architecture Dataflow BigQuery Cloud Storage Data Studio Cloud Functions AI Platform Bigtable Confluent Replicator KSQL App App DataStore MySQL HDFS Teradata, Netezza Mainframe App App
  • 3. © 2021 Google LLC. All rights reserved. Business is transforming Businesses have to anticipate and act on risks and opportunities faster than ever before The data and events needed for analysis are increasing in velocity, volume, and type Companies that are able to quickly identify and capitalize on insights within this changing landscape have a strategic advantage.
  • 4. © 2021 Google LLC. All rights reserved. Why Enterprises choose Google Cloud for Streaming Analytics Serverless Architecture Robust ingestion services Unified batch and stream processing Comprehensive set of analysis tools Flexibility for users
  • 5. © 2021 Google LLC. All rights reserved. Serverless data analytics From infrastructure to platform for insights Performance tuning Monitoring Reliability Deployment & configuration Utilization improvements The traditional data analytics platform Analysis and insights Resource provisioning Handling growing scale Analysis and insights The serverless data analytics model
  • 6. © 2021 Google LLC. All rights reserved. Right-time Action Dashboard Visualize and share anomalous events in your data. Alerts Manage by exception through condition- based notifications. Actions Automatically trigger workflows in other systems using conditions. 1 2 3 Looker Blocks
  • 7. © 2021 Google LLC. All rights reserved. Comprehensive set of analysis tools BigQuery Cloud Data Warehouse Easy setup Directly integrated with streaming Dataflow and Confluent Cloud Real time Fast insights and action powered by BigQuery’s Streaming API Intelligent Built-in ML for out-of-the- box predictive insights Cloud AI Platform AI & ML Tools Plug-and-play Easily experiment and collaborate with Google’s AI Hub Building blocks Tools for sight, language, conversation, and structured data Fast deployment Code-based AI platform quickly moves ML ideas to deployment Tensorflow Extended (TFX)
  • 8. © 2021 Google LLC. All rights reserved. Improve the customer experience with Real-time AI TFX uses Dataflow and Apache Beam as the distributed data processing engine to enable several aspects of the ML life cycle, all supported with CI/CD for ML through Kubeflow pipelines. Predictive Analytics Fraud Detection Real-time Personalization More!
  • 9. Proprietary + Confidential © 2021 Google LLC. All rights reserved. Data Analytics & Management Google Cloud Smart Analytics & AI Prebuilt ML APIs Foundation AI Platform AutoML AI Solutions Language Conversation Horizontal solutions Structured Data Language Frameworks Compute Contact Center AI Ingestion and Processing Storage and Analytics Orchestration Notebooks Industry solutions Data Labeling Training Prediction Continuous evaluation Explainability Pipelines Compute Engine Cloud TPU Cloud GPU Cloud scheduler Cloud Composer Instrumentation Cloud Build Container Registry Cloud Pub/Sub Cloud Dataflow Cloud Dataproc Data Fusion Cloud Storage BigQuery Cloud Bigtable Cloud SQL Data Catalog Data Studio Data Science and Machine Learning Sight Sight Vision Video Translate Natural Language Tables Video Intelligence Vision Natural Language Translate Speech-to-Text Text-to-Speech Document AI Dialogflow Talent Solution Recommendation AI
  • 10. © 2021 Google LLC. All rights reserved. Flexibility for users Apache Beam Open-source, unified model and set of SDKs for defining and executing data processing Open source programming model Serves as the SDK for creating Cloud Dataflow jobs; community development increases flexibility Choose your language Java, Python, Scala, and GO are available; join DA Spotlight for news on languages Portability Program in Beam, and gain the ability to move between Spark, Flink, Dataflow, and more Dataflow Simplified stream and batch data processing Batch and Stream Reduce complexity and reuse code by driving batch and stream workloads from the same tool Reliable and consistent processing Exactly once processing with built-in support for fault-tolerant execution Simplified operations & management Performance, scaling, availability, security, and compliance handled automatically Integrated Integration with Kafka/Confluent Cloud, the Google Data Analytics suite, and GCP broadly Unified stream and batch processing
  • 11. © 2021 Google LLC. All rights reserved. Ingest Transform Analyze Ingest and distribute data reliably Fast, correct computations quickly and simply Machine learning & data warehouse Cloud Dataflow Cloud ML Pub/Sub BigQuery Dataflow Flexible stream analytics with OSS KSQL
  • 12. © 2021 Google LLC. All rights reserved. Google Cloud has an end-to-end, fully- managed Stream Analytics offering Pub/Sub (Messaging) Confluent Kafka (Messaging)* BigQuery Streaming API IoT Core Collect Data Catalog (Metadata Management) & Composer (Workflow Orchestration) Dataflow (Beam Streaming) Dataproc (Spark Streaming and Flink) Dataform Kubernetes Process BigQuery Bigtable AI Platform + TFX Integration Databases (e.g. Cloud SQL, Spanner) Store and Analyze Looker Apigee Firebase Activate Cloud Functions * Partner Solution
  • 13. © 2021 Google LLC. All rights reserved. A platform for all users and intents throughout the data lifecycle Fine-grained access control Cloud IAM Metadata management Data Catalog Always encrypted Data at rest and in transit Redact sensitive data Cloud DLP Security Admin Protecting data Messaging PubSub Data Processing Dataflow Data Apps Looker (LookML) OSS Engines Dataproc (Spark, Flink) Developer Intelligent apps DW & DB BigQuery , BigTable Data processing (OSS) pipelines Dataproc (Spark, Presto, Flink) Data Processing (Native) pipelines Dataflow Orchestration Composer Data engineer Get clean, useful data Messaging PubSub or Confluent Kafka CDW BigQuery CDW & Orchestration BigQuery Visual data Integration Data Fusion ML in SQL BigQuery ML Data models, catalog Looker, Data Catalog Data analyst Query and analyze Ingestion BigQuery Streaming & DTS Governed BI Looker CDW in a Spreadsheet Connected Sheets Natural Language Query Data QnA Business User Insights Everywhere Data models, catalog Looker, Data Catalog CDW BigQuery Portable notebooks AI Platform Notebooks Simplified ML BigQuery ML & Auto ML Collaboration Feature Store, AI Platform Pipelines Spark Dataproc Data scientist Models that work CDW BigQuery Secure data sharing BigQuery
  • 14. © 2021 Google LLC. All rights reserved. Real-time Analytics GCP Approach Event Collect Process Store and Analyze Activate BigQuery Looker Event stream / Integration Pub/sub Dataflow IoT Core Analytics Low Latency, Time Series Bigtable Apigee Firebase Apigee Firebase Monetization Cloud Logging ... Templates AI Platform Continuous Intelligence Edge Manager for ML ML at the Edge App Activation
  • 15. © 2021 Google LLC. All rights reserved. Real-time Analytics GCP Simplified Approach Event Collect Process Store and Analyze Activate BigQuery Looker Streaming API ELT (Dataform) Materialized Views BQML BI Engine Data Studio Apigee Connected Sheets Event stream / Integration
  • 16. © 2021 Google LLC. All rights reserved. Real-time Analytics Open and Partner Approach Event Collect Process Store and Analyze Activate Dataproc Streaming BigQuery 3rd Party BI and activation tools ... ...
  • 17. © 2021 Google LLC. All rights reserved. Options Hybrid ● Accessing Kafka on-prem directly from GCP ● Kafka replication (on-prem to GCE or Confluent Cloud’s GCP marketplace offering) Lift and Shift ● Confluent Cloud’s fully managed Kafka (Marketplace offering) – Connectors available to BigQuery, Cloud Storage, Pub/Sub, MongoDB Atlas, etc – Clustering, SLAs, etc ● Self-managing Kafka on GCE GCP Integration ● Pre-Built Dataflow Flex ● Kafka to BigQuery template ● Using Kafka Connect ● To push to Google BigQuery. Supported by Confluent and WePay ● To push to Google Cloud Pub/Sub. Supported by Google ● Fivetran, Confluent ... How do we deploy Kafka or integrate it with the rest of the GCP stack?
  • 18. © 2021 Google LLC. All rights reserved. On-prem Hybrid: Access Kafka on-prem from GCP Gateway Google Cloud Interconnect & VPN Gateway Kafka Cluster Analysis Cloud Dataflow Analysis Compute Engine Analysis Cloud Dataproc
  • 19. © 2021 Google LLC. All rights reserved. On-prem Hybrid: Replicate Kafka on-prem to GCP Gateway Google Cloud Interconnect & VPN Gateway Kafka Cluster Kafka Self Managed Cluster Compute Engine Analysis Cloud Dataflow Analysis Compute Engine Kafka Connect Kafka Connect Replicator Analysis Cloud Dataproc
  • 20. © 2021 Google LLC. All rights reserved. On-prem Lift and Shift: Confluent Cloud’s Kafka on GCP Analysis Cloud Dataflow Analysis Compute Engine Analysis Cloud Dataproc Confluent Cloud Managed by Confluent Kafka Cluster Customer Project Internet Private network
  • 21. © 2021 Google LLC. All rights reserved. On-prem Lift and Shift: Self-managing Kafka on GCP Gateway Google Cloud Interconnect & VPN Gateway Kafka Self Managed Cluster Compute Engine Analysis Cloud Dataflow Analysis Compute Engine Analysis Cloud Dataproc
  • 22. © 2021 Google LLC. All rights reserved. GCP Integration: Using Dataflow Template Kafka to BQ Dataflow Template Table BigQuery Kafka Compute Engine
  • 23. © 2021 Google LLC. All rights reserved. On-prem GCP Integration: Using Kafka Connect Gateway Google Cloud Interconnect & VPN Gateway Analysis Cloud Dataflow Kafka Connect Cloud Pub/Sub Connector Kafka Topic Cloud Pub/Sub Kafka Topic Dest. BigQuery Kafka Connect BigQuery Connector Internet Private Network Supported by Google Supported by Confluent and WePay Analysis Cloud BigQuery
  • 24. © 2021 Google LLC. All rights reserved. Comparing it to Google Cloud Pub/Sub Self-managed Kafka ● Open source ● Set up your own auth to protect your Kafka ● You must provision and plan for load isolation ● You must support it ● You must infer costs based on variety of capacity and availability patterns, buy components (rather than pay for usage): CPU, disk, network ● You must design and maintain your own replication and backup setup ● Can be used as a system of record, messages re- read from beginning — new subscribers can read from start (depending on retention policy) ● Order guarantees within a partition ● Large platform of streaming tools — KSQL, Schema Registry, Connectors to/from data sources Cloud Pub/Sub ● GCP only; however, the API can be emulated on a Kafka server on-prem ● GCP IAM integration ● 24-hour on-call support, SLAs from Google, and integrated monitoring with Stackdriver ● Transparent replication and backups for high availability and durability ● Predictable bandwidth-based billing ● Global presence: Pub/Sub is already deployed in all GCP data centers for consistent latency and high availability. Today, only global is possible. ● Single service: You only worry about managing topics and subscribers, rather than clusters ● At least once delivery