How Tencent Applies
Apache Pulsar to Apache InLong
2022/1/15
dockerzhang(张超)
About Me
• Apache InLong (incubating) PPMC
• Apache Pulsar Contributor & KoP Maintainer
• Multiple Big Data & Cloud Native projects contributor
Contents
• About Apache InLong
• Apache InLong + Pulsar
• The Use Cases of Apache InLong
About Apache InLong
The Users of InLong
About Apache InLong
The History of Apache InLong
[Chart: average daily data reporting volume (unit: 100 million records/day)]
2013-06: 200
2014-06: 6201
2015-06: 25052
2016-06: 54905
2017-06: 91096
2018-06: 138718
2019-06: 227360
2020-06: 333029
2020-12: 457552
2021-07: 663285
Milestones marked on the chart: TubeMQ open source, rename to InLong.
About Apache InLong
What is Apache InLong
Apache InLong (incubating) is a one-stop data integration framework that provides automatic, secure, and reliable data
transmission capabilities. InLong supports both batch and stream data processing, which makes it well suited to
building data analysis, modeling, and other real-time applications based on streaming data.
• Ease of Use
  • Data can be reported, transferred, and distributed easily and quickly.
• Stability & Reliability
  • Delivers high-performance processing for 10-trillion-level data streams.
• Comprehensive Features
  • Integrates with different types of message queue (MQ) services and provides real-time extract, transform, and load (ETL) and sorting capabilities.
• Scalability
  • Adopts a pluggable architecture that lets you plug modules into the system based on specific protocols.
About Apache InLong
The Architecture of InLong
[Architecture diagram: data flows through five stages, coordinated by the Manager]
• Ingest: SDK, File, HTTP, DB
• Converge: DataProxy
• Cache: TubeMQ, Pulsar, Kafka
• Sort: Real-time, Offline, SDK
• Storage: Hive, Iceberg, HBase, ClickHouse
• Manager: OpenAPI, Metadata, Authority, Scheduler, Naming Service, Audit, Monitor, Cluster
Apache InLong + Pulsar
The Pulsar Data Stream
[Diagram: two example streams in InLong Group 1 pass through DataProxy into a Pulsar cluster, and Sort writes each one to its own table.]
• people stream (Smith | 24, Jones | 33, Kevin | 19, ……) → topic tenant/group1/people → People table: (Smith, 24), (Jones, 33), (Kevin, 19)
• stocks stream (Apple | 175.64, AT&T | 24.78, Tesla | 908.87, ……) → topic tenant/group1/stocks → Stocks table: (Apple, 175.64), (AT&T, 24.78), (Tesla, 908.87)
• InLong Stream: a data stream; each stream has a specific flow direction.
• InLong Group: a data stream group; it contains multiple data streams.
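The people and stocks example on this slide can be sketched in a few lines of Python. This is only an illustration of the idea, not InLong Sort itself: `parse_record` and `route_to_tables` are hypothetical helper names, and the sketch assumes every record is a `name | value` pair and that the last path segment of the topic names the target table.

```python
from typing import Dict, Iterable, List, Tuple

Row = Tuple[str, float]


def parse_record(payload: str) -> Row:
    """Split a 'name | value' record into a (name, value) row."""
    name, value = (field.strip() for field in payload.split("|", 1))
    return name, float(value)


def route_to_tables(messages: Iterable[Tuple[str, str]]) -> Dict[str, List[Row]]:
    """Group parsed rows by stream, using the last segment of the topic path."""
    tables: Dict[str, List[Row]] = {}
    for topic, payload in messages:
        stream = topic.rsplit("/", 1)[-1]  # e.g. "people" from "tenant/group1/people"
        tables.setdefault(stream, []).append(parse_record(payload))
    return tables


messages = [
    ("tenant/group1/people", "Smith | 24"),
    ("tenant/group1/stocks", "Apple | 175.64"),
    ("tenant/group1/people", "Jones | 33"),
]
tables = route_to_tables(messages)
```

Both streams share one group, so a single pass over the group's messages is enough to fill the people and stocks tables.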
Apache InLong + Pulsar
Why Choose Pulsar?
Comparison | TubeMQ | Kafka | Pulsar
Latency | Very low, 10 ms | Low, 250 ms | Very low, 10 ms
TPS | High, 140K+/s | Normal, 100K+/s | High, 140K+/s
Filter consume | Supports client filter or server filter | Supports client filter | Supports client filter
Data | No copies | Multiple copies | Multiple copies
Reliability | Relies on RAID 10 | Low | High, autorecovery
Stability | High, running in Tencent for almost 7 years with 33 trillion messages per day | Unstable as the number of topics grows | High
Client language support | Java or C++ | 1 client (official support) | 7 kinds of client
CAP model | AP | AP or CP | CP or AP
Apache InLong + Pulsar
KoP (Kafka on Pulsar) Replaces Kafka
• Migrate the Kafka business
• The first team to put KoP into a production environment
• 2 KoP maintainers
[Diagram: Kafka producers and InLong DataProxy send messages to a Pulsar cluster (brokers running KoP, backed by bookies); Kafka consumers and InLong Sort read messages from it.]
Apache InLong + Pulsar
Pulsar Auto Disaster Tolerance For InLong
[Diagram: a Monitor checks Pulsar Cluster1 and Pulsar Cluster2; a primary producer and a failover producer write to the clusters, and a consumer reads from them.]
Procedures:
1. Initialize two producers, one for each cluster.
2. Only one producer is active at a time.
3. Change the active producer based on the errors the Monitor sees inside a time window.
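The three steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the InLong implementation: `FailoverSender` is a hypothetical class, the two producers are plain callables standing in for real Pulsar producers, and the switch rule (a fixed error count inside a sliding window) is an assumed policy that the slide does not spell out.

```python
import time
from collections import deque
from typing import Callable, Deque, List

Producer = Callable[[bytes], None]  # stand-in for a real producer's send call


class FailoverSender:
    """Holds two ready producers and sends through only the active one."""

    def __init__(self, primary: Producer, failover: Producer,
                 max_errors: int = 3, window_seconds: float = 60.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._producers: List[Producer] = [primary, failover]  # step 1
        self._active = 0                                       # step 2
        self._max_errors = max_errors
        self._window = window_seconds
        self._clock = clock
        self._errors: Deque[float] = deque()

    @property
    def active_index(self) -> int:
        return self._active

    def _record_error(self) -> None:
        """Step 3: count errors inside the window and switch when there are too many."""
        now = self._clock()
        self._errors.append(now)
        while self._errors and now - self._errors[0] > self._window:
            self._errors.popleft()  # forget errors that fell out of the window
        if len(self._errors) >= self._max_errors:
            self._active = 1 - self._active
            self._errors.clear()

    def send(self, message: bytes) -> bool:
        """Send via the active producer; return False if the send failed."""
        try:
            self._producers[self._active](message)
            return True
        except Exception:
            self._record_error()
            return False
```

In this sketch a failed message is not retried; the caller decides whether to resend it once the sender has switched clusters.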
Apache InLong + Pulsar
Pulsar Multi Tenancy for InLong Data Stream
persistent://tenant/namespace/topic
• tenant = business
• namespace = InLong group
• topic = InLong stream
• InLong Stream: a data stream; each stream has a specific flow direction.
• InLong Group: a data stream group; it contains multiple data streams.
[Diagram: a "school" example with students and teachers, which are written to the students table and the teachers table.]
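The mapping on this slide (business to tenant, InLong group to namespace, InLong stream to topic) can be written as a small helper. This is a sketch only: `pulsar_topic` is a hypothetical function name, and the check that rejects empty parts or parts containing "/" is an assumption added for the example.

```python
def pulsar_topic(business: str, group_id: str, stream_id: str,
                 persistent: bool = True) -> str:
    """Build a Pulsar topic name of the form {scheme}://tenant/namespace/topic."""
    for part in (business, group_id, stream_id):
        if not part or "/" in part:
            raise ValueError(f"invalid topic component: {part!r}")
    scheme = "persistent" if persistent else "non-persistent"
    # business -> tenant, InLong group -> namespace, InLong stream -> topic
    return f"{scheme}://{business}/{group_id}/{stream_id}"


people_topic = pulsar_topic("tenant", "group1", "people")
stocks_topic = pulsar_topic("tenant", "group1", "stocks")
```

Because every stream of a group lives in one namespace, settings applied at the namespace level cover the whole InLong group.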
Apache InLong + Pulsar
InLong Data Audit Using Pulsar
• Separate audit data stream
• No data loss
[Diagram: InLong Agent, InLong DataProxy, and InLong Sort each embed the AuditSDK and report to the Audit Proxy, which writes to Pulsar; AuditDds stores the audit data in MySQL, ES, and HDFS; the Audit Report is available by minute, hour, and day.]
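One way to picture the audit flow is as a reconciliation of per-minute counts reported by each module. The sketch below is an assumption-heavy illustration, not the AuditSDK API: `AuditRecord`, `aggregate`, and `find_gaps` are hypothetical names, and comparing the Agent's counts against Sort's counts is just one possible check.

```python
from collections import Counter
from typing import Dict, Iterable, NamedTuple, Tuple


class AuditRecord(NamedTuple):
    module: str         # e.g. "agent", "dataproxy", "sort"
    stream: str         # InLong stream the count belongs to
    epoch_seconds: int  # time at which the messages were handled
    count: int          # number of messages handled


def minute_bucket(epoch_seconds: int) -> int:
    """Round a timestamp down to the start of its minute."""
    return epoch_seconds - epoch_seconds % 60


def aggregate(records: Iterable[AuditRecord]) -> Counter:
    """Sum counts per (stream, minute, module)."""
    totals: Counter = Counter()
    for r in records:
        totals[(r.stream, minute_bucket(r.epoch_seconds), r.module)] += r.count
    return totals


def find_gaps(totals: Counter, upstream: str = "agent",
              downstream: str = "sort") -> Dict[Tuple[str, int], int]:
    """Return, per (stream, minute), how many upstream messages never arrived downstream."""
    gaps: Dict[Tuple[str, int], int] = {}
    for (stream, minute, module), sent in totals.items():
        if module != upstream:
            continue
        received = totals.get((stream, minute, downstream), 0)
        if received != sent:
            gaps[(stream, minute)] = sent - received
    return gaps
```

Hourly and daily reports could be built the same way by changing the bucket size.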
Apache InLong + Pulsar
InLong Contributions to Pulsar
• 6 contributors
• 60+ Pulsar PRs
• 50+ KoP PRs
The Use Cases of Apache InLong
Tencent Ads
• Background
  • Advertisers' account statements are used as data input for analysis or reconciliation. The inputs are mainly binlogs from MySQL.
• Features used in InLong:
  • Low latency: no more than 10 ms
  • No data loss
  • Massive consumers: thousands of consumers for one topic
  • Massive data: over 100 billion/day
[Diagram components: Binlog, InLong DB Agent, InLong DataProxy, Pulsar, Flink, Pulsar Client, InLong Sort, Druid, Hive.]
The Use Cases of Apache InLong
Tencent Security Platform
• Background
  • As business moves to the cloud, the number of security agents keeps growing. If a particular module becomes abnormal, the volume of backend data can skyrocket and cause an avalanche. A transmission layer is needed to act as a "barrier" that softens the impact on the system.
• Features used in InLong:
  • No data loss
  • Massive agents: over 1 million agents
[Diagram components: ten Security Agents, InLong DataProxy, Pulsar, Flink, InLong Sort, Hive.]
THANKS
https://inlong.apache.org
https://github.com/apache/incubator-inlong
Email:dockerzhang@tencent.com

How Tencent Applies Apache Pulsar to Apache InLong - Pulsar Summit Asia 2021


Editor's notes

  1. April 11, 2021: the community rename to Apache InLong was completed. September 12, 2019: TubeMQ was open-sourced and donated to the Apache community.