SlideShare a Scribd company logo
1 of 36
Open Source Real Time BI using 
Storm, Hadoop, Titan, Druid & D3 
Anil Madan 
Sr. Director Engineering, PayPal 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
$1 in every $6 
Spent on e-commerce is 
spent through PayPal.*
Creating Tomorrow’s 
Mobile Payment 
Experiences 
25 countries with live PayPal 
fingerprint authentication 
on Samsung devices.
Helping Developers 
Innovate & Monetize 
New Mobile Apps 
Braintree launches its new API, including Pay with 
PayPal.
PayPal Now Available in 203 Markets 
10 new markets added in the second quarter, 
making PayPal available to 80 million new internet users. 
Paraguay 
Côte d’Ivoire 
Nigeria 
Monaco 
Belarus 
Montenegro 
Moldova 
Macedonia 
Cameroon 
Zimbabwe
How can we 
help them to 
complete their 
1st payment? 
Business Problem 
Acquisition Awareness Activation Adoption 
Where do 
prospects 
sign up for 
accounts? 
How do 
prospective 
customers 
learn about 
PayPal? 
How can we 
help them use 
PayPal even 
more? 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. We need to better understand our customers…
How we solved it… 
Tracking Servers 
Mobile 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 
Direct/Home 
Page 
Product 
Experiences 
Search Engine 
Marketing 
Transaction 
Emails 
Tracking Metadata 
Tool 
Taxonomy 
Tracking Event 
Service 
Tag 
Catalog 
Tracking Validation 
Service 
Real Time Systems 
Marketing 
Segmentation 
Experimentation 
Metadata 
Big Data 
Exploratory Analytics Attribution Predictive Analytics
Metadata Instrumentation Collection Processing Analytics 
Server Side 
Events 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 
Pathing 
Store 
DRUID 
Metrics 
Store 
Reporting & 
Visualization 
Logical View 
Client Side 
Events 
Page 
Performance 
Events 
Collection 
Service 
Sessionization 
Behavioral 
Metrics 
Marketing 
Metrics 
Performance 
Metrics 
Operational Metrics (OpenTSDB) 
Real Time 
Event 
Metrics
Metadata –Logical Entity Model 
TEMPLATE PAGE 
COMPONENTS 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 
LINK 
TAGS
Metadata – Logical Event Model 
Impression 
Event 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 
Tracking 
Event 
Reaction 
Event 
Component 
Impression 
Event 
Ad 
Impression 
Event 
Click 
Event 
Click-Through 
Event 
Mouse-over 
Event 
Entry 
Event 
Exit 
Event 
Outcome 
Event 
Page 
Impression 
Event 
Client Page 
Impression 
Event 
Server Page 
Impression 
Event
Metadata - Self-Service Management Workflow… 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 11
DATA PIPELINE 
Processing Analysis & 
Customers 
Client Visualization 
Side 
Metadata 
HTTP 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 
Performance 
Collection 
Metrics 
Tools 
REST 
Spout 
Bot 
flagging 
Bolt 
Sessionization Aggregation 
R 
E 
S 
Proxy T 
Server 
Side 
Geo 
Enrichment 
Bolt R 
e 
p 
o 
r 
ti 
n 
g 
Data Stores 
Druid 
Apache 
Titan 
Developers 
Product Owners 
Meta 
data 
Reporting 
Consumers 
Metadata 
Service
Druid Architecture 
• Open-source 
• Distributed 
• Real-time 
• Highly-Available Data store 
• Column-oriented 
• Approximate or Exact 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
Real Time Nodes 
• Ingest data and buffer events in 
memory 
• Incremental indexing 
• Query data as soon as it is 
ingested 
• Periodically persist collected 
events to disk 
• Combine multiple disk indexes 
to create immutable ‘segments’ 
• Log-structured merge-tree 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 14
Druid Architecture 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
Historical Nodes 
• Load immutable read-optimized data 
from deep storage 
• Memory mapped storage engine 
• Caches segments 
• Supports tiered storage 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 16
Druid Architecture 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
Druid Systems Overview 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 18
Metrics & Dimensions 
"type": "doubleSum", 
"name": "pageviews", 
"fieldName": "PV" 
}, 
{ 
"type": "doubleSum", 
"name": "bounces", 
"fieldName": "bnc" 
}, 
.... 
{ 
"type": "hyperUnique", 
"name": "unique_visits", 
"fieldName": "user_session_guid" 
}, 
{ 
"type": "hyperUnique", 
"name": "unique_visitors", 
"fieldName": "user_guid" 
} 
2014/06/11/10", 
"filter": "part-", 
"parser": { 
"type": "string", 
"timestampSpec": { 
"column": "timestamp", 
"format": "auto" 
}, 
"data": { 
"format": "json", 
"dimensions": [ 
"timestamp", 
"USER_GUID", 
"USER_SESSION_GUID", 
"PAGE_GROUP", 
"PAGE_NAME", 
"PAGEGROUP_LINK_NAME", 
"PAGE_LINK_NAME", 
… 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 19 
Standard 
Metrics 
Estimated 
Metrics 
HyperLogLog 
Dimensions
Sessionization 
Events VisitContainer 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 20 
Visitor 
ID 
Session 
ID 
Timestamp Event 
Payload 
V1 S1 2014-10-16 
05:12 
E1 
V2 S2 2014-10-16 
05:14 
E2 
V1 S1 2014-10-16 
05:15 
E3 
V1 S1 2014-10-16 
05:20 
E4 
V2 S2 2014-10-16 
05:21 
E5 
V1 S3 2014-10-16 
05:25 
E6 
… … … … 
Visitor 
ID 
Session 
ID 
Payload 
V1 S1 sf, mac, {flash, quicktime}, {ca, 
usa}, 480 secs,…. 
E1 
E3 
E4 
V2 S2 ff, win, {acrobat, mediaplayer}. 
{wb, in}, 420 secs….. 
E2 
E5 
V1 S3 sf, mac, {quicktime, java}, {on, ca}, 
60 secs 
E6
Druid Storage – Columns & Dictionaries 
Timestamp (Hr) Sessi 
on 
ID 
Country OS User 
Agent 
Page Name 
Page Name 
0 
1 
2014-10-16 05 S1 US MAC SF Login 
AccountOverview 
0 
2 
3 
0 
2 
4 
0 
5 
4 
0 
5 
2014-10-16 05 S2 DE WIN IE Login 
PaymentReview 
AccountHistory 
2014-10-16 05 S3 US LNX FF Login 
PaymentReview 
Checkout 
2014-10-16 05 S4 UK LNX FF Login 
Profile 
Checkout 
2014-10-16 05 S5 DE WIN CR Login 
Profile 
0 
1 
4 
2014-10-16 05 S6 UK MAC SF Login 
AccountOverview 
Checkout 
Dictionary 
Login 0 
AccountOvervie 
w 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 21 
1 
PaymentReview 2 
AccountHistory 3 
Checkout 4 
LZF Profile 5
Druid Data Structure - Bitmap Indices 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 22
Herald – Self Service Analytics 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 23
Herald – Self Service Analytics 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 24
Druid Metrics 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 25
Pathing 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 26 
Enter
Fallout Reports 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 27
Pathing 
A->B->C->D->X->A->M and A->B->C->D->E 
Visitor ID Current Page Next Page 1 Next Page 2 Prev Page 1 Prev Page 2 
S1 A B C null null 
S1 B C D A null 
S1 C D X B A 
S1 D X A C B 
S1 X A M D C 
S1 A M null X D 
S1 M Null null A X 
S2 A B C null Null 
S2 B C D null A 
S2 C D E B A 
S2 D E Null C B 
S2 E Null null D C 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 28
Pathing 
Next Page 
{ 
“queryType” : “groupBy” 
“dimensions” : (“current_page”, “dimensions like country, segmentation etc”} 
“aggregations” : [ 
{ “type”: “count”, “name”: “next_page_count”, “fieldname” : “next_page, next_page2” }] 
“filter”: { “type”: “selector”, “dimension”: “current_page”, “value”: “C” } 
} 
Previous Page 
{ 
“queryType” : “groupBy” 
“dimensions” : {“current_page”, “dimensions like country, segmentations etc”} 
“aggregations” : [ 
{ “type”: “count”, “name”: “prev_page_count”, “fieldname” : “prev_page1, prev_page2” }] 
“filter”: { “type”: “selector”, “dimension”: “current_page”, “value”: “C” } 
} 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 29
A->B->C->D->X->A->M 
A->D-> X->M 
“queryType” : “search” 
“dimensions” : { “current_page_path_count”, “dimensions like country, segmentation 
etc”} 
“filter”: { “type”: “regex”, “dimension”: “next_page_path”, “pattern”: “^A*D*X*M$” } 
} 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 30 
Fallout 
• Apply them to the dictionary 
• Figure out the values that match 
• Take those bitmap indices 
• OR the bitmap indices together 
• Use the output bitmap as the filter
Model View 
Controller 
Directives NVD3 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 31 
CLIENT SERVER 
Herald Architecture
SSO 
Druid 
Herald Deployment 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 32
Adhoc Graph Analytics 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 33 
Name: 
Login_20141 
01611 
Country: US 
Count: 15 
Name: 
AccountOver 
view_201410 
1611 
Name: 
PaymentRevi 
ew_ 
2014101611 
Name: 
Checkout_20 
14101611 
Country: US 
Count: 5 
Country: US 
Count: 5 
Country: US 
Count: 10 
5 
8 
7 
6
Name: 
Login_2014 
101611 
Country: US 
Count: 15 
Name: 
AccountOv 
erview_201 
4101611 
Name: 
PaymentRe 
view_2014 
101611 
Name: 
Checkout_ 
201410161 
1 
Country: US 
Count: 5 
6 
Country: US 
Count: 5 
7 
Country: US 
Count: 10 
5 
8 
gremlin> g.v(‘Name’, ‘Login_2014101611'). 
as('x’). 
outE.inV.loop('x') 
{it.loops < 4} 
{it.object.getProperty('name') == 
'Checkout_2014101611'}.path 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 34
Summary 
• Problem 
• Understand our customer behavior 
• Across disparate channels & experiences 
• Solution 
• Democratize data 
• Consistent standardized metadata 
• Disciplined instrumentation 
• Distributed scalable backend for adhoc & interactive analytics 
• Self-service BI through modern visualization tools 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 35
Questions ? 
© 2014 PayPal Inc. All rights reserved. Confidential and proprietary.

More Related Content

What's hot

Kafka Summit SF 2017 - DNS for Data: The Need for a Stream Registry
Kafka Summit SF 2017 - DNS for Data: The Need for a Stream RegistryKafka Summit SF 2017 - DNS for Data: The Need for a Stream Registry
Kafka Summit SF 2017 - DNS for Data: The Need for a Stream Registryconfluent
 
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...StampedeCon
 
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...Big Data Spain
 
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful StreamsKafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streamsconfluent
 
Building a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data PipelineBuilding a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data PipelineDataWorks Summit
 
NoSQL no more: SQL on Druid with Apache Calcite
NoSQL no more: SQL on Druid with Apache CalciteNoSQL no more: SQL on Druid with Apache Calcite
NoSQL no more: SQL on Druid with Apache Calcitegianmerlino
 
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data TechBig Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data TechHostedbyConfluent
 
Druid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiDruid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiBrian Olsen
 
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!Tugdual Grall
 
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardDelta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardParis Data Engineers !
 
Real-time Distributed Stream Processing @ Scale
Real-time Distributed Stream Processing@ ScaleReal-time Distributed Stream Processing@ Scale
Real-time Distributed Stream Processing @ ScaleJerome Boulon
 
Perfecting Your Streaming Skills with Spark and Real World IoT Data
Perfecting Your Streaming Skills with Spark and Real World IoT DataPerfecting Your Streaming Skills with Spark and Real World IoT Data
Perfecting Your Streaming Skills with Spark and Real World IoT DataAdaryl "Bob" Wakefield, MBA
 
Make streaming processing towards ANSI SQL
Make streaming processing towards ANSI SQLMake streaming processing towards ANSI SQL
Make streaming processing towards ANSI SQLDataWorks Summit
 
Hadoop application architectures - using Customer 360 as an example
Hadoop application architectures - using Customer 360 as an exampleHadoop application architectures - using Customer 360 as an example
Hadoop application architectures - using Customer 360 as an examplehadooparchbook
 
Implementing the Lambda Architecture efficiently with Apache Spark
Implementing the Lambda Architecture efficiently with Apache SparkImplementing the Lambda Architecture efficiently with Apache Spark
Implementing the Lambda Architecture efficiently with Apache SparkDataWorks Summit
 
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
 Migration and Coexistence between Relational and NoSQL Databases by Manuel H... Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...Big Data Spain
 
MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...
MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...
MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...MongoDB
 

What's hot (20)

Kafka Summit SF 2017 - DNS for Data: The Need for a Stream Registry
Kafka Summit SF 2017 - DNS for Data: The Need for a Stream RegistryKafka Summit SF 2017 - DNS for Data: The Need for a Stream Registry
Kafka Summit SF 2017 - DNS for Data: The Need for a Stream Registry
 
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
 
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
 
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful StreamsKafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
 
Building a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data PipelineBuilding a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data Pipeline
 
NoSQL no more: SQL on Druid with Apache Calcite
NoSQL no more: SQL on Druid with Apache CalciteNoSQL no more: SQL on Druid with Apache Calcite
NoSQL no more: SQL on Druid with Apache Calcite
 
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data TechBig Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
 
Lambda architecture
Lambda architectureLambda architecture
Lambda architecture
 
Druid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiDruid Overview by Rachel Pedreschi
Druid Overview by Rachel Pedreschi
 
The Evolution of Big Data Pipelines at Intuit
The Evolution of Big Data Pipelines at Intuit The Evolution of Big Data Pipelines at Intuit
The Evolution of Big Data Pipelines at Intuit
 
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
 
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardDelta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
 
Real-time Distributed Stream Processing @ Scale
Real-time Distributed Stream Processing@ ScaleReal-time Distributed Stream Processing@ Scale
Real-time Distributed Stream Processing @ Scale
 
Perfecting Your Streaming Skills with Spark and Real World IoT Data
Perfecting Your Streaming Skills with Spark and Real World IoT DataPerfecting Your Streaming Skills with Spark and Real World IoT Data
Perfecting Your Streaming Skills with Spark and Real World IoT Data
 
Make streaming processing towards ANSI SQL
Make streaming processing towards ANSI SQLMake streaming processing towards ANSI SQL
Make streaming processing towards ANSI SQL
 
Hadoop application architectures - using Customer 360 as an example
Hadoop application architectures - using Customer 360 as an exampleHadoop application architectures - using Customer 360 as an example
Hadoop application architectures - using Customer 360 as an example
 
Implementing the Lambda Architecture efficiently with Apache Spark
Implementing the Lambda Architecture efficiently with Apache SparkImplementing the Lambda Architecture efficiently with Apache Spark
Implementing the Lambda Architecture efficiently with Apache Spark
 
Yahoo's Next Generation User Profile Platform
Yahoo's Next Generation User Profile PlatformYahoo's Next Generation User Profile Platform
Yahoo's Next Generation User Profile Platform
 
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
 Migration and Coexistence between Relational and NoSQL Databases by Manuel H... Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
 
MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...
MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...
MongoDB Europe 2016 - Choosing Between 100 Billion Travel Options – Instant S...
 

Viewers also liked

EAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using HadoopEAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using HadoopDataWorks Summit
 
Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1GurinderG
 
July 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using DruidJuly 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using DruidYahoo Developer Network
 
Druid realtime indexing
Druid realtime indexingDruid realtime indexing
Druid realtime indexingSeoeun Park
 
Aggregated queries with Druid on terrabytes and petabytes of data
Aggregated queries with Druid on terrabytes and petabytes of dataAggregated queries with Druid on terrabytes and petabytes of data
Aggregated queries with Druid on terrabytes and petabytes of dataRostislav Pashuto
 
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and DruidPulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and DruidTony Ng
 
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)SANG WON PARK
 
Building a Data Pipeline from Scratch - Joe Crobak
Building a Data Pipeline from Scratch - Joe CrobakBuilding a Data Pipeline from Scratch - Joe Crobak
Building a Data Pipeline from Scratch - Joe CrobakHakka Labs
 
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015NoSQLmatters
 
Big Data: It's More Than Volume, Paypal
Big Data: It's More Than Volume, PaypalBig Data: It's More Than Volume, Paypal
Big Data: It's More Than Volume, PaypalInnovation Enterprise
 
Big- Data and Risk Management - Ido Lustig, PayPal
Big- Data and Risk Management - Ido Lustig, PayPalBig- Data and Risk Management - Ido Lustig, PayPal
Big- Data and Risk Management - Ido Lustig, PayPalCodemotion Tel Aviv
 
Using druid for interactive count distinct queries at scale @ nmc
Using druid  for interactive count distinct queries at scale @ nmcUsing druid  for interactive count distinct queries at scale @ nmc
Using druid for interactive count distinct queries at scale @ nmcIdo Shilon
 
H2O World - Data Science w/ Big Data in a Corporate Environment - Nachum Shacham
H2O World - Data Science w/ Big Data in a Corporate Environment - Nachum ShachamH2O World - Data Science w/ Big Data in a Corporate Environment - Nachum Shacham
H2O World - Data Science w/ Big Data in a Corporate Environment - Nachum ShachamSri Ambati
 
Interactive analytics at scale with druid
Interactive analytics at scale with druidInteractive analytics at scale with druid
Interactive analytics at scale with druidJulien Lavigne du Cadet
 
Clash of the Titans: Releasing the Kraken | NodeJS @paypal
Clash of the Titans: Releasing the Kraken | NodeJS @paypalClash of the Titans: Releasing the Kraken | NodeJS @paypal
Clash of the Titans: Releasing the Kraken | NodeJS @paypalBill Scott
 
Data Analytics with Druid
Data Analytics with DruidData Analytics with Druid
Data Analytics with DruidYousun Jeong
 
Programmatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidProgrammatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidCharles Allen
 
PayPal Behavioral Analytics on Hadoop
PayPal Behavioral Analytics on HadoopPayPal Behavioral Analytics on Hadoop
PayPal Behavioral Analytics on HadoopDataWorks Summit
 

Viewers also liked (20)

EAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using HadoopEAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using Hadoop
 
Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1
 
July 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using DruidJuly 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using Druid
 
Druid realtime indexing
Druid realtime indexingDruid realtime indexing
Druid realtime indexing
 
Aggregated queries with Druid on terrabytes and petabytes of data
Aggregated queries with Druid on terrabytes and petabytes of dataAggregated queries with Druid on terrabytes and petabytes of data
Aggregated queries with Druid on terrabytes and petabytes of data
 
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and DruidPulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid
 
Scalable Real-time analytics using Druid
Scalable Real-time analytics using DruidScalable Real-time analytics using Druid
Scalable Real-time analytics using Druid
 
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)
 
Building a Data Pipeline from Scratch - Joe Crobak
Building a Data Pipeline from Scratch - Joe CrobakBuilding a Data Pipeline from Scratch - Joe Crobak
Building a Data Pipeline from Scratch - Joe Crobak
 
Hadoop at eBay
Hadoop at eBayHadoop at eBay
Hadoop at eBay
 
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
 
Big Data: It's More Than Volume, Paypal
Big Data: It's More Than Volume, PaypalBig Data: It's More Than Volume, Paypal
Big Data: It's More Than Volume, Paypal
 
Big- Data and Risk Management - Ido Lustig, PayPal
Big- Data and Risk Management - Ido Lustig, PayPalBig- Data and Risk Management - Ido Lustig, PayPal
Big- Data and Risk Management - Ido Lustig, PayPal
 
Using druid for interactive count distinct queries at scale @ nmc
Using druid  for interactive count distinct queries at scale @ nmcUsing druid  for interactive count distinct queries at scale @ nmc
Using druid for interactive count distinct queries at scale @ nmc
 
H2O World - Data Science w/ Big Data in a Corporate Environment - Nachum Shacham
H2O World - Data Science w/ Big Data in a Corporate Environment - Nachum ShachamH2O World - Data Science w/ Big Data in a Corporate Environment - Nachum Shacham
H2O World - Data Science w/ Big Data in a Corporate Environment - Nachum Shacham
 
Interactive analytics at scale with druid
Interactive analytics at scale with druidInteractive analytics at scale with druid
Interactive analytics at scale with druid
 
Clash of the Titans: Releasing the Kraken | NodeJS @paypal
Clash of the Titans: Releasing the Kraken | NodeJS @paypalClash of the Titans: Releasing the Kraken | NodeJS @paypal
Clash of the Titans: Releasing the Kraken | NodeJS @paypal
 
Data Analytics with Druid
Data Analytics with DruidData Analytics with Druid
Data Analytics with Druid
 
Programmatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidProgrammatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & Druid
 
PayPal Behavioral Analytics on Hadoop
PayPal Behavioral Analytics on HadoopPayPal Behavioral Analytics on Hadoop
PayPal Behavioral Analytics on Hadoop
 

Similar to PayPal Real Time Analytics

PayPal couchbase 2014
PayPal couchbase 2014PayPal couchbase 2014
PayPal couchbase 2014Anil Madan
 
Neo4j Aura on AWS: The Customer Choice for Graph Databases
Neo4j Aura on AWS: The Customer Choice for Graph DatabasesNeo4j Aura on AWS: The Customer Choice for Graph Databases
Neo4j Aura on AWS: The Customer Choice for Graph DatabasesNeo4j
 
Grokking Engineering - Data Analytics Infrastructure at Viki - Huy Nguyen
Grokking Engineering - Data Analytics Infrastructure at Viki - Huy NguyenGrokking Engineering - Data Analytics Infrastructure at Viki - Huy Nguyen
Grokking Engineering - Data Analytics Infrastructure at Viki - Huy NguyenHuy Nguyen
 
Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...
Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...
Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...Amazon Web Services
 
[WSO2Con Asia 2018] Patterns for Building Streaming Apps
[WSO2Con Asia 2018] Patterns for Building Streaming Apps[WSO2Con Asia 2018] Patterns for Building Streaming Apps
[WSO2Con Asia 2018] Patterns for Building Streaming AppsWSO2
 
AWS 金融服務概覽與區塊鍊案例分享
AWS 金融服務概覽與區塊鍊案例分享AWS 金融服務概覽與區塊鍊案例分享
AWS 金融服務概覽與區塊鍊案例分享Amazon Web Services
 
Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013
Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013
Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013Deepak Nadig
 
New Approaches for Fraud Detection on Apache Kafka and KSQL
New Approaches for Fraud Detection on Apache Kafka and KSQLNew Approaches for Fraud Detection on Apache Kafka and KSQL
New Approaches for Fraud Detection on Apache Kafka and KSQLconfluent
 
APIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , Kong
APIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , KongAPIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , Kong
APIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , Kongapidays
 
Building upon existing infrastructure for Mobile Applications with WSO2
Building upon existing infrastructure for Mobile Applications with WSO2Building upon existing infrastructure for Mobile Applications with WSO2
Building upon existing infrastructure for Mobile Applications with WSO2Anthony Carlson
 
When Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkWhen Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkJim Kaplan CIA CFE
 
Transforming Financial Services with Event Streaming Data
Transforming Financial Services with Event Streaming DataTransforming Financial Services with Event Streaming Data
Transforming Financial Services with Event Streaming Dataconfluent
 
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics Platform
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics PlatformWSO2Con EU 2015: An Introduction to the WSO2 Data Analytics Platform
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics PlatformWSO2
 
Modernizing i5 Applications
Modernizing i5 ApplicationsModernizing i5 Applications
Modernizing i5 ApplicationsZendCon
 
AWS Activate Webinar - Growing on AWS
AWS Activate Webinar - Growing on AWSAWS Activate Webinar - Growing on AWS
AWS Activate Webinar - Growing on AWSAmazon Web Services
 
Azure Stream Analytics : Analyse Data in Motion
Azure Stream Analytics  : Analyse Data in MotionAzure Stream Analytics  : Analyse Data in Motion
Azure Stream Analytics : Analyse Data in MotionRuhani Arora
 
"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr Malyk
"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr Malyk"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr Malyk
"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr MalykFwdays
 
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Flink Forward
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkconfluent
 
An Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking ArchitectureAn Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking ArchitectureWSO2
 

Similar to PayPal Real Time Analytics (20)

PayPal couchbase 2014
PayPal couchbase 2014PayPal couchbase 2014
PayPal couchbase 2014
 
Neo4j Aura on AWS: The Customer Choice for Graph Databases
Neo4j Aura on AWS: The Customer Choice for Graph DatabasesNeo4j Aura on AWS: The Customer Choice for Graph Databases
Neo4j Aura on AWS: The Customer Choice for Graph Databases
 
Grokking Engineering - Data Analytics Infrastructure at Viki - Huy Nguyen
Grokking Engineering - Data Analytics Infrastructure at Viki - Huy NguyenGrokking Engineering - Data Analytics Infrastructure at Viki - Huy Nguyen
Grokking Engineering - Data Analytics Infrastructure at Viki - Huy Nguyen
 
Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...
Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...
Optimize Your SaaS Offering with Serverless Microservices (GPSTEC405) - AWS r...
 
[WSO2Con Asia 2018] Patterns for Building Streaming Apps
[WSO2Con Asia 2018] Patterns for Building Streaming Apps[WSO2Con Asia 2018] Patterns for Building Streaming Apps
[WSO2Con Asia 2018] Patterns for Building Streaming Apps
 
AWS 金融服務概覽與區塊鍊案例分享
AWS 金融服務概覽與區塊鍊案例分享AWS 金融服務概覽與區塊鍊案例分享
AWS 金融服務概覽與區塊鍊案例分享
 
Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013
Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013
Redesigning PayPal APIs for Scale and Simplicity - QCon San Francisco 2013
 
New Approaches for Fraud Detection on Apache Kafka and KSQL
New Approaches for Fraud Detection on Apache Kafka and KSQLNew Approaches for Fraud Detection on Apache Kafka and KSQL
New Approaches for Fraud Detection on Apache Kafka and KSQL
 
APIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , Kong
APIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , KongAPIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , Kong
APIdays Paris 2019 - Adopting Service Mesh by Marco Palladino , Kong
 
Building upon existing infrastructure for Mobile Applications with WSO2
Building upon existing infrastructure for Mobile Applications with WSO2Building upon existing infrastructure for Mobile Applications with WSO2
Building upon existing infrastructure for Mobile Applications with WSO2
 
When Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkWhen Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t Work
 
Transforming Financial Services with Event Streaming Data
Transforming Financial Services with Event Streaming DataTransforming Financial Services with Event Streaming Data
Transforming Financial Services with Event Streaming Data
 
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics Platform
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics PlatformWSO2Con EU 2015: An Introduction to the WSO2 Data Analytics Platform
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics Platform
 
Modernizing i5 Applications
Modernizing i5 ApplicationsModernizing i5 Applications
Modernizing i5 Applications
 
AWS Activate Webinar - Growing on AWS
AWS Activate Webinar - Growing on AWSAWS Activate Webinar - Growing on AWS
AWS Activate Webinar - Growing on AWS
 
Azure Stream Analytics : Analyse Data in Motion
Azure Stream Analytics  : Analyse Data in MotionAzure Stream Analytics  : Analyse Data in Motion
Azure Stream Analytics : Analyse Data in Motion
 
"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr Malyk
"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr Malyk"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr Malyk
"Fintech inside of a SaaS powered by 2000+ Microservices", Volodymyr Malyk
 
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalk
 
An Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking ArchitectureAn Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking Architecture
 

Recently uploaded

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 

Recently uploaded (20)

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 

PayPal Real Time Analytics

  • 1. Open Source Real Time BI using Storm, Hadoop, Titan, Druid & D3 Anil Madan Sr. Director Engineering, PayPal © 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
  • 2. $1 in every $6 Spent on e-commerce is spent through PayPal.*
  • 3. Creating Tomorrow’s Mobile Payment Experiences 25 countries with live PayPal fingerprint authentication on Samsung devices.
  • 4. Helping Developers Innovate & Monetize New Mobile Apps Braintree launches its new API, including Pay with PayPal.
  • 5. PayPal Now Available in 203 Markets 10 new markets added in the second quarter, making PayPal available to 80 million new internet users. Paraguay Côte d’Ivoire Nigeria Monaco Belarus Montenegro Moldova Macedonia Cameroon Zimbabwe
  • 6. How can we help them to complete their 1st payment? Business Problem Acquisition Awareness Activation Adoption Where do prospects sign up for accounts? How do prospective customers learn about PayPal? How can we help them use PayPal even more? © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. We need to better understand our customers…
  • 7. How we solved it… Tracking Servers Mobile © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. Direct/Home Page Product Experiences Search Engine Marketing Transaction Emails Tracking Metadata Tool Taxonomy Tracking Event Service Tag Catalog Tracking Validation Service Real Time Systems Marketing Segmentation Experimentation Metadata Big Data Exploratory Analytics Attribution Predictive Analytics
  • 8. Metadata Instrumentation Collection Processing Analytics Server Side Events © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. Pathing Store DRUID Metrics Store Reporting & Visualization Logical View Client Side Events Page Performance Events Collection Service Sessionization Behavioral Metrics Marketing Metrics Performance Metrics Operational Metrics (OpenTSDB) Real Time Event Metrics
  • 9. Metadata –Logical Entity Model TEMPLATE PAGE COMPONENTS © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. LINK TAGS
  • 10. Metadata – Logical Event Model Impression Event © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. Tracking Event Reaction Event Component Impression Event Ad Impression Event Click Event Click-Through Event Mouse-over Event Entry Event Exit Event Outcome Event Page Impression Event Client Page Impression Event Server Page Impression Event
  • 11. Metadata - Self-Service Management Workflow… © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 11
  • 12. DATA PIPELINE Processing Analysis & Customers Client Visualization Side Metadata HTTP © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. Performance Collection Metrics Tools REST Spout Bot flagging Bolt Sessionization Aggregation R E S Proxy T Server Side Geo Enrichment Bolt R e p o r ti n g Data Stores Druid Apache Titan Developers Product Owners Meta data Reporting Consumers Metadata Service
  • 13. Druid Architecture • Open-source • Distributed • Real-time • Highly-Available Data store • Column-oriented • Approximate or Exact © 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
  • 14. Real Time Nodes • Ingest data and buffer events in memory • Incremental indexing • Query data as soon as it is ingested • Periodically persist collected events to disk • Combine multiple disk indexes to create immutable ‘segments’ • Log-structured merge-tree © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 14
  • 15. Druid Architecture © 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
  • 16. Historical Nodes • Load immutable read-optimized data from deep storage • Memory mapped storage engine • Caches segments • Supports tiered storage © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 16
  • 17. Druid Architecture © 2014 PayPal Inc. All rights reserved. Confidential and proprietary.
  • 18. Druid Systems Overview © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 18
  • 19. Metrics & Dimensions "type": "doubleSum", "name": "pageviews", "fieldName": "PV" }, { "type": "doubleSum", "name": "bounces", "fieldName": "bnc" }, .... { "type": "hyperUnique", "name": "unique_visits", "fieldName": "user_session_guid" }, { "type": "hyperUnique", "name": "unique_visitors", "fieldName": "user_guid" } 2014/06/11/10", "filter": "part-", "parser": { "type": "string", "timestampSpec": { "column": "timestamp", "format": "auto" }, "data": { "format": "json", "dimensions": [ "timestamp", "USER_GUID", "USER_SESSION_GUID", "PAGE_GROUP", "PAGE_NAME", "PAGEGROUP_LINK_NAME", "PAGE_LINK_NAME", … © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 19 Standard Metrics Estimated Metrics HyperLogLog Dimensions
  • 20. Sessionization Events VisitContainer © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 20 Visitor ID Session ID Timestamp Event Payload V1 S1 2014-10-16 05:12 E1 V2 S2 2014-10-16 05:14 E2 V1 S1 2014-10-16 05:15 E3 V1 S1 2014-10-16 05:20 E4 V2 S2 2014-10-16 05:21 E5 V1 S3 2014-10-16 05:25 E6 … … … … Visitor ID Session ID Payload V1 S1 sf, mac, {flash, quicktime}, {ca, usa}, 480 secs,…. E1 E3 E4 V2 S2 ff, win, {acrobat, mediaplayer}. {wb, in}, 420 secs….. E2 E5 V1 S3 sf, mac, {quicktime, java}, {on, ca}, 60 secs E6
  • 21. Druid Storage – Columns & Dictionaries Timestamp (Hr) Sessi on ID Country OS User Agent Page Name Page Name 0 1 2014-10-16 05 S1 US MAC SF Login AccountOverview 0 2 3 0 2 4 0 5 4 0 5 2014-10-16 05 S2 DE WIN IE Login PaymentReview AccountHistory 2014-10-16 05 S3 US LNX FF Login PaymentReview Checkout 2014-10-16 05 S4 UK LNX FF Login Profile Checkout 2014-10-16 05 S5 DE WIN CR Login Profile 0 1 4 2014-10-16 05 S6 UK MAC SF Login AccountOverview Checkout Dictionary Login 0 AccountOvervie w © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 21 1 PaymentReview 2 AccountHistory 3 Checkout 4 LZF Profile 5
  • 22. Druid Data Structure - Bitmap Indices © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 22
  • 23. Herald – Self Service Analytics © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 23
  • 24. Herald – Self Service Analytics © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 24
  • 25. Druid Metrics © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 25
  • 26. Pathing © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 26 Enter
  • 27. Fallout Reports © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 27
  • 28. Pathing A->B->C->D->X->A->M and A->B->C->D->E Visitor ID Current Page Next Page 1 Next Page 2 Prev Page 1 Prev Page 2 S1 A B C null null S1 B C D A null S1 C D X B A S1 D X A C B S1 X A M D C S1 A M null X D S1 M Null null A X S2 A B C null Null S2 B C D null A S2 C D E B A S2 D E Null C B S2 E Null null D C © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 28
  • 29. Pathing Next Page { “queryType” : “groupBy” “dimensions” : (“current_page”, “dimensions like country, segmentation etc”} “aggregations” : [ { “type”: “count”, “name”: “next_page_count”, “fieldname” : “next_page, next_page2” }] “filter”: { “type”: “selector”, “dimension”: “current_page”, “value”: “C” } } Previous Page { “queryType” : “groupBy” “dimensions” : {“current_page”, “dimensions like country, segmentations etc”} “aggregations” : [ { “type”: “count”, “name”: “prev_page_count”, “fieldname” : “prev_page1, prev_page2” }] “filter”: { “type”: “selector”, “dimension”: “current_page”, “value”: “C” } } © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 29
  • 30. A->B->C->D->X->A->M A->D-> X->M “queryType” : “search” “dimensions” : { “current_page_path_count”, “dimensions like country, segmentation etc”} “filter”: { “type”: “regex”, “dimension”: “next_page_path”, “pattern”: “^A*D*X*M$” } } © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 30 Fallout • Apply them to the dictionary • Figure out the values that match • Take those bitmap indices • OR the bitmap indices together • Use the output bitmap as the filter
  • 31. Model View Controller Directives NVD3 © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 31 CLIENT SERVER Herald Architecture
  • 32. SSO Druid Herald Deployment © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 32
  • 33. Adhoc Graph Analytics © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 33 Name: Login_20141 01611 Country: US Count: 15 Name: AccountOver view_201410 1611 Name: PaymentRevi ew_ 2014101611 Name: Checkout_20 14101611 Country: US Count: 5 Country: US Count: 5 Country: US Count: 10 5 8 7 6
  • 34. Name: Login_2014 101611 Country: US Count: 15 Name: AccountOv erview_201 4101611 Name: PaymentRe view_2014 101611 Name: Checkout_ 201410161 1 Country: US Count: 5 6 Country: US Count: 5 7 Country: US Count: 10 5 8 gremlin> g.v(‘Name’, ‘Login_2014101611'). as('x’). outE.inV.loop('x') {it.loops < 4} {it.object.getProperty('name') == 'Checkout_2014101611'}.path © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 34
  • 35. Summary • Problem • Understand our customer behavior • Across disparate channels & experiences • Solution • Democratize data • Consistent standardized metadata • Disciplined instrumentation • Distributed scalable backend for adhoc & interactive analytics • Self-service BI through modern visualization tools © 2014 PayPal Inc. All rights reserved. Confidential and proprietary. 35
  • 36. Questions ? © 2014 PayPal Inc. All rights reserved. Confidential and proprietary.