Tubular Labs - Using Elastic to Search Over 2.5B Videos
Tubular Labs
Presentation given by Stas Starcha at the Elasticsearch meetup on 9/13/16.
1.
Using Elastic to search over 2.5B videos
2.
Talk structure
● 4 steps to make user experience great again
● 4 patterns to simplify architecture and reduce costs
© 2016 Tubular Labs
3.
Data size
● 2.5B documents
● Average document size 2 KB, 4 TB total size
● 200M daily updates (~8% of the index)
● Constant indexing rate of 3k docs/s, with spikes
● Query rate of 1-3 requests/s (low concurrency)
4.
Hardware
Before
● 52 x c3.4xlarge
● 128 shards
● 16 cores per node
● ~3 shards per node
● 832 cores, 16 TB SSD, 1.5 TB RAM
After (25% bigger)
● 26 x c3.8xlarge
● 416 shards
● 32 cores per node
● 16 shards per node
● 832 cores, 16 TB SSD, 1.5 TB RAM
5.
Indexing
6.
Optimize indexing
● Using bulk API
  • 1 MB per batch (500 docs), should be 5k docs/s
  • Recommended 5-15 MB
● Increasing refresh interval
  • From 1 to 30 seconds
● Monitoring bulk.rejected
  • Increased bulk.queueSize from 50 to 2000
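A minimal sketch of the batching idea on this slide: group documents into Bulk API request bodies capped by payload size rather than by a fixed document count. The index name, the 5 MB cap, the `id` field, and the helper name `bulk_bodies` are assumptions for illustration, not details from the talk. Older Elasticsearch versions also expect a document type in the action line.

```python
import json
from typing import Dict, Iterable, Iterator, List

# Assumed target payload size; the slide only says 5-15 MB is recommended.
MAX_BATCH_BYTES = 5 * 1024 * 1024


def bulk_bodies(docs: Iterable[Dict], index: str,
                max_bytes: int = MAX_BATCH_BYTES) -> Iterator[str]:
    """Yield newline-delimited Bulk API bodies, each at most about max_bytes."""
    lines: List[str] = []
    size = 0
    for doc in docs:
        # Each document contributes an action line and a source line.
        action = json.dumps({"index": {"_index": index, "_id": doc["id"]}})
        source = json.dumps(doc)
        pair_size = len(action.encode()) + len(source.encode()) + 2  # two newlines
        # Flush the current batch before it would exceed the cap.
        if lines and size + pair_size > max_bytes:
            yield "\n".join(lines) + "\n"
            lines, size = [], 0
        lines += [action, source]
        size += pair_size
    if lines:
        yield "\n".join(lines) + "\n"


# Index settings body that raises the refresh interval to 30 seconds.
REFRESH_SETTINGS = {"index": {"refresh_interval": "30s"}}
```

Each yielded string can be sent as the body of one `_bulk` request, and `REFRESH_SETTINGS` as the body of an index settings update.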
7.
Searching
8.
Product view
● Summary
● Search results
● Term aggregations
9.
Before optimization
10.
Goal
Problem
• Slow queries
Goal
• From 15 to 5 seconds at the 95th percentile
• Seeking a 3x improvement
11.
Understand hardware utilization
• Run the heaviest query
• No bottlenecks (CPU, disk IO, network)
• Thread pool search.size 25
• Max search.active is 3
12.
CPU utilization
• Know your concurrency
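The observation that search.active peaks at 3 on a 16-core node can be turned into a rough estimate. The sketch below assumes each shard-level search runs on a single thread, so one query keeps at most as many cores busy per node as there are shards on that node. The function name is hypothetical and the model ignores everything except thread count.

```python
def busy_core_fraction(shards_per_node: int, cores_per_node: int,
                       concurrent_queries: int = 1) -> float:
    """Upper bound on the share of a node's cores a workload can keep busy,
    assuming one search thread per shard per query."""
    active_threads = shards_per_node * concurrent_queries
    return min(active_threads, cores_per_node) / cores_per_node


# Shard and core counts from the Hardware slide.
before = busy_core_fraction(shards_per_node=3, cores_per_node=16)
after = busy_core_fraction(shards_per_node=16, cores_per_node=32)
print(f"before: {before:.2f}, after: {after:.2f}")
```

Under this assumption a single query could use about 3 of 16 cores per node before the change and 16 of 32 after it, which is why more shards per node helps when query concurrency is low.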
13.
Benchmarking # of shards
On a single 32-core node
14.
More CPU per request results
15s to 7.5s
15.
Search & Aggregations
• Searching and sorting is fast
• 8 term aggregations are slow
16.
Aggregation impact
17.
Check facet usage
● Talk to your product manager
● Low product usage
● Remove networks and claims aggregations
● Replace facets with filters
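One way to picture "replace facets with filters": a dropped facet no longer gets a terms aggregation, but the user can still narrow results on that field through a plain term filter. This is a sketch only; the field names (`title`, `category`, `network`) and the helper `build_search` are assumptions, not the talk's actual schema.

```python
from typing import Dict, List, Optional


def build_search(text: str, facets: List[str],
                 filters: Optional[Dict[str, str]] = None) -> Dict:
    """Build a search body with terms aggregations only for the kept facets;
    dropped facets are applied as term filters instead."""
    query = {
        "bool": {
            "must": [{"match": {"title": text}}],
            # Filters narrow the result set without computing per-term counts.
            "filter": [{"term": {field: value}}
                       for field, value in (filters or {}).items()],
        }
    }
    aggs = {field: {"terms": {"field": field, "size": 10}} for field in facets}
    return {"query": query, "aggs": aggs}


# "network" is no longer aggregated, only filtered on.
body = build_search("cats", facets=["category"], filters={"network": "youtube"})
```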
18.
Removing two aggregations results
15s to 5.3s
19.
Cardinality
● Reduce cardinality
● Going from 200M to 5M (channels to creators)
● Reducing # of topics from 5M to 500
20.
Reducing cardinality results
15s to 4.4s
21.
Split query and aggregations
● Searching and aggregating separately
● Using shard-level query cache
● Showing results in UI asynchronously
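A sketch of the split, assuming the original request carries both a query and aggregations in one body: the first request returns hits only, and the second returns aggregations only with `size` set to 0. Requests with `size` 0 are the ones the shard-level cache stores by default (later Elasticsearch versions call it the shard request cache). The helper name `split_request` is hypothetical.

```python
import copy
from typing import Dict, Tuple


def split_request(body: Dict) -> Tuple[Dict, Dict]:
    """Split one search body into a hits-only and an aggregations-only request."""
    # Hits request: same query, sort and paging, but no aggregations.
    hits_body = copy.deepcopy(body)
    hits_body.pop("aggs", None)

    # Aggregations request: same query, no hits returned.
    aggs_body = {
        "query": copy.deepcopy(body.get("query", {"match_all": {}})),
        "aggs": copy.deepcopy(body.get("aggs", {})),
        "size": 0,
    }
    return hits_body, aggs_body
```

The two bodies can then be sent independently, so the UI can render search results as soon as the first response arrives and fill in the facet counts when the second one does.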
22.
Split query and aggregations results
15s to 4.0s
23.
Performance gain
● From 15 to 4 seconds (<5 seconds)
● Overall improvement 3.7x
● What about costs?
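The reported latencies can be lined up to see how much each step contributed. The sketch below assumes the four results are cumulative (each step applied on top of the previous ones), which the slides suggest but do not state outright.

```python
from typing import List, Tuple

BASELINE_S = 15.0

# Latency after each step, as reported on the result slides.
STEPS: List[Tuple[str, float]] = [
    ("More CPU per request", 7.5),
    ("Removing two aggregations", 5.3),
    ("Reducing cardinality", 4.4),
    ("Split query and aggregations", 4.0),
]


def speedups(baseline: float,
             steps: List[Tuple[str, float]]) -> List[Tuple[str, float]]:
    """Return the speedup over the baseline after each step."""
    return [(name, baseline / latency) for name, latency in steps]


for name, factor in speedups(BASELINE_S, STEPS):
    print(f"{name}: {factor:.2f}x")
```

The final ratio is 15 / 4 = 3.75, which the slide reports as 3.7x.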
24.
Architecture patterns
25.
Part 2. Goals
● Reduce costs
● Improve reliability
● Simplify architecture
● Reduce variability in latency
26.
Current flow
● Too many dependencies
● Expensive intermediate storage
27.
Denormalization
● 90% of data is shared
● No extra calls from frontend
28.
Partial updates with Update API (experimental)
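For illustration, a partial update sends only the changed fields under a `doc` key, and it can be batched through the Bulk API as an `update` action. The index name, document ID, and field names below are made up, and the helper `partial_update_lines` is hypothetical; the slide does not show the actual requests used.

```python
import json
from typing import Dict


def partial_update_lines(index: str, doc_id: str, changed_fields: Dict) -> str:
    """Return the two Bulk API lines for a partial update of one document."""
    action = {"update": {"_index": index, "_id": doc_id}}
    # Only the listed fields are merged into the stored document.
    payload = {"doc": changed_fields}
    return json.dumps(action) + "\n" + json.dumps(payload) + "\n"


# Hypothetical example: refresh only the view counter of one video.
print(partial_update_lines("videos", "abc123", {"views": 1024}), end="")
```

Elasticsearch still reindexes the whole document internally on an update, so this mainly saves the client from fetching and resending the unchanged fields.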
29.
“Partial” updates with parent-child relations (experimental)
30.
Split data by hot/full (idea for future)
● Cheaper hardware on full
● Shard allocation filtering
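Shard allocation filtering works by tagging nodes with a custom attribute and requiring a matching value in an index's settings. A minimal sketch of those settings follows; the attribute name `box_type`, the tier values, and the helper name are assumptions, since the slide describes this only as a future idea.

```python
from typing import Dict

# Assumed custom node attribute name; any attribute name can be used.
ATTRIBUTE = "box_type"


def allocation_settings(tier: str) -> Dict[str, str]:
    """Index settings body that pins an index's shards to nodes tagged with the tier."""
    return {f"index.routing.allocation.require.{ATTRIBUTE}": tier}


hot_settings = allocation_settings("hot")    # index kept on the faster nodes
full_settings = allocation_settings("full")  # index kept on the cheaper nodes
```

Each body would be applied through the index settings API, with the matching attribute set in the configuration of each node.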
31.
Thank you