SlideShare a Scribd company logo
1 of 19
codecentric AG 17.12.2014 Seite 1
Essential Elasticsearch
Performance Tips
codecentric AG
BULKING
17.12.2014 Seite 2
Why doesn‘t anyone bulk?!?
codecentric AG
INDEXING SPEED
17.12.2014 Seite 3
Change your configuration during important events
codecentric AG
MAPPING
17.12.2014 Seite 4
Y‘all need to think more abour your mappings
codecentric AG
FILTERS AND CACHES
17.12.2014 Seite 5
Filters instead of Queries as often as possible
codecentric AG
FILTERS ORDERING
17.12.2014 Seite 6
What comes first?
The Chicken or the Egg?
The Query or the Filter?
codecentric AG
QUERIES
17.12.2014 Seite 7
So much room for optimizations!
codecentric AG
AGGREGATIONS
17.12.2014 Seite 8
Aggregations are expensive!
codecentric AG
DOC VALUES
17.12.2014 Seite 9
Store field data on disk instead of on the heap
codecentric AG
UPDATES
17.12.2014 Seite 10
There‘s no such thing as an update
codecentric AG
IDENTIFIER
17.12.2014 Seite 11
Use friendly IDs !
codecentric AG
SHARDING
17.12.2014 Seite 12
Choose a sharding strategy!
codecentric AG
ROUTING
17.12.2014 Seite 13
Avoid distibuted searches by routing
codecentric AG
CLIENTS
17.12.2014 Seite 14
Use the right client!
codecentric AG
SERVER CONFIGURATION
17.12.2014 Seite 15
There are plenty of essential configurations
codecentric AG
FINAL WORDS
17.12.2014 Seite 16
Tuning: Measure, don‘t guess!
codecentric AG
FINAL WORDS
17.12.2014 Seite 17
One change at a time!
codecentric AG
CHRISTIAN UHL
CHRISTIAN.UHL@CODECENTRIC.DE
@CHRISUHLCC
HTTPS://BLOG.CODECENTRIC.DE/
QUESTIONS?
17.12.2014 Seite 18
codecentric AG
• HTTPS://WWW.FLICKR.COM/PHOTOS/TURATTI/6322618398/
• HTTPS://WWW.FLICKR.COM/PHOTOS/BILLDPIX/15294289120/
• HTTPS://WWW.FLICKR.COM/PHOTOS/63541243@N04/14214457280/
• HTTPS://WWW.FLICKR.COM/PHOTOS/AMORTIZE/527435776/
• HTTPS://WWW.FLICKR.COM/PHOTOS/JAREDZIMMERMAN/1392753867/
• HTTPS://WWW.FLICKR.COM/PHOTOS/JONSEIDMAN1988/6155279037/
• HTTPS://WWW.FLICKR.COM/PHOTOS/FORNAL/424716302/
• HTTPS://WWW.FLICKR.COM/PHOTOS/EDYSON/107902861
• HTTPS://WWW.FLICKR.COM/PHOTOS/JMCPHOTOS/2131206015/
• HTTPS://WWW.FLICKR.COM/PHOTOS/AUTOWITCH/4271929/
• HTTPS://WWW.FLICKR.COM/PHOTOS/MRMUSKRAT/3637703614/
• HTTPS://WWW.FLICKR.COM/PHOTOS/PHOTOLIBRARIAN/7578139852/
BILDNACHWEISE
17.12.2014 Seite 19

More Related Content

Similar to Elasticsearch performance tips

Coffee Oil Extraction unit: LabWork and SolidWorks Design
Coffee Oil Extraction unit: LabWork and SolidWorks DesignCoffee Oil Extraction unit: LabWork and SolidWorks Design
Coffee Oil Extraction unit: LabWork and SolidWorks DesignSara Chergaoui
 
Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...Aerospike, Inc.
 
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心NVIDIA Taiwan
 
High availability microsoftvsoracle
High availability microsoftvsoracleHigh availability microsoftvsoracle
High availability microsoftvsoracleJacques Kostic
 
Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...
Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...
Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...Principled Technologies
 
5 Tips for a More Efficient Data Center
5 Tips for a More Efficient Data Center5 Tips for a More Efficient Data Center
5 Tips for a More Efficient Data CenterWestern Digital
 
Data proliferation and machine learning: The case for upgrading your servers ...
Data proliferation and machine learning: The case for upgrading your servers ...Data proliferation and machine learning: The case for upgrading your servers ...
Data proliferation and machine learning: The case for upgrading your servers ...Principled Technologies
 
Optimizing Total Cost of Ownership for the AWS Cloud
Optimizing Total Cost of Ownership for the AWS CloudOptimizing Total Cost of Ownership for the AWS Cloud
Optimizing Total Cost of Ownership for the AWS CloudAmazon Web Services
 
Cloud Native Cost Optimization
Cloud Native Cost OptimizationCloud Native Cost Optimization
Cloud Native Cost OptimizationAdrian Cockcroft
 
A way to visual the best storage media for an application
A way to visual the best storage media for an applicationA way to visual the best storage media for an application
A way to visual the best storage media for an applicationTony Roug
 
Getting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDsGetting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDsAerospike, Inc.
 
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdgeThe benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdgePrincipled Technologies
 
Support more customers and gain business insights faster - Infographic
Support more customers and gain business insights faster - Infographic Support more customers and gain business insights faster - Infographic
Support more customers and gain business insights faster - Infographic Principled Technologies
 
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIA
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIAH2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIA
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIASri Ambati
 
Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14AMD Developer Central
 
Sustainable Architecture Design
Sustainable Architecture DesignSustainable Architecture Design
Sustainable Architecture DesignKevin Francis
 
Flash Stories: How Customers Make Smarter Decisions Faster
Flash Stories: How Customers Make Smarter Decisions FasterFlash Stories: How Customers Make Smarter Decisions Faster
Flash Stories: How Customers Make Smarter Decisions FasterWestern Digital
 
Delivering Exceptional Customer Experiences from the Cloud
Delivering Exceptional Customer Experiences from the CloudDelivering Exceptional Customer Experiences from the Cloud
Delivering Exceptional Customer Experiences from the CloudSAP Customer Experience
 

Similar to Elasticsearch performance tips (20)

Coffee Oil Extraction unit: LabWork and SolidWorks Design
Coffee Oil Extraction unit: LabWork and SolidWorks DesignCoffee Oil Extraction unit: LabWork and SolidWorks Design
Coffee Oil Extraction unit: LabWork and SolidWorks Design
 
Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...
 
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
 
High availability microsoftvsoracle
High availability microsoftvsoracleHigh availability microsoftvsoracle
High availability microsoftvsoracle
 
Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...
Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...
Get a better user experience by upgrading to the new HP EliteBook 840 G9 - In...
 
5 Tips for a More Efficient Data Center
5 Tips for a More Efficient Data Center5 Tips for a More Efficient Data Center
5 Tips for a More Efficient Data Center
 
Data proliferation and machine learning: The case for upgrading your servers ...
Data proliferation and machine learning: The case for upgrading your servers ...Data proliferation and machine learning: The case for upgrading your servers ...
Data proliferation and machine learning: The case for upgrading your servers ...
 
Optimizing Total Cost of Ownership for the AWS Cloud
Optimizing Total Cost of Ownership for the AWS CloudOptimizing Total Cost of Ownership for the AWS Cloud
Optimizing Total Cost of Ownership for the AWS Cloud
 
Cloud Native Cost Optimization
Cloud Native Cost OptimizationCloud Native Cost Optimization
Cloud Native Cost Optimization
 
A way to visual the best storage media for an application
A way to visual the best storage media for an applicationA way to visual the best storage media for an application
A way to visual the best storage media for an application
 
Getting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDsGetting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDs
 
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdgeThe benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
 
Support more customers and gain business insights faster - Infographic
Support more customers and gain business insights faster - Infographic Support more customers and gain business insights faster - Infographic
Support more customers and gain business insights faster - Infographic
 
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIA
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIAH2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIA
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIA
 
Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14
 
Sustainable Architecture Design
Sustainable Architecture DesignSustainable Architecture Design
Sustainable Architecture Design
 
Flash Stories: How Customers Make Smarter Decisions Faster
Flash Stories: How Customers Make Smarter Decisions FasterFlash Stories: How Customers Make Smarter Decisions Faster
Flash Stories: How Customers Make Smarter Decisions Faster
 
Green Cloud Computing
Green Cloud ComputingGreen Cloud Computing
Green Cloud Computing
 
Delivering Exceptional Customer Experiences from the Cloud
Delivering Exceptional Customer Experiences from the CloudDelivering Exceptional Customer Experiences from the Cloud
Delivering Exceptional Customer Experiences from the Cloud
 
OpenStack Days Krakow
OpenStack Days KrakowOpenStack Days Krakow
OpenStack Days Krakow
 

Recently uploaded

5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 

Recently uploaded (20)

5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 

Elasticsearch performance tips

Editor's Notes

  1. Bulking for Indexing, creating, updating and deleting Bulk size in Bytes, not number of documents If in doubt, smaller batch sizes Parallelize multiple bulks Async calls
  2. Turn of refresh while indexing Delay flushes Throttle merging Maybe increase indices.memory.index_buffer_size Set replicas to zero (only DURING indexing, right?) Disable warmup
  3. Do you really need your _all field? _source field & stored ??? Reduce analysis Field norms Term frequencies & positions Not_analyzed is your friend Dynamic mapping is for playtime, not production
  4. No Scoring Filter results can be cached Most Simple filters are cached, but not all (geo) Compound filters are not cached Expicitly control cache with _cache Bool filters query the cache for sub-filters, but and/or/not don‘t Moving Target Consider the scope -> filtered query probably? Filter applied after query, but not in „filtered query“!
  5. Regular Queries query first, filter afterwards Filtered query filters first Elements of Bool filters are executed sequentially Place most restrictive filter first Accelerator filter Additional filter on general terms Better for caching Reduce Work for heavyweight filters
  6. Pagination Don‘t load too many results at once Avoid deep pagination Index-time vs. Query time optimizations: Try to do prework during index time E.g. Prefix Query vs. Edge Ngram Warmup for „common queries“ Turn on the slow log Use multi-search if applicable
  7. Load lazy as much as possible Hide lesser needed ones Only load once during pagination
  8. For example sorting Filed data stored in RAM Expensive for the JVM, Garbage Collection Issues OS File System cache can take care of that Slightly slower Test them!
  9. Update is a delete + add Partial updates still read the whole document Even „small“ updates can be expensive
  10. Sequential Ids allow optimized storage (binary stored) Javas UUID is truly random Internally Elasticsearch uses FlakeIDs
  11. Multiple Shards allow for paralell writes Multiple Replicas allow parallel reads Indexing more expensive Safety Sharding makes reads slower Accurate scoring round trip Second round trip for the search Reduce step Third roundtrip to retrieve final set of documents 2 Rules of distributed Search: Distributed Search is expensive! Searching multiple indexes is the same as searching multiple shards
  12. Only works for isolated „chunks“ of Data in the same index Maybe „Users“ Routing key overrides shard key Popular Example UserID Multipe users will share a shard Shards will be different in size Alternative: Aliases Move out Large users to new index Have alias point to all indexes Drawback: Cluster state will become big, high network impact
  13. Use existent client librarys If Java, prefer NodeClient Alternative Transport Client Http Long lived connections Check http chunking
  14. Maximum Number of File Descriptors Avoid Swapping ES_HEAP_SIZE (Xms = Xmx) Leave enough memory to the OS ½ memory to ES Not more than 32GB If using doc values, a few GB should be enough Use concurrent GC Default is CMS, maybe try G1 Check your Java Version Avoid virtualisation Noisy Neighbours Storage Use local Use SSD RAID 0