SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Elasticsearch for reporting
analytics on communities
Elasticsearch Meetup - 23 April 2014
Marc Harrison
Lithium makes software that helps brands
better connect with their customers
Our social software helps companies respond on social networks and build
trusted content on a community they own.
Empower brands to distill terabytes of daily
data into understanding participation
▪ fast
▪ flexible
▪ scalable
What
products/services
are generating the
most conversations?
Who is authoring
content that
generates the most
kudos/likes?
Are customer posts
getting timely
replies?
What types of
content does this
audience segment
look for?
Lithium Social Intelligence (LSI)
Cluster specs
▪ One of our clusters – elastic search 1.0
• 7+ billion documents/4.4+ TB and growing fast!
• 21 nodes (3 masters, 2 clients, 16 data)
Lessons learned
▪ Bulk loading
▪ Faceting
Bulk initial load / rebuild of data
Hadoop
mysql stream
Transform/
route
…
JSON Elasticsearch
Bulk loading
▪ Make sure ingest logic is robust
• Idempotent for bulk reply - ‘_id’
• Include revision based on processor/time
• Check cluster/index status to make sure ready to ingest
▪ Know the cache and thread pool sizes
• Bulk – fixed - # of processors - queue size 50
• Handle back off and retry
▪ How many docs?
• Like capacity - test with data –
• number of shards
• index.refresh_interval: 30s
• indices.memory.index_buffer_size: 5%
• indices.memory.*
• index.translog.*
Search - time series pattern for scale
Faceting
▪ Don't forget about memory!
• Strings - not_analyzed
• Numbers long vs int, double vs float, etc
• Do you need seconds/minutes when faceting?
• fielddata format - doc_values (1.0)
• Admin API’s allow checking field data size + evictions
• indices.cache.filter.size: 15%
• indices.fielddata.cache.size: 45%
Faceting II
▪ Accuracy
• shard_size
• Number of shards
• Cardinality
• Routing
▪ Great custom plugin
framework
• Uniques
• Array faceting
Impact
▪ Order of magnitude improvement
▪ Developers able to focus on improving insights
▪ community + elasticsearch + hadoop + horton works =
exciting
Select settings (data center)
• bootstrap.mlockall: true
• cluster.routing.allocation.disk.threshold_enabled: true
• http.compression: true
• transport.tcp.compress: true
• gateway.recover_after_data_nodes: 13
• gateway.recover_after_master_nodes: 2
• gateway.recover_after_time: 3m
• gateway.expected_nodes: 17
• indices.memory.index_buffer_size: 5%
• indices.cache.filter.size: 15%
• indices.fielddata.cache.size: 45%
• index.store.type: mmapfs
• index.translog.flush_threshold_ops: 10000
• action.auto_create_index: false
• action.disable_delete_all_indices: true
• cluster.routing.allocation.node_initial_primaries_recoveries: 4
• cluster.routing.allocation.node_concurrent_recoveries: 15
• indices.recovery.max_bytes_per_sec: 100mb
• indices.recovery.concurrent_streams: 5
• discovery.zen.minimum_master_nodes: 2
• index.search.slowlog.threshold.query.warn: 5s
• index.search.slowlog.threshold.query.info: 1s
• index.indexing.slowlog.threshold.index.warn: 5s
• plugin.mandatory: lithium-unique-facets
Questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Session #2, tech session: Build realtime search by Sylvain Utard from Algolia
Session #2, tech session: Build realtime search by Sylvain Utard from AlgoliaSession #2, tech session: Build realtime search by Sylvain Utard from Algolia
Session #2, tech session: Build realtime search by Sylvain Utard from Algolia
 
Practical Use of a NoSQL
Practical Use of a NoSQLPractical Use of a NoSQL
Practical Use of a NoSQL
 
GOTO Aarhus 2014: Making Enterprise Data Available in Real Time with elastics...
GOTO Aarhus 2014: Making Enterprise Data Available in Real Time with elastics...GOTO Aarhus 2014: Making Enterprise Data Available in Real Time with elastics...
GOTO Aarhus 2014: Making Enterprise Data Available in Real Time with elastics...
 
Next Generation Data Platforms - Deon Thomas
Next Generation Data Platforms - Deon ThomasNext Generation Data Platforms - Deon Thomas
Next Generation Data Platforms - Deon Thomas
 
MongoDB & Hadoop - Understanding Your Big Data
MongoDB & Hadoop - Understanding Your Big DataMongoDB & Hadoop - Understanding Your Big Data
MongoDB & Hadoop - Understanding Your Big Data
 
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
 
Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016
 
MongoDB meetup at Hike
MongoDB meetup at HikeMongoDB meetup at Hike
MongoDB meetup at Hike
 
2007 iPres Beijing - MIXED: Preservation by migration to XML
2007 iPres Beijing - MIXED: Preservation by migration to XML2007 iPres Beijing - MIXED: Preservation by migration to XML
2007 iPres Beijing - MIXED: Preservation by migration to XML
 
NoSQL for SQL Users
NoSQL for SQL UsersNoSQL for SQL Users
NoSQL for SQL Users
 
Practical Use of a NoSQL Database
Practical Use of a NoSQL DatabasePractical Use of a NoSQL Database
Practical Use of a NoSQL Database
 
Building tiered data stores using aesop to bridge sql and no sql systems
Building tiered data stores using aesop to bridge sql and no sql systemsBuilding tiered data stores using aesop to bridge sql and no sql systems
Building tiered data stores using aesop to bridge sql and no sql systems
 
Redis & MongoDB: Stop Big Data Indigestion Before It Starts
Redis & MongoDB: Stop Big Data Indigestion Before It StartsRedis & MongoDB: Stop Big Data Indigestion Before It Starts
Redis & MongoDB: Stop Big Data Indigestion Before It Starts
 
Enterprise Presto PaaS offering in Google Cloud
Enterprise Presto PaaS offering in Google Cloud Enterprise Presto PaaS offering in Google Cloud
Enterprise Presto PaaS offering in Google Cloud
 
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Sparkhbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
 
Basic Introduction to Crate @ ViennaDB Meetup
Basic Introduction to Crate @ ViennaDB MeetupBasic Introduction to Crate @ ViennaDB Meetup
Basic Introduction to Crate @ ViennaDB Meetup
 
Capacity Planning For Your Growing MongoDB Cluster
Capacity Planning For Your Growing MongoDB ClusterCapacity Planning For Your Growing MongoDB Cluster
Capacity Planning For Your Growing MongoDB Cluster
 
Distributed Query Service Powered By Presto & Alluxio Across Clouds @Walmart...
 Distributed Query Service Powered By Presto & Alluxio Across Clouds @Walmart... Distributed Query Service Powered By Presto & Alluxio Across Clouds @Walmart...
Distributed Query Service Powered By Presto & Alluxio Across Clouds @Walmart...
 
Aesop change data propagation
Aesop change data propagationAesop change data propagation
Aesop change data propagation
 
Webinar: MongoDB and Hadoop - Working Together to provide Business Insights
Webinar: MongoDB and Hadoop - Working Together to provide Business InsightsWebinar: MongoDB and Hadoop - Working Together to provide Business Insights
Webinar: MongoDB and Hadoop - Working Together to provide Business Insights
 

Andere mochten auch

Andere mochten auch (8)

Fostering Community With Social Media - Midwest Newspaper Summit 2010
Fostering Community With Social Media - Midwest Newspaper Summit 2010Fostering Community With Social Media - Midwest Newspaper Summit 2010
Fostering Community With Social Media - Midwest Newspaper Summit 2010
 
How to Grow a Vibrant Community that Delivers Real Business Results
How to Grow a Vibrant Community that Delivers Real Business ResultsHow to Grow a Vibrant Community that Delivers Real Business Results
How to Grow a Vibrant Community that Delivers Real Business Results
 
Digital Innovation That Drives Business
Digital Innovation That Drives BusinessDigital Innovation That Drives Business
Digital Innovation That Drives Business
 
Digital Engagement Journey
Digital Engagement Journey Digital Engagement Journey
Digital Engagement Journey
 
Building, Growing & Sustaining Social Media Communities Keynote Presentation
Building, Growing & Sustaining Social Media Communities Keynote Presentation Building, Growing & Sustaining Social Media Communities Keynote Presentation
Building, Growing & Sustaining Social Media Communities Keynote Presentation
 
Social Media: Growing Brand Awareness
Social Media: Growing Brand AwarenessSocial Media: Growing Brand Awareness
Social Media: Growing Brand Awareness
 
Deep Social: The Next Phase of Social Media
Deep Social: The Next Phase of Social MediaDeep Social: The Next Phase of Social Media
Deep Social: The Next Phase of Social Media
 
Radical Candor: No BS, helping your team create better work.
Radical Candor: No BS, helping your team create better work.Radical Candor: No BS, helping your team create better work.
Radical Candor: No BS, helping your team create better work.
 

Ähnlich wie Elasticsearch meetup final_2014_04

Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Open Analytics
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe Olsen
Christopher Whitaker
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)
Petter Skodvin-Hvammen
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
Joaquin Delgado PhD.
 
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning... RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
S. Diana Hu
 

Ähnlich wie Elasticsearch meetup final_2014_04 (20)

Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe Olsen
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
 
Nisha talagala keynote_inflow_2016
Nisha talagala keynote_inflow_2016Nisha talagala keynote_inflow_2016
Nisha talagala keynote_inflow_2016
 
Engineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platformsEngineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platforms
 
Elasticsearch Introduction at BigData meetup
Elasticsearch Introduction at BigData meetupElasticsearch Introduction at BigData meetup
Elasticsearch Introduction at BigData meetup
 
Elasticsearch - Scalability and Multitenancy
Elasticsearch - Scalability and MultitenancyElasticsearch - Scalability and Multitenancy
Elasticsearch - Scalability and Multitenancy
 
Solving Office 365 Big Challenges using Cassandra + Spark
Solving Office 365 Big Challenges using Cassandra + Spark Solving Office 365 Big Challenges using Cassandra + Spark
Solving Office 365 Big Challenges using Cassandra + Spark
 
Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)
 
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
 
Apache drill
Apache drillApache drill
Apache drill
 
Making Session Stores More Intelligent
Making Session Stores More IntelligentMaking Session Stores More Intelligent
Making Session Stores More Intelligent
 
MongoDB .local Houston 2019: Building an IoT Streaming Analytics Platform to ...
MongoDB .local Houston 2019: Building an IoT Streaming Analytics Platform to ...MongoDB .local Houston 2019: Building an IoT Streaming Analytics Platform to ...
MongoDB .local Houston 2019: Building an IoT Streaming Analytics Platform to ...
 
Building high performance and scalable share point applications
Building high performance and scalable share point applicationsBuilding high performance and scalable share point applications
Building high performance and scalable share point applications
 
10 Big Data Technologies you Didn't Know About
10 Big Data Technologies you Didn't Know About 10 Big Data Technologies you Didn't Know About
10 Big Data Technologies you Didn't Know About
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
 
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning... RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 
ADV Slides: Trends in Streaming Analytics and Message-oriented Middleware
ADV Slides: Trends in Streaming Analytics and Message-oriented MiddlewareADV Slides: Trends in Streaming Analytics and Message-oriented Middleware
ADV Slides: Trends in Streaming Analytics and Message-oriented Middleware
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Kürzlich hochgeladen (20)

Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Elasticsearch meetup final_2014_04

  • 1. Elasticsearch for reporting analytics on communities Elasticsearch Meetup - 23 April 2014 Marc Harrison
  • 2. Lithium makes software that helps brands better connect with their customers Our social software helps companies respond on social networks and build trusted content on a community they own.
  • 3. Empower brands to distill terabytes of daily data into understanding participation ▪ fast ▪ flexible ▪ scalable What products/services are generating the most conversations? Who is authoring content that generates the most kudos/likes? Are customer posts getting timely replies? What types of content does this audience segment look for?
  • 5. Cluster specs ▪ One of our clusters – elastic search 1.0 • 7+ billion documents/4.4+ TB and growing fast! • 21 nodes (3 masters, 2 clients, 16 data)
  • 6. Lessons learned ▪ Bulk loading ▪ Faceting
  • 7. Bulk initial load / rebuild of data Hadoop mysql stream Transform/ route … JSON Elasticsearch
  • 8. Bulk loading ▪ Make sure ingest logic is robust • Idempotent for bulk reply - ‘_id’ • Include revision based on processor/time • Check cluster/index status to make sure ready to ingest ▪ Know the cache and thread pool sizes • Bulk – fixed - # of processors - queue size 50 • Handle back off and retry ▪ How many docs? • Like capacity - test with data – • number of shards • index.refresh_interval: 30s • indices.memory.index_buffer_size: 5% • indices.memory.* • index.translog.*
  • 9. Search - time series pattern for scale
  • 10. Faceting ▪ Don't forget about memory! • Strings - not_analyzed • Numbers long vs int, double vs float, etc • Do you need seconds/minutes when faceting? • fielddata format - doc_values (1.0) • Admin API’s allow checking field data size + evictions • indices.cache.filter.size: 15% • indices.fielddata.cache.size: 45%
  • 11. Faceting II ▪ Accuracy • shard_size • Number of shards • Cardinality • Routing ▪ Great custom plugin framework • Uniques • Array faceting
  • 12. Impact ▪ Order of magnitude improvement ▪ Developers able to focus on improving insights ▪ community + elasticsearch + hadoop + horton works = exciting
  • 13. Select settings (data center) • bootstrap.mlockall: true • cluster.routing.allocation.disk.threshold_enabled: true • http.compression: true • transport.tcp.compress: true • gateway.recover_after_data_nodes: 13 • gateway.recover_after_master_nodes: 2 • gateway.recover_after_time: 3m • gateway.expected_nodes: 17 • indices.memory.index_buffer_size: 5% • indices.cache.filter.size: 15% • indices.fielddata.cache.size: 45% • index.store.type: mmapfs • index.translog.flush_threshold_ops: 10000 • action.auto_create_index: false • action.disable_delete_all_indices: true • cluster.routing.allocation.node_initial_primaries_recoveries: 4 • cluster.routing.allocation.node_concurrent_recoveries: 15 • indices.recovery.max_bytes_per_sec: 100mb • indices.recovery.concurrent_streams: 5 • discovery.zen.minimum_master_nodes: 2 • index.search.slowlog.threshold.query.warn: 5s • index.search.slowlog.threshold.query.info: 1s • index.indexing.slowlog.threshold.index.warn: 5s • plugin.mandatory: lithium-unique-facets