Logging, Metrics and Monitoring 
as a Service 
Lakshminarayanan (LN) Renganarayana 
Architect, Cloud Platform Engineering 
LMM, Cloud Platform Engineering Nov 4, 2014 1
The Symantec Team 
• Cloud Platform Engineering 
– We are building a consolidated cloud platform that provides infrastructure and platform services for next-generation Symantec products and services
– Starting small, but scaling to tens of thousands of nodes across multiple data centers
• Me 
– Architect / Lead for LMM in Cloud Platform Engineering 
– Love to build and break systems 
– Previous life: Research Staff Member, IBM Research 
– Automatic parallelization, Compilers, Deployment Automation (Heat)
Context for LMM: Symantec Cloud Platform 
[Diagram: the Symantec Cloud Platform stack]
• Layers, top to bottom: Symantec Services & Apps; Platform Services (BP, SP, KV, OBS); IaaS (OpenStack); Bare Metal
• Common Services: LMM, Identity Manager, CI / CD
LMM: Goals we started with …
• Design for both Developers and Ops 
• Multi Tenant, Elastic Scalability 
• One system for logs, metrics, monitoring, & other events 
• Designed for use both under the cloud (infrastructure) and over the cloud (applications)
• Built using open source tools, with the aim of open sourcing
Guiding principle 
Make it extremely simple to generate logs & metrics 
Provide powerful query and visualizations for free 
Generate with one line: logger.warn("WARN KVaaS too many failed writes")
Query with all the power of Elasticsearch and visualize with Kibana
Generate with one line: statsd_client.incr('myapp_num_requests')
Query with all the power of InfluxDB and visualize with Grafana
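As a rough illustration of the two one-liners above, here is a minimal Python sketch using only the standard library. It is not LMM's client code: the logger name, host, port and helper names are assumptions made for this example. The only fixed detail is the StatsD counter line format, `<name>:<value>|c`, sent over UDP. Python's `logging` module spells the call `warning`, since `warn` is deprecated there.

```python
import logging
import socket

# Hypothetical logger name, chosen for this example only.
logger = logging.getLogger("kvaas")


def statsd_counter_packet(name: str, value: int = 1) -> bytes:
    """Encode a counter increment in the StatsD line format: <name>:<value>|c."""
    return f"{name}:{value}|c".encode("ascii")


def incr(name: str, host: str = "127.0.0.1", port: int = 8125) -> None:
    """Send a counter increment over UDP, fire-and-forget.

    Host and port are assumptions; 8125 is the conventional StatsD port.
    """
    try:
        with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
            sock.sendto(statsd_counter_packet(name), (host, port))
    except OSError:
        # Emitting a metric should never break the application.
        pass


if __name__ == "__main__":
    logging.basicConfig(level=logging.WARNING)
    logger.warning("KVaaS too many failed writes")  # one line to generate a log event
    incr("myapp_num_requests")                      # one line to generate a metric
```

The application only emits events. Shipping, storage, query and visualization are left to the service, which is the point of the guiding principle.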
LMM Client Interface 
[Diagram: how clients send data to LMM]
• Sources: App, Platform Services, Infrastructure, VM logs
• Push Metrics: the app writes its metrics directly
• Pull Metrics: metrics exposed via JMX and SNMP vars
• LMM Shipper: ships VM logs
• LMM receives metric and log events and provides an Alerts & Config UI
• Push: StatsD, metrics extension for OpenStack
• Pull: CollectD
• Shipper: logstash, lumberjack
Security and Multi-tenancy 
• User authentication with Keystone for Query API & UI 
• Tenant id and API key used for events sent to LMM 
– Tenant ids from Keystone and API keys generated by LMM 
• Every event is tagged with a tenant id 
– Log events: tenant id as a field 
– Metric events: tenant id prefixed to the metric name 
• Keystone group membership used for sophisticated cross-tenant 
event visibility / correlation 
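The tagging rules on this slide can be sketched in a few lines of Python. This is an illustration under stated assumptions, not LMM's implementation: the field name `tenant_id`, the `.` separator between tenant id and metric name, and the in-memory `api_keys` mapping are all hypothetical choices made for this example.

```python
import hmac
from typing import Dict


def tag_log_event(event: dict, tenant_id: str) -> dict:
    """Return a copy of a log event with the tenant id added as a field."""
    tagged = dict(event)
    tagged["tenant_id"] = tenant_id  # field name is an assumption
    return tagged


def tag_metric_name(name: str, tenant_id: str) -> str:
    """Return the metric name with the tenant id prefixed."""
    return f"{tenant_id}.{name}"  # "." separator is an assumption


def is_authorized(tenant_id: str, api_key: str, api_keys: Dict[str, str]) -> bool:
    """Check a (tenant id, API key) pair against the known keys."""
    expected = api_keys.get(tenant_id)
    # compare_digest avoids leaking key contents through timing differences.
    return expected is not None and hmac.compare_digest(expected, api_key)
```

For example, `tag_metric_name("myapp_num_requests", "t1")` yields `t1.myapp_num_requests`, so a per-tenant query only needs to match on the metric-name prefix.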
LMM Under the Hood
[Architecture diagram; components as labeled]
• Client App / Service with log & metrics shipper, sending log, metric & alert events
• Load Balancer
• Front End Cluster: multi-tenancy and Kibana, Grafana proxies
• Keystone; Quota & Policy
• Kafka cluster: knode1, knode2, knode3 ... knodeN
• Storm cluster (s1, s2, s3, s4 ... sn): Authentication, Validation, Alerts Processing
• Log Store: Elasticsearch nodes
• Metrics Store: InfluxDB nodes
• Redis
• Alerts: email & callbacks
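To make the validation and routing stage concrete, here is a toy Python sketch. It only mirrors the shape suggested by the component labels on this slide (a queue feeding a processing stage that writes to a log store and a metrics store). The event schema, the function names and the in-memory stand-ins for the Kafka cluster and the Elasticsearch and InfluxDB stores are all assumptions for illustration.

```python
from collections import deque

# Hypothetical event schema, assumed for this example only.
REQUIRED_FIELDS = {"tenant_id", "type", "payload"}
KNOWN_TYPES = {"log", "metric"}


def validate(event: dict) -> bool:
    """Accept only events that carry every required field and a known type."""
    return REQUIRED_FIELDS.issubset(event) and event["type"] in KNOWN_TYPES


def drain(queue: deque, log_store: list, metrics_store: list) -> int:
    """Consume queued events, route valid ones to a store, and count rejects."""
    rejected = 0
    while queue:
        event = queue.popleft()
        if not validate(event):
            rejected += 1
        elif event["type"] == "log":
            log_store.append(event)      # stand-in for the log store
        else:
            metrics_store.append(event)  # stand-in for the metrics store
    return rejected
```

Keeping a queue between the front end and the processing stage lets ingestion absorb bursts while the processors catch up, which is the usual reason to place a message queue ahead of stream processing.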
Current deployment 
• Multiple deployments: on bare KVM nodes, on OpenStack VMs
– On KVM nodes: 40+ VMs, 80+ TB storage, many large-memory nodes
– All components are deployed in clustered mode for HA
– Some with active/active replication, some with active/passive
• Used by Platform and Infrastructure Services
– Hundreds of thousands of events per second
– A few TB of data collected and indexed per day
– Queries currently come from Kibana and Grafana; in the future, also from APIs
In the Next Iteration … 
• Support health checks
– Exploring Sensu and related technologies 
• Multi data center deployment 
• Refactor to a data pipeline architecture 
• Deployment automation using Murano 
• Open Source LMM 
Thank you! 
Questions, Comments, Suggestions? 
We are interested in open sourcing LMM 
and are looking for collaborators … 
Interested? Feel free to contact me 
LN_Renganarayana@Symantec.com 

A Scalable Multi-tenant Logging, Metrics and Monitoring as Service for OpenStack and Cloud Applications


Editor's Notes

  1. You might remember: build for devs and ops