SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
WHEN THE CEPH HITS THE FAN
Dr. Wolfgang Schulze
Director Global Storage Consulting
Practice
Red Hat
October 20, 2016
CAN THE CEPH EVEN HIT THE FAN?
2
•  A"er	all…		
	
•  Architecture	has	no	single	point	of	failure	
•  Code	base	is	very	solid	and	had	many	years	to	mature	
•  Designed	from	the	ground	up	to	accommodate	for	failures	
•  Supposed	to	be	self-healing	and	self-managing	
•  It	simplifies	day-to-day	data	center	opera?ons
WHAT IS “HITTING THE FAN”, ANYWAYS?
3
•  Example	scenarios:	
•  Heavy	storm	takes	out	data	center,	cluster	fails	to	restart	automa?cally	
•  Increased	work	load	makes	cluster	unstable	
•  Performance	is	fine	when	cluster	is	empty	to	moderately	filled,	but	when	
when	geHng	close	physical	capacity,	write	performance	drops	
•  Nearly	full	cluster	has	become	unresponsive	and	denies	writes	
•  Bulk	dele?on	of	objects	takes	so	long	that	the	client	applica?on	?mes	out		
•  Rebalancing	a"er	a	par?al	electric	outage	impacts	clients	with	slow/
blocked	requests		
•  Result	in	each	case:	customer	files		
•  Sev	1:	Produc?on	is	down	
•  Sev	2:	Produc?on	is	impacted
TICKET QUEUE IN RED HAT SUPPORT
4
Real	screenshot,	dated	2016-10-19	
	
Customer	names	removed	
	
	
Many	of	these	,ckets	could	have		
been	avoided	if	best	prac,ces	had		
been	followed
A SAD, BUT TRUE STORY
5
•  Customer	bought	Red	Hat	Ceph	Storage	subscrip?ons	
•  They	were	sure	they	had	enough	experience	on	their	team	and	specifically	
declined	offers	for	training	and	consul?ng	
•  They	designed	and	deployed	Ceph	cluster	without	guidance	
•  Originally	for	feasibility	study,	but	everything	seemed	to	work	fine,		
so	they	put	it	into	produc?on	
•  Nobody	no?ced	that	the	journal	size	was	configured	to	only	100MB	instead	
of	best	prac?ce	size	of	5GB	
•  A	couple	of	months	later	a"er	a	power	failure,	the	Ceph	cluster	failed	to	recover	
•  Support	?cket	went	on	for	several	weeks,	at	the	end	some	permanent	data	loss	
•  End	result:	Par?al	data	loss,	unhappy	management,	unhappy	customers
SOME COMMON MISCONCEPTIONS
6
•  The	new	tools	make	Ceph	easy	to	set	up		
•  You	don’t	need	detailed	planning	or	architecture	design	
•  Ceph	works	on	any	hardware,	and	you	can	mix	&	match	hardware	
•  Storage	infrastructure	people	will	know	how	to	handle	the	product	
•  Server	people	will	know	how	to	handle	the	product	
•  Ceph	community	bits	are	just	fine	(“We	use	a	stable	release”)	
•  Using	community	bits	is	more	“cuHng	edge”
COMMON TROUBLE #1
UPSTREAM BITS FOR PRODUCTION SYSTEMS
7
Observa,on	
•  User	is	running	upstream	bits	
•  This	happens	even	with	users	who	are	paying	for	a	Red	Hat	Support	subscrip?on	
•  People	misinterpret	the	phrase	“stable	release”	in	community	release	notes	
Problem	
•  Red	Hat	Support	won’t	be	able	to	help	
•  Red	Hat	only	supports	long	term	stable	releases	
•  What	could	be	a	safe	and	fully	documented	upgrade	to	a	newer	LTS	version	
suddenly	becomes	a	“migra?on”	with	risks	and	piealls	
	
Mi,ga,on	
•  Use	supported	bits,	stay	informed	about	roadmap,	get	involved
COMMON TROUBLE #2
USE OF UNSUPPORTED FEATURES
8
Observa,on	
•  User	deploys	system	into	produc?on	using	features	which	are	not	(yet)	supported	
•  Examples:	CephFS,	BlueStore	
Problem	
•  Red	Hat	Support	won’t	be	able	to	help	
•  Unless	you	have	a	support	excep?on,	the	conversa?on	may	end	quickly	
•  Red	Hat	Engineering	will	not	build	hot	fixes	for	you	
	
Mi,ga,on	
•  Try	to	get	a	support	excep?on	from	Red	Hat	
•  Don’t	use	the	feature
COMMON TROUBLE #3
USE OF UNSUPPORTED CONFIGURATIONS
9
Observa,on	
•  User	deploy	Ceph	in	a	way	that	is	not	approved	and	has	not	been	tested		
•  Examples:		
•  Running	Ceph	on	unsupported	Opera?ng	System	versions	(e.g.	GenToo,	Debian)	
•  Deploying		
Problem	
•  Red	Hat	Support	won’t	be	able	to	help	
•  Unless	you	have	a	support	excep?on,	the	conversa?on	may	end	quickly	
•  Red	Hat	Engineering	will	not	build	hot	fixes	for	you	
Mi,ga,on	
•  Read	documenta?on,	consider	health	check	before	go-live
COMMON TROUBLE #4
POORLY MANAGED CLUSTER GROWTH
10
Observa,on	
•  Adding	disks	(or	even	en?re	nodes)	to	clusters	of	rela?vely	small	total	capacity	
•  Backfill/recovery	starves	client	I/O	
Problem	
•  In	older	versions	of	Ceph,	default	configura?on	values	are	not	ideal	for	this	
(osd_max_backlls,	osd_recovery_max_ac?ve,	osd_recovery_op_priority)		
•  If	you	fail	to	adjust	these	before	you	change	the	physical	configura?on,	you	will	
indeed	have	huge	impact		
Mi,ga,on	
•  Know	your	stuff,	think	ahead,	es?mate	impact,	gradually	weigh	in
COMMON TROUBLE #5
POOR SKILLS AND OPERATIONAL PRACTICES
11
Observa,ons	
•  Subject	majer	experts	who	brought	Ceph	to	the	organiza?on	were	hired	guns,	
or	employees	who	have	since	le"	
•  Team	that	ends	up	managing	cluster	considers	it	some	sort	of	black	art	
Problem	
•  Operators	who	don’t	know	what	they	are	doing	put	your	data	at	risk	
•  The	built-in	safety/durability	may	be	compromised	
Mi,ga,on	
•  Make	sure	users	receive	proper	training,	and	avoid	staff	SPOF	
•  Conduct	controlled	emergency	drills	to	prac?ce	for	outages	
•  Maintain	separate	cluster	with	same	version	for	experiments	and	dry	run,		
or	learn	how	to	do	it	with	a	cloud	based	environment
COMMON TROUBLE #6
RISKY CONFIGURATION CHOICES
12
Observa,ons	
•  Users	read	somewhere	that	moun?ng	XFS	OSD’s	with	the	‘nobarrier’	op?on	
will	result	in	performance	gains	
	
Problem	
•  While	the	performance	gets	no?ceably	bejer,	you	are	introducing	a	risk	for	
data	corrup?on	during	power	outages	
•  The	built-in	safety/durability	may	be	compromised	
Mi,ga,on	
•  Do	not	use	‘nobarrier’	mount	op?on	unless	you	understand	fully	what	
hardware	you	have,	and	unless	you	know	what	you	are	doing
COMMON TROUBLE #7
POOR NETWORK CONFIGURATION
13
Observa,ons	
•  Users	don’t	pay	enough	ajen?on	to	network	configura?on	
•  Network	inconsistencies	(e.g.	Jumbo	Frames)	and	bojlenecks	go	undetected	
…un?l	Ceph	performs	poorly.		
	
Problem	
•  Troubleshoo?ng	networking	issues	is	difficult	and	experts	hard	to	find	
•  Ceph	heavily	relies	on	proper	configura?on	
Mi,ga,on	
•  Invest	in	your	team	and	network	maintenance	skills
WHAT TO DO WHEN THINGS WENT WRONG
14
1.  Stay	calm	and	don’t	make	it	worse!	
•  Poorly	skilled	operators	may	turn	a	problem	into	a	catastrophe	
2.  Contact	Red	Hat	Support	immediately	
•  Sev	1	and	Sev	2	issues	are	handled	with	top	priority	
•  Chances	are	that	they	will	be	able	to	help	right	away	and	get	your	cluster	
humming	again	
3.  Contact	your	trusted	Red	Hat	Services	or	Sales	contacts	
•  If	problems	persist	or	you	feel	you	need	extra	help,	you	might	want	to	get	a	
Ceph	expert	from	Red	Hat	Professional	Services
GOOD PRACTICES TO AVOID PROBLEMS
15
1.  Don’t	stumble	into	implementa?on/deployment	without	careful	planning	
•  Capture	and	document	requirements,	do	a	POC,	do	an	actual	design	
•  Engage	experts	early	to	help	with	cluster	design	and	hardware	choices	
2.  Unless	you	love	to	take	risks,	use	supported	bits	
3.  Stay	close	to	the	recommended	reference	architectures	from	Red	Hat	partners	
4.  Make	sure	your	staff	receives	proper	training	
•  Red	Hat	Global	Learning	provides	excellent	training	for	Gluster	and	Ceph	
5.  Plan	for	growth		
6.  Don’t	let	things	linger.	Ceph	does	not	like	it	when	the	cluster	is	90%	full	
7.  Have	an	expert	perform	regular	Storage	Health	Checks	to	detect	problems	while	
they	are	s?ll	small
STORAGE DESIGN CONSULTING
16
•  Specialists	from	Red	Hat	Consul?ng	will	help	planning	your	Ceph	
deployment	
•  Start:	Storage	Discovery	Session	
•  We	can	help	discover	requirements	and	design	a	storage	solu?on	
that	matches	
•  You	will	receive	a	detailed	Storage	Solu,on	architecture	document	
which	will	ar?culate	design	choices	and	lay	out	a	step-by-step	plan	
for	implementa?on
STORAGE HEALTH CHECKS
17
•  Standard	3-day	engagement	done	by	Red	Hat	storage	experts	
•  Comprehensive	top-to-bojom	analysis	of	your	so"ware-defined	storage	plaeorm	
•  Six	focus	areas		
1.  Life	cycle	
2.  Configura?on	
3.  Organiza?on	
4.  Use	Case	
5.  Hardware	
6.  Opera?onal	
•  Clear	read-out	of	issues	
•  Ac?onable	recommenda?ons
POSITIVE NOTE
18
•  I	asked	my	consultants	for	feedback	on	this	presenta?on.		
Here	is	one	comment
19
WHERE TO GO NEXT
RED	HAT	
SUBSCRIPTIONS	
hjps://access.redhat.com/subscrip?on-value		
Evalua?on,	Pre-produc?on,	and	Produc?on	subscrip?ons	available	
CONSULTING	 hjp://www.redhat.com/en/services/consul?ng/storage		
TRAINING	 hjps://www.redhat.com/en/services/training	
TEST	DRIVE	 hjp://red.ht/cephtestdrive		
To engage a Territory Service Manager in your area, ask for a local Red Hat Storage sales professional at:
NORTH AMERICA: 1 (888) REDHAT-1; LATIN AMERICA: 54 (11) 4329-7300; EMEA: 00800 7334 2835
APJ: 65 6490 4200; Brazil: 55 (11) 3529-6000,; Australia: 1800 733 428; New Zealand: 0800 733 428

Weitere ähnliche Inhalte

Was ist angesagt?

DR in the Cloud: Finding the Right Tool for the Job
DR in the Cloud: Finding the Right Tool for the JobDR in the Cloud: Finding the Right Tool for the Job
DR in the Cloud: Finding the Right Tool for the JobHostway|HOSTING
 
Building data intensive applications
Building data intensive applicationsBuilding data intensive applications
Building data intensive applicationsAmit Kejriwal
 
Managing Performance in a Virtual Environment
Managing Performance in a Virtual EnvironmentManaging Performance in a Virtual Environment
Managing Performance in a Virtual EnvironmentSolarWinds
 
Production-ready Software
Production-ready SoftwareProduction-ready Software
Production-ready SoftwareUwe Friedrichsen
 
No stress with state
No stress with stateNo stress with state
No stress with stateUwe Friedrichsen
 
Devoxx 2014 michael_neale
Devoxx 2014 michael_nealeDevoxx 2014 michael_neale
Devoxx 2014 michael_nealeMichael Neale
 
The have no fear guide to virtualizing databases
The have no fear guide to virtualizing databasesThe have no fear guide to virtualizing databases
The have no fear guide to virtualizing databasesSolarWinds
 
Ez performance measurement
Ez performance measurementEz performance measurement
Ez performance measurementGaetano Giunta
 
5 Cloud Migration Experiences Not to Be Repeated
5 Cloud Migration Experiences Not to Be Repeated5 Cloud Migration Experiences Not to Be Repeated
5 Cloud Migration Experiences Not to Be RepeatedHostway|HOSTING
 
The Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup ExperiencesThe Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup Experiencesglbsolutions
 
Visualizing Systems with Statemaps
Visualizing Systems with StatemapsVisualizing Systems with Statemaps
Visualizing Systems with Statemapsbcantrill
 
Scaling apps for the big time
Scaling apps for the big timeScaling apps for the big time
Scaling apps for the big timeproitconsult
 
E2 evc 3-2-1-rule - mikeresseler
E2 evc   3-2-1-rule - mikeresselerE2 evc   3-2-1-rule - mikeresseler
E2 evc 3-2-1-rule - mikeresselerMike Resseler
 

Was ist angesagt? (14)

DR in the Cloud: Finding the Right Tool for the Job
DR in the Cloud: Finding the Right Tool for the JobDR in the Cloud: Finding the Right Tool for the Job
DR in the Cloud: Finding the Right Tool for the Job
 
Building data intensive applications
Building data intensive applicationsBuilding data intensive applications
Building data intensive applications
 
Managing Performance in a Virtual Environment
Managing Performance in a Virtual EnvironmentManaging Performance in a Virtual Environment
Managing Performance in a Virtual Environment
 
Production-ready Software
Production-ready SoftwareProduction-ready Software
Production-ready Software
 
No stress with state
No stress with stateNo stress with state
No stress with state
 
Devoxx 2014 michael_neale
Devoxx 2014 michael_nealeDevoxx 2014 michael_neale
Devoxx 2014 michael_neale
 
Delphix
DelphixDelphix
Delphix
 
The have no fear guide to virtualizing databases
The have no fear guide to virtualizing databasesThe have no fear guide to virtualizing databases
The have no fear guide to virtualizing databases
 
Ez performance measurement
Ez performance measurementEz performance measurement
Ez performance measurement
 
5 Cloud Migration Experiences Not to Be Repeated
5 Cloud Migration Experiences Not to Be Repeated5 Cloud Migration Experiences Not to Be Repeated
5 Cloud Migration Experiences Not to Be Repeated
 
The Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup ExperiencesThe Pensions Trust - VM Backup Experiences
The Pensions Trust - VM Backup Experiences
 
Visualizing Systems with Statemaps
Visualizing Systems with StatemapsVisualizing Systems with Statemaps
Visualizing Systems with Statemaps
 
Scaling apps for the big time
Scaling apps for the big timeScaling apps for the big time
Scaling apps for the big time
 
E2 evc 3-2-1-rule - mikeresseler
E2 evc   3-2-1-rule - mikeresselerE2 evc   3-2-1-rule - mikeresseler
E2 evc 3-2-1-rule - mikeresseler
 

Andere mochten auch

Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red_Hat_Storage
 
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...Red_Hat_Storage
 
Red Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed_Hat_Storage
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red_Hat_Storage
 
Red Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph StorageRed Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph StorageRed_Hat_Storage
 
Red Hat Storage Day Dallas - Why Software-defined Storage Matters
Red Hat Storage Day Dallas - Why Software-defined Storage MattersRed Hat Storage Day Dallas - Why Software-defined Storage Matters
Red Hat Storage Day Dallas - Why Software-defined Storage MattersRed_Hat_Storage
 
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...Red_Hat_Storage
 
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application Red Hat Storage Day Dallas - Gluster Storage in Containerized Application
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application Red_Hat_Storage
 
Red Hat Storage Day New York - Persistent Storage for Containers
Red Hat Storage Day New York - Persistent Storage for ContainersRed Hat Storage Day New York - Persistent Storage for Containers
Red Hat Storage Day New York - Persistent Storage for ContainersRed_Hat_Storage
 
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...Red_Hat_Storage
 
Red Hat Storage Day Boston - Supermicro Super Storage
Red Hat Storage Day Boston - Supermicro Super StorageRed Hat Storage Day Boston - Supermicro Super Storage
Red Hat Storage Day Boston - Supermicro Super StorageRed_Hat_Storage
 
Red Hat Storage Day Boston - Persistent Storage for Containers
Red Hat Storage Day Boston - Persistent Storage for Containers Red Hat Storage Day Boston - Persistent Storage for Containers
Red Hat Storage Day Boston - Persistent Storage for Containers Red_Hat_Storage
 
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...Red_Hat_Storage
 
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...Red_Hat_Storage
 
Red Hat Storage Day Dallas - Storage for OpenShift Containers
Red Hat Storage Day Dallas - Storage for OpenShift Containers Red Hat Storage Day Dallas - Storage for OpenShift Containers
Red Hat Storage Day Dallas - Storage for OpenShift Containers Red_Hat_Storage
 
Red Hat Storage Day Boston - Why Software-defined Storage Matters
Red Hat Storage Day Boston - Why Software-defined Storage MattersRed Hat Storage Day Boston - Why Software-defined Storage Matters
Red Hat Storage Day Boston - Why Software-defined Storage MattersRed_Hat_Storage
 
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...Red_Hat_Storage
 
Red Hat Storage Day New York - Welcome Remarks
Red Hat Storage Day New York - Welcome Remarks Red Hat Storage Day New York - Welcome Remarks
Red Hat Storage Day New York - Welcome Remarks Red_Hat_Storage
 
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...Red_Hat_Storage
 
Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...
Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...
Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...Odinot Stanislas
 

Andere mochten auch (20)

Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance
 
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
 
Red Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference Architectures
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
 
Red Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph StorageRed Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph Storage
 
Red Hat Storage Day Dallas - Why Software-defined Storage Matters
Red Hat Storage Day Dallas - Why Software-defined Storage MattersRed Hat Storage Day Dallas - Why Software-defined Storage Matters
Red Hat Storage Day Dallas - Why Software-defined Storage Matters
 
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...
 
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application Red Hat Storage Day Dallas - Gluster Storage in Containerized Application
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application
 
Red Hat Storage Day New York - Persistent Storage for Containers
Red Hat Storage Day New York - Persistent Storage for ContainersRed Hat Storage Day New York - Persistent Storage for Containers
Red Hat Storage Day New York - Persistent Storage for Containers
 
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...
 
Red Hat Storage Day Boston - Supermicro Super Storage
Red Hat Storage Day Boston - Supermicro Super StorageRed Hat Storage Day Boston - Supermicro Super Storage
Red Hat Storage Day Boston - Supermicro Super Storage
 
Red Hat Storage Day Boston - Persistent Storage for Containers
Red Hat Storage Day Boston - Persistent Storage for Containers Red Hat Storage Day Boston - Persistent Storage for Containers
Red Hat Storage Day Boston - Persistent Storage for Containers
 
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...
 
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...
 
Red Hat Storage Day Dallas - Storage for OpenShift Containers
Red Hat Storage Day Dallas - Storage for OpenShift Containers Red Hat Storage Day Dallas - Storage for OpenShift Containers
Red Hat Storage Day Dallas - Storage for OpenShift Containers
 
Red Hat Storage Day Boston - Why Software-defined Storage Matters
Red Hat Storage Day Boston - Why Software-defined Storage MattersRed Hat Storage Day Boston - Why Software-defined Storage Matters
Red Hat Storage Day Boston - Why Software-defined Storage Matters
 
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...
 
Red Hat Storage Day New York - Welcome Remarks
Red Hat Storage Day New York - Welcome Remarks Red Hat Storage Day New York - Welcome Remarks
Red Hat Storage Day New York - Welcome Remarks
 
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...
 
Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...
Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...
Ceph: Open Source Storage Software Optimizations on IntelÂŽ Architecture for C...
 

Ähnlich wie When the Ceph Hits the Fan: Common Issues and Best Practices

Disaster Recovery Plans for Apache Kafka
Disaster Recovery Plans for Apache KafkaDisaster Recovery Plans for Apache Kafka
Disaster Recovery Plans for Apache Kafkaconfluent
 
Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...
Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...
Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...confluent
 
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17Gwen (Chen) Shapira
 
Sample Solution Blueprint
Sample Solution BlueprintSample Solution Blueprint
Sample Solution BlueprintMike Alvarado
 
Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?
Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?
Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?Clustrix
 
Capgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch governmentCapgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch governmentElasticsearch
 
Citrix XenDesktop: Dealing with Failure - SYN408
Citrix XenDesktop: Dealing with Failure - SYN408Citrix XenDesktop: Dealing with Failure - SYN408
Citrix XenDesktop: Dealing with Failure - SYN408Tom Gamull
 
IBM MQ High Availabillity and Disaster Recovery (2017 version)
IBM MQ High Availabillity and Disaster Recovery (2017 version)IBM MQ High Availabillity and Disaster Recovery (2017 version)
IBM MQ High Availabillity and Disaster Recovery (2017 version)MarkTaylorIBM
 
Open west 2015 talk ben coverston
Open west 2015 talk ben coverstonOpen west 2015 talk ben coverston
Open west 2015 talk ben coverstonbcoverston
 
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...VMworld
 
Performance Optimization of Cloud Based Applications by Peter Smith, ACL
Performance Optimization of Cloud Based Applications by Peter Smith, ACLPerformance Optimization of Cloud Based Applications by Peter Smith, ACL
Performance Optimization of Cloud Based Applications by Peter Smith, ACLTriNimbus
 
ContainerConf 2022: Kubernetes is awesome - but...
ContainerConf 2022: Kubernetes is awesome - but...ContainerConf 2022: Kubernetes is awesome - but...
ContainerConf 2022: Kubernetes is awesome - but...Nico Meisenzahl
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailInternet World
 
OpenStack at EBSCO
OpenStack at EBSCOOpenStack at EBSCO
OpenStack at EBSCOTesora
 
VMworld 2014: Extreme Performance Series
VMworld 2014: Extreme Performance Series VMworld 2014: Extreme Performance Series
VMworld 2014: Extreme Performance Series VMworld
 
DOES SFO 2016 - Chris Fulton - CD for DBs
DOES SFO 2016 - Chris Fulton - CD for DBsDOES SFO 2016 - Chris Fulton - CD for DBs
DOES SFO 2016 - Chris Fulton - CD for DBsGene Kim
 
Scaling SQL Write-Master Database Clusters With Redis Labs: Erik Brandsberg
Scaling SQL Write-Master Database Clusters With Redis Labs: Erik BrandsbergScaling SQL Write-Master Database Clusters With Redis Labs: Erik Brandsberg
Scaling SQL Write-Master Database Clusters With Redis Labs: Erik BrandsbergRedis Labs
 
Cloud Love Conference: Kubernetes is awesome, but...
Cloud Love Conference: Kubernetes is awesome, but...Cloud Love Conference: Kubernetes is awesome, but...
Cloud Love Conference: Kubernetes is awesome, but...Nico Meisenzahl
 
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase Create
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase CreateWebinar: Overcoming the Storage Challenges Cassandra and Couchbase Create
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase CreateStorage Switzerland
 

Ähnlich wie When the Ceph Hits the Fan: Common Issues and Best Practices (20)

Disaster Recovery Plans for Apache Kafka
Disaster Recovery Plans for Apache KafkaDisaster Recovery Plans for Apache Kafka
Disaster Recovery Plans for Apache Kafka
 
Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...
Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...
Kafka Summit SF 2017 - One Data Center is Not Enough: Scaling Apache Kafka Ac...
 
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17
 
Sample Solution Blueprint
Sample Solution BlueprintSample Solution Blueprint
Sample Solution Blueprint
 
Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?
Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?
Tech Talk Series, Part 3: Why is your CFO right to demand you scale down MySQL?
 
Capgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch governmentCapgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch government
 
Citrix XenDesktop: Dealing with Failure - SYN408
Citrix XenDesktop: Dealing with Failure - SYN408Citrix XenDesktop: Dealing with Failure - SYN408
Citrix XenDesktop: Dealing with Failure - SYN408
 
IBM MQ High Availabillity and Disaster Recovery (2017 version)
IBM MQ High Availabillity and Disaster Recovery (2017 version)IBM MQ High Availabillity and Disaster Recovery (2017 version)
IBM MQ High Availabillity and Disaster Recovery (2017 version)
 
Open west 2015 talk ben coverston
Open west 2015 talk ben coverstonOpen west 2015 talk ben coverston
Open west 2015 talk ben coverston
 
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
VMworld 2015: vRealize Operations Insight: Manage vSphere and Your Entire Dat...
 
Performance Optimization of Cloud Based Applications by Peter Smith, ACL
Performance Optimization of Cloud Based Applications by Peter Smith, ACLPerformance Optimization of Cloud Based Applications by Peter Smith, ACL
Performance Optimization of Cloud Based Applications by Peter Smith, ACL
 
ContainerConf 2022: Kubernetes is awesome - but...
ContainerConf 2022: Kubernetes is awesome - but...ContainerConf 2022: Kubernetes is awesome - but...
ContainerConf 2022: Kubernetes is awesome - but...
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, Whiptail
 
OpenStack at EBSCO
OpenStack at EBSCOOpenStack at EBSCO
OpenStack at EBSCO
 
VMworld 2014: Extreme Performance Series
VMworld 2014: Extreme Performance Series VMworld 2014: Extreme Performance Series
VMworld 2014: Extreme Performance Series
 
DOES SFO 2016 - Chris Fulton - CD for DBs
DOES SFO 2016 - Chris Fulton - CD for DBsDOES SFO 2016 - Chris Fulton - CD for DBs
DOES SFO 2016 - Chris Fulton - CD for DBs
 
Scaling SQL Write-Master Database Clusters With Redis Labs: Erik Brandsberg
Scaling SQL Write-Master Database Clusters With Redis Labs: Erik BrandsbergScaling SQL Write-Master Database Clusters With Redis Labs: Erik Brandsberg
Scaling SQL Write-Master Database Clusters With Redis Labs: Erik Brandsberg
 
Cloud Love Conference: Kubernetes is awesome, but...
Cloud Love Conference: Kubernetes is awesome, but...Cloud Love Conference: Kubernetes is awesome, but...
Cloud Love Conference: Kubernetes is awesome, but...
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase Create
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase CreateWebinar: Overcoming the Storage Challenges Cassandra and Couchbase Create
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase Create
 

Mehr von Red_Hat_Storage

Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph StorageRed Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph StorageRed_Hat_Storage
 
Red Hat Storage Day Seattle: Why Software-Defined Storage Matters
Red Hat Storage Day Seattle: Why Software-Defined Storage MattersRed Hat Storage Day Seattle: Why Software-Defined Storage Matters
Red Hat Storage Day Seattle: Why Software-Defined Storage MattersRed_Hat_Storage
 
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...Red_Hat_Storage
 
Red Hat Storage Day Seattle: Persistent Storage for Containerized Applications
Red Hat Storage Day Seattle: Persistent Storage for Containerized ApplicationsRed Hat Storage Day Seattle: Persistent Storage for Containerized Applications
Red Hat Storage Day Seattle: Persistent Storage for Containerized ApplicationsRed_Hat_Storage
 
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...Red_Hat_Storage
 
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...Red_Hat_Storage
 
Storage: Limitations, Frustrations, and Coping with Future Needs
Storage: Limitations, Frustrations, and Coping with Future NeedsStorage: Limitations, Frustrations, and Coping with Future Needs
Storage: Limitations, Frustrations, and Coping with Future NeedsRed_Hat_Storage
 
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red_Hat_Storage
 
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...Red_Hat_Storage
 
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage MattersRed Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage MattersRed_Hat_Storage
 
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...Red_Hat_Storage
 

Mehr von Red_Hat_Storage (11)

Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph StorageRed Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
 
Red Hat Storage Day Seattle: Why Software-Defined Storage Matters
Red Hat Storage Day Seattle: Why Software-Defined Storage MattersRed Hat Storage Day Seattle: Why Software-Defined Storage Matters
Red Hat Storage Day Seattle: Why Software-Defined Storage Matters
 
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
 
Red Hat Storage Day Seattle: Persistent Storage for Containerized Applications
Red Hat Storage Day Seattle: Persistent Storage for Containerized ApplicationsRed Hat Storage Day Seattle: Persistent Storage for Containerized Applications
Red Hat Storage Day Seattle: Persistent Storage for Containerized Applications
 
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
 
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...
 
Storage: Limitations, Frustrations, and Coping with Future Needs
Storage: Limitations, Frustrations, and Coping with Future NeedsStorage: Limitations, Frustrations, and Coping with Future Needs
Storage: Limitations, Frustrations, and Coping with Future Needs
 
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
 
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
 
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage MattersRed Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
 
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...
 

KĂźrzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

KĂźrzlich hochgeladen (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

When the Ceph Hits the Fan: Common Issues and Best Practices

  • 1. WHEN THE CEPH HITS THE FAN Dr. Wolfgang Schulze Director Global Storage Consulting Practice Red Hat October 20, 2016
  • 2. CAN THE CEPH EVEN HIT THE FAN? 2 •  A"er all… •  Architecture has no single point of failure •  Code base is very solid and had many years to mature •  Designed from the ground up to accommodate for failures •  Supposed to be self-healing and self-managing •  It simplies day-to-day data center opera?ons
  • 3. WHAT IS “HITTING THE FAN”, ANYWAYS? 3 •  Example scenarios: •  Heavy storm takes out data center, cluster fails to restart automa?cally •  Increased work load makes cluster unstable •  Performance is ne when cluster is empty to moderately lled, but when when geHng close physical capacity, write performance drops •  Nearly full cluster has become unresponsive and denies writes •  Bulk dele?on of objects takes so long that the client applica?on ?mes out •  Rebalancing a"er a par?al electric outage impacts clients with slow/ blocked requests •  Result in each case: customer les •  Sev 1: Produc?on is down •  Sev 2: Produc?on is impacted
  • 4. TICKET QUEUE IN RED HAT SUPPORT 4 Real screenshot, dated 2016-10-19 Customer names removed Many of these ,ckets could have been avoided if best prac,ces had been followed
  • 5. A SAD, BUT TRUE STORY 5 •  Customer bought Red Hat Ceph Storage subscrip?ons •  They were sure they had enough experience on their team and specically declined offers for training and consul?ng •  They designed and deployed Ceph cluster without guidance •  Originally for feasibility study, but everything seemed to work ne, so they put it into produc?on •  Nobody no?ced that the journal size was congured to only 100MB instead of best prac?ce size of 5GB •  A couple of months later a"er a power failure, the Ceph cluster failed to recover •  Support ?cket went on for several weeks, at the end some permanent data loss •  End result: Par?al data loss, unhappy management, unhappy customers
  • 6. SOME COMMON MISCONCEPTIONS 6 •  The new tools make Ceph easy to set up •  You don’t need detailed planning or architecture design •  Ceph works on any hardware, and you can mix & match hardware •  Storage infrastructure people will know how to handle the product •  Server people will know how to handle the product •  Ceph community bits are just ne (“We use a stable release”) •  Using community bits is more “cuHng edge”
  • 7. COMMON TROUBLE #1 UPSTREAM BITS FOR PRODUCTION SYSTEMS 7 Observa,on •  User is running upstream bits •  This happens even with users who are paying for a Red Hat Support subscrip?on •  People misinterpret the phrase “stable release” in community release notes Problem •  Red Hat Support won’t be able to help •  Red Hat only supports long term stable releases •  What could be a safe and fully documented upgrade to a newer LTS version suddenly becomes a “migra?on” with risks and piealls Mi,ga,on •  Use supported bits, stay informed about roadmap, get involved
  • 8. COMMON TROUBLE #2 USE OF UNSUPPORTED FEATURES 8 Observa,on •  User deploys system into produc?on using features which are not (yet) supported •  Examples: CephFS, BlueStore Problem •  Red Hat Support won’t be able to help •  Unless you have a support excep?on, the conversa?on may end quickly •  Red Hat Engineering will not build hot xes for you Mi,ga,on •  Try to get a support excep?on from Red Hat •  Don’t use the feature
  • 9. COMMON TROUBLE #3 USE OF UNSUPPORTED CONFIGURATIONS 9 Observa,on •  User deploy Ceph in a way that is not approved and has not been tested •  Examples: •  Running Ceph on unsupported Opera?ng System versions (e.g. GenToo, Debian) •  Deploying Problem •  Red Hat Support won’t be able to help •  Unless you have a support excep?on, the conversa?on may end quickly •  Red Hat Engineering will not build hot xes for you Mi,ga,on •  Read documenta?on, consider health check before go-live
  • 10. COMMON TROUBLE #4 POORLY MANAGED CLUSTER GROWTH 10 Observa,on •  Adding disks (or even en?re nodes) to clusters of rela?vely small total capacity •  Backll/recovery starves client I/O Problem •  In older versions of Ceph, default congura?on values are not ideal for this (osd_max_backlls, osd_recovery_max_ac?ve, osd_recovery_op_priority) •  If you fail to adjust these before you change the physical congura?on, you will indeed have huge impact Mi,ga,on •  Know your stuff, think ahead, es?mate impact, gradually weigh in
  • 11. COMMON TROUBLE #5 POOR SKILLS AND OPERATIONAL PRACTICES 11 Observa,ons •  Subject majer experts who brought Ceph to the organiza?on were hired guns, or employees who have since le" •  Team that ends up managing cluster considers it some sort of black art Problem •  Operators who don’t know what they are doing put your data at risk •  The built-in safety/durability may be compromised Mi,ga,on •  Make sure users receive proper training, and avoid staff SPOF •  Conduct controlled emergency drills to prac?ce for outages •  Maintain separate cluster with same version for experiments and dry run, or learn how to do it with a cloud based environment
  • 12. COMMON TROUBLE #6 RISKY CONFIGURATION CHOICES 12 Observa,ons •  Users read somewhere that moun?ng XFS OSD’s with the ‘nobarrier’ op?on will result in performance gains Problem •  While the performance gets no?ceably bejer, you are introducing a risk for data corrup?on during power outages •  The built-in safety/durability may be compromised Mi,ga,on •  Do not use ‘nobarrier’ mount op?on unless you understand fully what hardware you have, and unless you know what you are doing
  • 13. COMMON TROUBLE #7 POOR NETWORK CONFIGURATION 13 Observa,ons •  Users don’t pay enough ajen?on to network congura?on •  Network inconsistencies (e.g. Jumbo Frames) and bojlenecks go undetected …un?l Ceph performs poorly. Problem •  Troubleshoo?ng networking issues is dicult and experts hard to nd •  Ceph heavily relies on proper congura?on Mi,ga,on •  Invest in your team and network maintenance skills
  • 14. WHAT TO DO WHEN THINGS WENT WRONG 14 1.  Stay calm and don’t make it worse! •  Poorly skilled operators may turn a problem into a catastrophe 2.  Contact Red Hat Support immediately •  Sev 1 and Sev 2 issues are handled with top priority •  Chances are that they will be able to help right away and get your cluster humming again 3.  Contact your trusted Red Hat Services or Sales contacts •  If problems persist or you feel you need extra help, you might want to get a Ceph expert from Red Hat Professional Services
  • 15. GOOD PRACTICES TO AVOID PROBLEMS 15 1.  Don’t stumble into implementa?on/deployment without careful planning •  Capture and document requirements, do a POC, do an actual design •  Engage experts early to help with cluster design and hardware choices 2.  Unless you love to take risks, use supported bits 3.  Stay close to the recommended reference architectures from Red Hat partners 4.  Make sure your staff receives proper training •  Red Hat Global Learning provides excellent training for Gluster and Ceph 5.  Plan for growth 6.  Don’t let things linger. Ceph does not like it when the cluster is 90% full 7.  Have an expert perform regular Storage Health Checks to detect problems while they are s?ll small
  • 16. STORAGE DESIGN CONSULTING 16 •  Specialists from Red Hat Consul?ng will help planning your Ceph deployment •  Start: Storage Discovery Session •  We can help discover requirements and design a storage solu?on that matches •  You will receive a detailed Storage Solu,on architecture document which will ar?culate design choices and lay out a step-by-step plan for implementa?on
  • 17. STORAGE HEALTH CHECKS 17 •  Standard 3-day engagement done by Red Hat storage experts •  Comprehensive top-to-bojom analysis of your so"ware-dened storage plaeorm •  Six focus areas 1.  Life cycle 2.  Congura?on 3.  Organiza?on 4.  Use Case 5.  Hardware 6.  Opera?onal •  Clear read-out of issues •  Ac?onable recommenda?ons
  • 19. 19 WHERE TO GO NEXT RED HAT SUBSCRIPTIONS hjps://access.redhat.com/subscrip?on-value Evalua?on, Pre-produc?on, and Produc?on subscrip?ons available CONSULTING hjp://www.redhat.com/en/services/consul?ng/storage TRAINING hjps://www.redhat.com/en/services/training TEST DRIVE hjp://red.ht/cephtestdrive To engage a Territory Service Manager in your area, ask for a local Red Hat Storage sales professional at: NORTH AMERICA: 1 (888) REDHAT-1; LATIN AMERICA: 54 (11) 4329-7300; EMEA: 00800 7334 2835 APJ: 65 6490 4200; Brazil: 55 (11) 3529-6000,; Australia: 1800 733 428; New Zealand: 0800 733 428