SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Dong	Yu
Distinguished	Scientist	and	Vice	General	Manager	
Tencent AI	Lab
work	was	done	while	@	Microsoft	Research
Joint	work	with	
Morten	Kolbæk,	Zheng-Hua	Tan,	and	Jesper	Jensen
Multi-talker	
Speech	Separation	and	Tracing	with	
Permutation	Invariant	Training
Outline
• Motivation
• Problem	Setup	and	Prior	Arts
• Multi-talker	Speech	Separation
• Experiments
• Conclusion
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 2
Outline
• Motivation
• Problem	Setup	and	Prior	Arts
• Multi-talker	Speech	Separation
• Experiments
• Conclusion
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 3
Frontier	Shift
• Driven	by	demand	from	users	to	interact	with	devices	
without	wearing	or	carrying	a	close-talk	microphone.	
• Many	difficulties	hidden	by	close-talk	microphones	now	
surface:	
• The	energy	of	speech	signal	is	very	low	when	it	reaches	the	
microphones.	
• The	interfering	signals,	such	as	background	noise,	reverberation,	
and	speech	from	other	talkers,	become	so	distinct	that	they	can	no	
longer	be	ignored.
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 4
close-talk	microphone far-field	microphone
reverberation from
surface reflections
additive noise from
other sound sources
source
Channel
distortion
ASR	in	Real	World	Scenarios
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 5
Cocktail	Party	Problem
• Term	coined	by	Cherry
• “One	of	our	most	important	faculties	is	our	ability	to	listen	to,	and	follow,	
one	speaker	in	the	presence	of	others.	This	is	such	a	common	experience	
that	we	may	take	it	for	granted;	we	may	call	it	‘the	cocktail	party	problem’…”	
(Cherry’57)
• Human’s	performance	is	superior	to	machine
• “For	‘cocktail	party’-like	situations…	when	all	voices	are	equally	loud,	speech	
remains	intelligible	for	normal-hearing	listeners even	when	there	are	as	
many	as	six	interfering	talkers”	(Bronkhorst &	Plomp’92)
• Speech	separation	problem
• Separate and	trace audio	streams
• Sometimes	called	speech	enhancement	when	dealing	with	non-speech	
interference
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 6
Is	Speech	Separation	Work	Needed?
• End-to-end	ASR	system	sufficient?
• Current	ASR	techniques	require	huge	amount	of	training	data	that	covers	various	
conditions	to	train	well
• Speech	separation	can	be	used	as	advanced	front-end
• Speech	separation	criterion	can	be	used	as	regularization	to	aid	and	speed	up	
training	of	ASR	systems
• More	applications	than	ASR
• Hearing	aids
• Cochlear	implants
• Noise	reduction	for	mobile	communication
• Audio	information	retrieval
• Using	microphone	array	sufficient?
• Mic-array	alone	is	not	sufficient,	e.g.,	when	at	same	direction
• Many	recordings	are	still	collected	with	single	microphone
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 7
Outline
• Motivation
• Problem	Setup	and	Prior	Arts
• Multi-talker	Speech	Separation
• Experiments
• Conclusion
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 8
Problem	Definition
• Source	speech	streams
• Mixed	speech
• STFT	domain
• Estimate	Mask
• Reconstruct	with	Mask
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 9
•Ill-posed	problem	(#	constraints	<	#	free	params:	
• There	are	an	infinite	number	of	possible	 𝑋" 𝑡, 𝑓 combinations	that	lead	to	
the	same	 𝑌 𝑡, 𝑓
•Solution:
• Learn	from	training	set	to	look	for	hidden	regularities	(complicated	soft	
constraints)
Prior	Arts	Before	Deep	Learning	Era
• Computational	auditory	scene	analysis	(CASA)	
• Use	perceptual	grouping	cues	to	estimate	time-frequency	masks
• Non-negative	matrix	factorization	(NMF)
• Learn	a	set	of	non-negative	bases	during	training
• Estimate	mixing	factors	during	evaluation
• Model	based	approach	such	as	factorial	GMM-HMM	
• Models	the	interaction	between	the	target	and	competing	speech	signals	and	
their	temporal	dynamics
• Spatial	filtering	with	a	microphone	array
• Beamforming:	Extract	target	sound	from	a	specific	spatial	direction
• Independent	component	analysis:	Find	a	demixing matrix	from	multiple	
mixtures	of	sound	sources
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 10
Training	Criteria	for	Deep	Learning
• Ideal	amplitude	mask	(IAM)	𝑀" 𝑡, 𝑓 =
)* +,,
- +,,
• Minimize	mask estimation	error	(two	problems)
• In	silence	segments	 𝑋" 𝑡, 𝑓 = 0 and	 𝑌 𝑡, 𝑓 = 0 → 𝑀" 𝑡, 𝑓 is	not	well	defined
• Smaller	error	on	masks	may	not	lead	to	a	smaller	error	on	magnitude	(which	is	what	
we	care	about)
• Minimize	magnitude estimation	error	(used	in	this	study)
• Magnitude	still	estimated	through	masks:	often	lead	to	better	performance	esp.	
when	training	set	is	small
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 11
Prior	Arts	with	DL:	Speech	+	Others
(many	works,	OSU,	MERL,	CUST,	etc.)
• Basic	Architecture:	mix	of	different	types	of	signals
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 12
Noise/Music/	
Other	Speakers
Est.	Noise/Music/	
Other	Speakers
Prior	Arts	with	DL:	Focus	on	Speech
(many	works,	OSU,	MERL,	CUST,	etc.)
• Basic	Architecture:	mix	of	different	types	of	signals
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 13
Noise/Music/	
Other	Speakers
Est.	Noise/Music/	
Other	Speakers
Speech +	noise
Speech +	music
Specific	speaker	+	other	speakers
Outline
• Motivation
• Problem	Setup	and	Prior	Arts
• Multi-talker	Speech	Separation
• Experiments
• Conclusion
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 14
Multi-Talker	Speech	Separation
• Label	Ambiguity	/	Label	Permutation	Problem
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 15
Speaker	1	à output	1 ?
Speaker	1	à output	2 ?
Solution	1:	Deep	Clustering
(Hershey,	Chen,	Roux,	Watanabe,	2016)
• Learn	a	unit-size	embedding	for	each	time-frequency	bin	
• If	two	bins	belong	to	the	same	speaker	they	are	close	in	the	embedding	
space,	and	father	away	otherwise.	
• Trained	on	a	large	window	of	frames
• Separation	is	done	by	clustering	embedding	space	representations	
(i.e.,	segment	the	bins)
• Shortcomings
• Pipeline	is	complicated
• Each	bin	is	assumed	to	belong	to	one	and	only	one	speaker	à limited	its	
ability	to	combine	with	other	techniques
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 16
Solution	2:	Use	Manually	Defined	Rules
(Weng,	Yu,	Seltzer,	Droppo,	14,15)
• Use	instantaneous	energy	instead	of	speaker	ID	to	assign	labels:	manually
designed	limited	cues
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 17
Low-energy	
speech
High-energy	
speech
Our	Solution:	Permutation	Invariant	Training
(Yu, Kolbæk,	Tan,	Jensen,	16,	17)
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 18
Simple	to	implement
Can	be	easily	extended	to	3-speakers
𝑋0 − 𝑋20
3
+ 𝑋3 − 𝑋23
3
𝑋3 − 𝑋20
3
+ 𝑋0 − 𝑋23
3
Testing
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 19
• Default	assignment:	concatenate	output	s’s	frames	to	form	stream	s
• Optimal	assignment:	output	of	each	frame	is	correctly	assigned	to	speakers.	
Concatenate	frames	belong	to	speaker	s	to	form	stream	s
• Gap	between	them	indicates	the	gain	from	additional	speaker	tracing
Outline
• Motivation
• Problem	Setup	and	Prior	Arts
• Multi-talker	Speech	Separation
• Experiments
• Conclusion
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 20
Experiment	Setup:	Datasets
• WSJ0-2mix	and	3-mix
• Derived	from	WSJ0	corpus	
• 2- and	3-speaker	mixtures	(artificially	generated)
• 30h	training	set,	10h	validation	set,	5h	test	set	
• Mixed	at	SIRs	between	0	dB	and	5	dB.
• Danish-2mix	and	3-mix	
• Derived	from	a	Danish	corpus	
• 2- or	3-speaker	mixtures	(artificially	generated)
• 10k,	1k,	1k+1k	utterances	in	training,	validation,	and	test	sets	
• Mixed	at	0dB
• WSJ0-2mix-other
• Same	as	WSJ0-2mix	but	mixed	at	0dB
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 21
Models
• Implemented	using	the	Microsoft	cognitive	toolkit	(CNTK)
• Input:	257	dim	STFT;	Output:	257	x	S	streams
• Segment-based	(PIT-S):	Each	segment	is	independent,	no	tracing
• DNN:	3	hidden	layers	each	with	1024	ReLU units
• PIT	with	tracing	(PIT-T):	force	all	frames	from	the	same	output	layer	
to	belong	to	the	same	speaker
• LSTM:	3	LSTM	layers	each	with	1792	units
• BLSTM:	3	BLSTM	layers	each	with	896	units
• Test	Conditions
• Closed	condition	(CC): seen	speakers
• Open	condition	(OC):	unseen	speakers
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 22
PIT-S	Training	Behavior:	WSJ0-2mix
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 23
PIT-S:	SDR	Gain	(dB)	on	WSJ0-2MIX	
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 24
PIT-T	Training	Behavior:	WSJ0-2mix
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 25
PIT-T:	SDR	Gain	(dB)	on	WSJ0-2MIX	
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 26
SDR	(dB)	and	PESQ	Gain	Comparison
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 27
Cross	Language	Behavior	on	2-talker	Mix
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 28
PIT-T	on	WSJ0-3mix
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 29
PIT-T	Trained	with	Both	2- and	3-mix
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 30
Examples:	2-talker	Mix
•Male+Female:
•Mix:
•S1:
•S2:	
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 31
•Female+Male:
•Mix:
•S1:
•S2:	
•Female+Female:
•Mix:
•S1:
•S2:	
•Male+Male:
•Mix:
•S1:
•S2:
Examples:	3-talker	Mix
•Male+2Female:
•Mix:
•S1:
•S2:
•S3:	
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 32
•Female+2Male:
•Mix:
•S1:
•S2:	
•S3:
Example:	Trained	on	3-Mix	Test	on	2-Mix
•Diff	Gender:
•Mix:
•S1:
•S2:
•S3:	
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 33
•Same	Gender:
•Mix:
•S1:
•S2:	
•S3:
Example:	Trained	on	2	and	3-Mix,	test	on	2-Mix
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 34
•Diff	Gender:
•Mix:
•S1:
•S2:
•S3:	
•Same	Gender:
•Mix:
•S1:
•S2:	
•S3:
Outline
• Motivation
• Problem	Setup	and	Prior	Arts
• Multi-talker	Speech	Separation
• Experiments
• Conclusion
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 35
Conclusion
• PIT	can	solve	the	label	permutation	problem
• PIT	is	effective	in	speech	separation	without	knowing	number	of	
speakers
• PIT	trained	models	generalize	well	to	unseen	speakers	and	languages
• PIT	is	simple	to	implement
• PIT	has	great	potential	since	it	can	be	easily	integrated	and	combined	
with	other	techniques
3/27/17 Dong	Yu	:	Multi-talker	Speech	Separation	and	Tracing	with	Permutation	Invariant	Training 36
Classification	View	
(supervised	approach)
Segmentation	view
(deep	clustering)
Separation	View
(PIT)
PIT	is	an	important	ingredient	in	the	
final	solution	to	the	cocktail	party	problem

Weitere ähnliche Inhalte

Ähnlich wie Multi-talker Speech Separation and Tracing at AI NEXT Conference

Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCNathanBowen8
 
DTUI6_chap09_accessiblePPT.pptx
DTUI6_chap09_accessiblePPT.pptxDTUI6_chap09_accessiblePPT.pptx
DTUI6_chap09_accessiblePPT.pptxHetaSuto
 
Emerging technologies
Emerging technologiesEmerging technologies
Emerging technologiesMrslogan_1
 
Edtp 558 roa udl assistive_tech
Edtp 558 roa udl assistive_techEdtp 558 roa udl assistive_tech
Edtp 558 roa udl assistive_techJesse Roa
 
Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...
Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...
Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...InfinIT - Innovationsnetværket for it
 
Demo day presentation
Demo day presentationDemo day presentation
Demo day presentationBilly Kennedy
 
CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)
CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)
CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)John Vines
 
Introduction to NLP.pptx
Introduction to NLP.pptxIntroduction to NLP.pptx
Introduction to NLP.pptxbuivantan_uneti
 
Differences in-task-descriptions
Differences in-task-descriptionsDifferences in-task-descriptions
Differences in-task-descriptionsSameer Chavan
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxVishnuRajuV
 
Mobile lang lab 2009
Mobile lang lab 2009Mobile lang lab 2009
Mobile lang lab 2009lenerybner
 
Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...
Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...
Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...Università degli Studi dell'Aquila
 
Social Networks & e-Business: Capturing Real-Time Niche Markets
Social Networks & e-Business: Capturing Real-Time Niche MarketsSocial Networks & e-Business: Capturing Real-Time Niche Markets
Social Networks & e-Business: Capturing Real-Time Niche MarketsDavid Marca
 
Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...
Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...
Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...Steve McCarty
 
Opening remarks for PhoneCom 2011 workshop
Opening remarks for PhoneCom 2011 workshopOpening remarks for PhoneCom 2011 workshop
Opening remarks for PhoneCom 2011 workshopphonecom
 

Ähnlich wie Multi-talker Speech Separation and Tracing at AI NEXT Conference (20)

M6 cp martensdragalinm
M6 cp martensdragalinmM6 cp martensdragalinm
M6 cp martensdragalinm
 
M6 cp martensdragalinm
M6 cp martensdragalinmM6 cp martensdragalinm
M6 cp martensdragalinm
 
M6 cp martensdragalinm
M6 cp martensdragalinmM6 cp martensdragalinm
M6 cp martensdragalinm
 
Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSC
 
final-day1-july2.pptx
final-day1-july2.pptxfinal-day1-july2.pptx
final-day1-july2.pptx
 
Amadou
AmadouAmadou
Amadou
 
DTUI6_chap09_accessiblePPT.pptx
DTUI6_chap09_accessiblePPT.pptxDTUI6_chap09_accessiblePPT.pptx
DTUI6_chap09_accessiblePPT.pptx
 
Emerging technologies
Emerging technologiesEmerging technologies
Emerging technologies
 
Edtp 558 roa udl assistive_tech
Edtp 558 roa udl assistive_techEdtp 558 roa udl assistive_tech
Edtp 558 roa udl assistive_tech
 
Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...
Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...
Challenges when doing usability tests on physical devices af Lars Bo Larsen, ...
 
Demo day presentation
Demo day presentationDemo day presentation
Demo day presentation
 
CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)
CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)
CSC8605-005 Mocking Up (Simon Bowen, Newcastle University)
 
Introduction to NLP.pptx
Introduction to NLP.pptxIntroduction to NLP.pptx
Introduction to NLP.pptx
 
Differences in-task-descriptions
Differences in-task-descriptionsDifferences in-task-descriptions
Differences in-task-descriptions
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
 
Mobile lang lab 2009
Mobile lang lab 2009Mobile lang lab 2009
Mobile lang lab 2009
 
Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...
Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...
Domain-specific Modeling and Code Generation for Cross-platform Mobile and Io...
 
Social Networks & e-Business: Capturing Real-Time Niche Markets
Social Networks & e-Business: Capturing Real-Time Niche MarketsSocial Networks & e-Business: Capturing Real-Time Niche Markets
Social Networks & e-Business: Capturing Real-Time Niche Markets
 
Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...
Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...
Symposium on Utilizing Emerging Technologies and Social Media to Enhance EFL ...
 
Opening remarks for PhoneCom 2011 workshop
Opening remarks for PhoneCom 2011 workshopOpening remarks for PhoneCom 2011 workshop
Opening remarks for PhoneCom 2011 workshop
 

Mehr von Bill Liu

Walk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectWalk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectBill Liu
 
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Bill Liu
 
Productizing Machine Learning at the Edge
Productizing Machine Learning at the EdgeProductizing Machine Learning at the Edge
Productizing Machine Learning at the EdgeBill Liu
 
Transformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroTransformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroBill Liu
 
Deep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps WorkflowsDeep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps WorkflowsBill Liu
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixBill Liu
 
Practical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScalePractical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScaleBill Liu
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBuilding large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBill Liu
 
Deep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsDeep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsBill Liu
 
Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Bill Liu
 
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsHighly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsBill Liu
 
Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Bill Liu
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningBill Liu
 
Weekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on MobileWeekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on MobileBill Liu
 
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine LearningWeekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine LearningBill Liu
 
AISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with MicroeconomicsAISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with MicroeconomicsBill Liu
 
AISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldAISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldBill Liu
 
AISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the EdgeAISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the EdgeBill Liu
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...Bill Liu
 
Toronto meetup 20190917
Toronto meetup 20190917Toronto meetup 20190917
Toronto meetup 20190917Bill Liu
 

Mehr von Bill Liu (20)

Walk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectWalk Through a Real World ML Production Project
Walk Through a Real World ML Production Project
 
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...
 
Productizing Machine Learning at the Edge
Productizing Machine Learning at the EdgeProductizing Machine Learning at the Edge
Productizing Machine Learning at the Edge
 
Transformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroTransformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to Hero
 
Deep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps WorkflowsDeep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps Workflows
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
 
Practical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScalePractical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at Scale
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBuilding large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
 
Deep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsDeep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its Applications
 
Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19
 
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsHighly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
 
Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
Weekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on MobileWeekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on Mobile
 
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine LearningWeekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
 
AISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with MicroeconomicsAISF19 - On Blending Machine Learning with Microeconomics
AISF19 - On Blending Machine Learning with Microeconomics
 
AISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldAISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First World
 
AISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the EdgeAISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the Edge
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
 
Toronto meetup 20190917
Toronto meetup 20190917Toronto meetup 20190917
Toronto meetup 20190917
 

Kürzlich hochgeladen

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 

Kürzlich hochgeladen (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 

Multi-talker Speech Separation and Tracing at AI NEXT Conference