1. Institut de Technologie du Cambodge (ITC)
Génie Informatique et Communication (GIC)
TTS (Text-To-Speech)
Seangmeng LONG
[seangmeng@itc.edu.kh]
BarCamp
2. What is TTS?
TTS stands for Text-To-Speech
It is a system (module) which takes as input text
in Khmer Unicode and produces Khmer speech
Input Output
Electronic documents TTS system Khmer Speech
2
3. Our Method
Concatenation-Based Synthesis using Diphone
3
5. New Statistical System
Speech corpus
~450 sentences (~30 minutes)
Automatic labeling
EHMM labeler
Sphinx
Statistical parameter synthesis
More natural, but buzzy
Unit selection
Units of variable size (smallest unit is phone)
More natural, but bad quality at join points
5