Using noisy data for speaker adaptation in HMM-based Speech Synthesis

Training data samples * →



Clean speech
F:
M:


Babble-corrupted speech
F:
M:


Factory-corrupted speech
F:
M:

Average voices
F:
M:
CSMAPLR adaptation
with 100 sentences
CSMAPLR adaptation
with 100 sentences
CSMAPLR adaptation
with 100 sentences
Synthesised samples →
(Synthesised MCEP, original F0)
F:
M:
F:
M:
F:
M:

*) This particular sample is from the test set and has not been used in adaptation





For details see: