Reima Karhila, D.R. Sanand, Mikko Kurimo and Peter Smit
Speaker adaptation with 10 sentences, as used in the listening tests for ICASSP 2012.
(See the paper in IEEE Xplore)
40 child speakers, 60 sentences each

VTLN normalisation

CSMAPLR group adaptation

Target speaker

| Speaker adaptation | Resulting voice |
|---|---|
| CSMAPLR speaker adaptation with 10 sentences | Adapted from adult voice |
| CSMAPLR speaker adaptation with 10 sentences | Stack adapted voice |
| VTLN + CSMAPLR speaker adaptation with 10 sentences | Stack adapted voice with VTLN |
| CSMAPLR speaker adaptation with 10 sentences | Adapted from child voice |