Reima Karhila, D.R. Sanand, Mikko Kurimo and Peter Smit
Speaker adaptation with 10 sentences, as used in the listening tests for ICASSP 2012.
(See the paper in IEEE Xplore.)
[Figure: adaptation pipeline for the four compared voices. Child data: 40 child speakers, 60 sentences each. Stages shown: VTLN normalisation, CSMAPLR group adaptation, and adaptation to the target speaker.]

| Voice | Adaptation to the target speaker |
|---|---|
| Adapted from adult voice | CSMAPLR speaker adaptation with 10 sentences |
| Stack adapted voice | CSMAPLR speaker adaptation with 10 sentences |
| Stack adapted voice with VTLN | VTLN + CSMAPLR speaker adaptation with 10 sentences |
| Adapted from child voice | CSMAPLR speaker adaptation with 10 sentences |