|
Training data samples *
→
|
![]() Clean speech F: M: |
![]() Babble-corrupted speech M: |
![]() Factory-corrupted speech F: M: |
![]() Babble-corrupted speech with noise suppression F: M: |
![]() Factory-corrupted speech with noise suppression F: M: |
![]() Average voices F: M: |
↓ | ↓ | ↓ | ↓ | ↓ |
| ↳ |
CSMAPLR adaptation with 100 sentences |
CSMAPLR adaptation with 100 sentences |
CSMAPLR adaptation with 100 sentences |
CSMAPLR adaptation with 100 sentences |
CSMAPLR adaptation with 100 sentences |
| ↓ | ↓ | ↓ | ↓ | ↓ | |
|
Synthesised samples → (Synthesised MCEP, original F0) |
F: M: |
F: M: |
F: M: |
F: M: |
F: M: |
*) This particular sample is from the test set and has not been used in adaptation