References and Literature

Abadjieva E., Murray I., Arnott J. (1993). Applying Analysis of Human Emotion Speech to Enhance Synthetic Speech. Proceedings of Eurospeech 93 (2): 909-912.

Acero A. (1998). Source-Filter Models for Time-Scale Pitch-Scale Modification of Speech. Proceedings of ICASSP98.

AcuVoice, Inc. Homepage (1998). <http://www.acuvoice.com>.

Allen J., Hunnicutt S., Klatt D. (1987). From Text to Speech: The MITalk System. Cambridge University Press, Inc.

Altosaar T., Karjalainen M., Vainio M. (1996). A Multilingual Phonetic Representation and Analysis for Different Speech Databases. Proceedings of ICSLP 96 (3).

Amundsen M. (1996). MAPI, SAPI, and TAPI Developers Guide. Sams Publishing. <http://book.ygm.itu.edu.tr/Book/mapi_sapi/index.htm>

Apple Speech Technologies Home Page (1998). <http://www.apple.com/macos/speech/>.

Barber S., Carlson R., Cosi P., Di Benedetto M., Granström B., Vagges K. (1989). A Rule Based Italian Text-to-Speech System. Proceedings of Eurospeech 89 (1): 517-520.

Belhoula K. (1993). Rule-Based Grapheme-to-Phoneme Conversion of Names. Proceedings of Eurospeech 93 (2): 881-884.

Bell Laboratories TTS Homepage (1998). <http://www.bell-labs.com/project/tts/>.

Bellcore ORATOR Homepage (1998). <http://www.bellcore.com/ORATOR>.

Benoit C. (1995). Speech Synthesis: Present and Future. European Studies in Phonetic & Speech Communication. Netherlands. pp. 119-123.

Bernstein J., Pisoni D. (1980). Unlimited Text-to-Speech System: Description and Evaluation of a Microprocessor Based Device. Proceedings of ICASSP 80 (3): 574-579.

Beskow J. (1996). Talking Heads - Communication, Articulation and Animation. Proceedings of Fonetik-96: 53-56.

Beskow J., Dahlquist M., Granström B., Lundeberg M., Spens K-E., Öhman T. (1997). The Teleface Project - Disability, Feasibility, and Intelligibility. Proceedings of Fonetik97, Swedish Phonetics Conf., Umeå, Sweden. <http://www.speech.kth.se/~magnusl/teleface_f97.html>

Beskow J., Elenius K., McGlashan S. (1997). The OLGA Project: An Animated Talking Agent in a Dialogue System. Proceedings of Eurospeech 97. <http://www.speech.kth.se/multimodal/papers/>

Black A., Taylor P. (1994). CHATR: A Generic Speech Synthesis System. COLING94, Japan.

Black A., Taylor P. (1997). Festival Speech Synthesis System: System Documentation (1.1.1). Human Communication Research Centre Technical Report HCRC/TR-83.

Boeffard O., Cherbonnel B., Emerard F., White S. (1993). Automatic Segmentation and Quality Evaluation of Speech Unit Inventories for Concatenation-Based, Multilingual PSOLA Text-to-Speech Systems. Proceedings of Eurospeech 93 (1): 1449-1452.

Breen A. (1992). Speech Synthesis Models: A Review. Electronics & Communication Engineering Journal, vol. 4: 19-31.

Breen A., Bowers E., Welsh W. (1996). An Investigation into the Generation of Mouth Shapes for a Talking Head. Proceedings of ICSLP 96 (4).

BT Laboratories Laureate home page (1998). <http://www.labs.bt.com/innovate/speech/laureate>

Campos G., Gouvea E. (1996). Speech Synthesis Using the CELP Algorithm. Proceedings of ICSLP 96 (3).

Carlson R., Fant G., Gobl C., Granström B., Karlsson I., Lin Q. (1989). Voice Source Rules for Text-to-Speech Synthesis. Proceedings of ICASSP 89 (1): 223-226.

Carlson R., Granström B., Nord L. (1990). Evaluation and Development of the KTH Text-to-Speech System on the Segmental Level. Proceedings of ICASSP 90 (1): 317-320.

Cawley G., Noakes B. (1993a). Allophone Synthesis Using a Neural Network. Proceedings of the First World Congress on Neural Networks (WCNN-93) (2): 122-125. <http://www.sys.uea.ac.uk/~gcc>.

Cawley G., Noakes B. (1993b). LSP Speech Synthesis Using Backpropagation Networks. Proceedings of the IEE International Conference on Artificial Neural Networks (ANN-93): 291-293. <http://www.sys.uea.ac.uk/~gcc>.

Cawley G. (1996). The Application of Neural Networks to Phonetic Modelling. PhD. Thesis, University of Essex, England. <http://www.sys.uea.ac.uk/~gcc/thesis.html>

Charpentier F., Moulines E. (1989). Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones. Proceedings of Eurospeech 89 (2): 13-19.

Charpentier F., Stella M. (1986). Diphone Synthesis Using an Overlap-Add Technique for Speech Waveforms Concatenation. Proceedings of ICASSP 86 (3): 2015-2018.

Childers D., Hu H. (1994). Speech Synthesis by Glottal Excited Linear Prediction. Journal of the Acoustical Society of America, JASA vol. 96 (4): 2026-2036.

Cohen M., Massaro D. (1993). Modelling Coarticulation in Synthetic Visual Speech. Proceedings of Computer Animation 93, Suisse.

Cole R., Mariani J., Uszkoreit H., Zaenen A., Zue V. (Editors) (1995). Survey of the State of the Art in Human Language Technology.

Cowie R., Douglas-Cowie E. (1996). Automatic Statistical Analysis of the Signal and Prosodic Signs of Emotion in Speech. Proceedings of ICSLP 96 (3).

Delogu C., Paolini A., Ridolfi P., Vagges K. (1995). Intelligibility of Speech Produced by Text-to-Speech Systems in Good and Telephonic Conditions. Acta Acustica 3 (1995): 89-96.

Dettweiler H., Hess W. (1985). Concatenation Rules for Demisyllable Speech Synthesis. Proceedings of ICASSP 85 (2): 752-755.

Donovan R. (1996). Trainable Speech Synthesis. PhD. Thesis. Cambridge University Engineering Department, England.
<ftp://svr-ftp.eng.cam.ac.uk/pub/reports/donovan_thesis.ps.Z>.

Dutoit T. (1994). High Quality Text-to-Speech Synthesis: A Comparison of Four Candidate Algorithms. Proceedings of ICASSP 94 (1): 565-568.

Dutoit T., Leich H. (1992). Improving the TD-PSOLA Text-to-Speech Synthesizer with a Specially Designed MBE Re-Synthesis of the Segments Database. Proceedings of EUSIPCO-92 (1): 343-346.

Dutoit T., Leich H. (1993). MBR-PSOLA: Text-to-Speech Synthesis Based on an MBE Re-Synthesis of the Segments Database. Speech Communication, vol. 13: 435-440.

Dutoit T., Pagel V., Pierret N., Bataille F., Vrecken O. (1996). The MBROLA Project: Towards a Set of High Quality Speech Synthesizers Free of Use for Non Commercial Purposes. Proceedings of ICSLP 96 (3).

Dynastat, Inc. Homepage (1997). <http://www.realtime.net/dynastat/>.

ELAN Informatique Homepage (1998). <http://www.elan.fr/speech/>.

ETI Eloquence Home Page (1998). <http://www.eloq.com/eti0elo.html>.

Eurovocs Homepage (1998). <http://www.elis.rug.ac.be/t%26i/eurovocs.htm>.

Falaschi A., Giustiniani M., Verola M. (1989). A Hidden Markov Model Approach to Speech Synthesis. Proceedings of Eurospeech 89 (2): 187-190.

Fant G. (1970). Acoustic Theory of Speech Production. Mouton, The Hague.

Flanagan J. (1972). Speech Analysis, Synthesis, and Perception. Springer-Verlag, Berlin-Heidelberg-New York.

Flanagan J., Rabiner L. (Editors) (1973). Speech Synthesis. Dowden, Hutchinson & Ross, Inc., Pennsylvania.

Fries G. (1993). Phoneme-Dependent Speech Synthesis in the Time and Frequency Domains. Proceedings of Eurospeech 93 (2): 921-924.

Fujisaki H., Ljungqvist M., Murata H. (1993). Analysis and Modeling of Word Accent and Sentence Intonation in Swedish. Proceedings of ICASSP 93 (2): 211-214.

Galanes F., Savoji M., Pardo J. (1995). Speech Synthesis System Based on a Variable Decimation/Interpolation Factor. Proceedings of ICASSP 95: 636-639.

Gaved M. (1993). Pronunciation and Text Normalisation in Applied Text-to-Speech Systems. Proceedings of Eurospeech 93 (2): 897-900.

George E. (1998). Practical High-Quality Speech and Voice Synthesis Using Fixed Frame Rate ABS/OLA Sinusoidal Modeling. Proceedings of ICASSP98.

Goldstein M. (1995). Classification of Methods Used for Assessment of Text-to-Speech Systems According to the Demands Placed on the Listener. Speech Communication vol. 16: 225-244.

Gonzalo E., Olaszy G., Németh G. (1993). Improvements of the Spanish Version of the MULTIVOX Text-to-Speech System. Proceedings of Eurospeech 93 (2): 869-872.

HADIFIX Speech Synthesis Homepage (1997). University of Bonn.
<http://www.ikp.uni-bonn.de/~tpo/Hadifix.en.html>

Hakulinen J. (1998). Suomenkieliset puhesynteesiohjelmistot (The Software Based Speech Synthesizers for Finnish). Report Draft, University of Tampere, Department of Computing Science, Speech Interfaces, 26.8.1998. <http://www.cs.uta.fi/research/hci/SUI/reports/ra0298jh.html>.

Hallahan W. (1996). DECtalk Software: Text-to-Speech Technology and Implementation. Digital Technical Journal.

Hertz S. (1997). The ETI-Eloquence Text-to-Speech System. White Paper, Eloquent Technology Inc. <http://www.eloq.com/White1297-1.htm>.

Hess W. (1992). Speech Synthesis - A Solved Problem? Proceedings of EUSIPCO 92 (1): 37-46.

Heuft B., Portele T., Rauth M. (1996). Emotions in Time Domain Synthesis. Proceedings of ICSLP 96 (3).

Hirakawa T. (1989). Speech Synthesis Using a Waveform Dictionary. Proceedings of Eurospeech 89 (1): 140-143.

Holmes W., Holmes J., Judd M. (1990). Extension of the Bandwidth of the JSRU Parallel-Formant Synthesizer for High Quality Synthesis of Male and Female Speech. Proceedings of ICASSP 90 (1): 313-316.

Hon H., Acero A., Huang X., Liu J., Plumpe M. (1998). Automatic Generation of Synthesis Units for Trainable Text-to-Speech Systems. Proceedings of ICASSP 98 (CD-ROM).

Howard-Jones P., SAM Partnership (1991). 'SOAP' - A Speech Output Assessment Package for Controlled Multilingual Evaluation of Synthetic Speech. Proceedings of Eurospeech 91 (1): 281-283.

Huang X., Acero A., Adcock J., Hon H., Goldsmith J., Liu J., Plumpe M. (1996). Whistler: A Trainable Text-to-Speech System. Proceedings of ICSLP96 (4).

Huang X., Acero A., Hon H., Ju Y., Liu J., Meredith S., Plumpe M. (1997). Recent Improvements on Microsoft's Trainable Text-to-Speech System - Whistler. Proceedings of ICASSP97 (2): 959-962.

Hunt A., Black A. (1996). Unit Selection in a Concatenative Speech Synthesis System Using a Large Speech Database. Proceedings of ICASSP 96: 373-376.

INM Homepage (1997). <http://www.ineural.com/products.html>.

IPA (1998). International Phonetic Association Homepage. <http://www.arts.gla.ac.uk/IPA/ipa.html>.

ISO/IEC CD 14496-3TTS (1997). Information Technology - Coding of Audiovisual Objects - Part 3: Audio - Subpart 6: Text-to-Speech.

Jekosch U. (1992). The Cluster-Identification Test. Proceedings of ICSLP 92 (1): 205-208.

Jekosch U. (1993). Speech Quality Assessment and Evaluation. Proceedings of Eurospeech 93 (2): 1387-1394.

Karjalainen M. (1978). An Approach to Hierarchical Information Process With an Application to Speech Synthesis by Rule. Doctoral Thesis. Tampere University of Technology.

Karjalainen M., Altosaar T. (1991). Phoneme Duration Rules for Speech Synthesis by Neural Networks. Proceedings of Eurospeech 91 (2): 633-636.

Karjalainen M., Altosaar T., Vainio M. (1998). Speech Synthesis Using Warped Linear Prediction and Neural Networks. Proceedings of ICASSP 98.

Karjalainen M., Laine U., Toivonen R. (1980). Aids for the Handicapped Based on "SYNTE 2" Speech Synthesizer. Proceedings of ICASSP 80 (3): 851-854.

Karlsson I., Neovius L. (1993). Speech Synthesis Experiments with the GLOVE Synthesizer. Proceedings of Eurospeech 93 (2): 925-928.

Klatt D. (1980). Software for a Cascade/Parallel Formant Synthesizer. Journal of the Acoustical Society of America, JASA, Vol. 67: 971-995.

Klatt D. (1982). The Klattalk Text-to-Speech Conversion System. Proceedings of ICASSP 82 (3): 1589-1592.

Klatt D. (1987). Review of Text-to-Speech Conversion for English. Journal of the Acoustical Society of America, JASA vol. 82 (3): 737-793.

Klatt D., Klatt L. (1990). Analysis, Synthesis, and Perception of Voice Quality Variations Among Female and Male Talkers. Journal of the Acoustical Society of America, JASA vol. 87 (2): 820-857.

Klaus H., Klix H., Sotscheck J., Fellbaum K. (1993). An Evaluation System for Ascertaining the Quality of Synthetic Speech Based on Subjective Category Rating Tests. Proceedings of Eurospeech 93 (3): 1679-1682.

Kleijn K., Paliwal K. (Editors) (1998). Speech Coding and Synthesis. Elsevier Science B.V., The Netherlands.

Kortekaas R., Kohlrausch A. (1997). Psychoacoustical Evaluation of the Pitch-Synchronous Overlap-and-Add Speech-Waveform Manipulation Technique Using Single-Formant Stimuli. Journal of the Acoustical Society of America, JASA, Vol. 101 (4): 2202-2213.

Kraft V., Portele T. (1995). Quality Evaluation of Five German Speech Synthesis Systems. Acta Acustica 3 (1995): 351-365.

Kröger B. (1992). Minimal Rules for Articulatory Speech Synthesis. Proceedings of EUSIPCO92 (1): 331-334.

Laine U. (1982). PARCAS, a New Terminal Analog Model for Speech Synthesis. Proceedings of ICASSP 82 (2).

Laine U. (1989). Studies on Modelling of Vocal Tract Acoustics with Applications to Speech Synthesis. Thesis for the degree of Doctor of Technology. Helsinki University of Technology.

Laine U., Karjalainen M., Altosaar T. (1994). Warped Linear Prediction (WLP) in Speech Synthesis and Audio Processing. Proceedings of ICASSP94 (3): 349-352.

Le Goff B., Benoit C. (1996). A Text-to-Audiovisual-Speech Synthesizer for French. Proceedings of ICSLP96.

Lee K. (1989). Hidden Markov Models: Past, Present, and Future. Proceedings of Eurospeech 89 (1): 148-155.

Lehtinen L. (1990). Puhesynteesi aika-alueessa (Speech Synthesis in Time-Domain). Lic. Thesis, University of Helsinki.

Lehtinen L., Karjalainen M. (1989). Individual Sounding Speech Synthesis by Rule Using the Microphonemic Method. Proceedings of Eurospeech 89 (2): 180-183.

Lernout & Hauspie (L&H) Speech Technologies Homepage (1998). <http://www.lhs.com/speechtech/>.

Lewis E., Tatham M. (1993). A Generic Front End for Text-to-Speech Synthesis Systems. Proceedings of Eurospeech 93 (2): 913-916.

Lewis E., Tatham M. (1997). SPRUCE - High Specification Text-to-Speech Synthesis. <http://www.cs.bris.ac.uk/~eric/research/spruce97.html>.

Lindström A., Ljungqvist M., Gustafson K. (1993). A Modular Architecture Supporting Multiple Hypotheses for Conversion of Text to Phonetic and Linguistic Entities. Proceedings of Eurospeech 93 (2): 1463-1466.

Listen2 Homepage (1997). <http://www.islandnet.com/jts/listen2.htm>.

Ljungqvist M., Lindström A., Gustafson K. (1994). A New System for Text-to-Speech and Its Application to Swedish. ICSLP 94 (4): 1779-1782.

Logan J., Greene B., Pisoni D. (1989). Segmental Intelligibility of Synthetic Speech Produced by Rule. Journal of the Acoustical Society of America, JASA vol. 86 (2): 566-581.

Lukaszewicz K., Karjalainen M. (1987). Microphonemic Method of Speech Synthesis. Proceedings of ICASSP87 (3): 1426-1429.

Macchi M., Altom M., Kahn D., Singhal S., Spiegel M. (1993). Intelligibility as a Function of Speech Coding Method for Template-Based Speech Synthesis. Proceedings of Eurospeech 93 (2): 893-896.

Macon M. (1996). Speech Synthesis Based on Sinusoidal Modeling. Doctoral Thesis, Georgia Institute of Technology.

Macon M., Clements M. (1996). Speech Concatenation and Synthesis Using an Overlap-Add Sinusoidal Model. Proceedings of ICASSP 96: 361-364.

Macon M., Jensen-Link L., Oliverio J., Clements M., George E. (1997). A Singing Voice Synthesis System Based on Sinusoidal Modeling. Proceedings of ICASSP97.

Mariniak A. (1993). A Global Framework for the Assessment of Synthetic Speech Without Subjects. Proceedings of Eurospeech 93 (3): 1683-1686.

MBROLA Project Homepage (1998). <http://tcts.fpms.ac.be/synthesis/mbrola.html>.

McAulay R., Quatieri T. (1986). Speech Analysis-Synthesis Based on Sinusoidal Representation. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-34 (4): 744-754.

Meyer P., Rühl H., Krüger R., Kugler M., Vogten L., Dirkensen A., Belhoula K. (1993). PHRITTS - A Text-to-Speech Synthesizer for the German Language. Proceedings of Eurospeech 93 (2): 877-880.

ModelTalker Homepage (1997). University of Delaware (ASEL). <http://www.asel.udel.edu/speech/Dsynterf.html>.

Morton K. (1987). The British Telecom Research Text-to-Speech Synthesis System - 1984-1986. Speech Production and Synthesis. Unpublished PhD Thesis. University of Essex. pp. 142-172. <http://wrangler.essex.ac.uk/speech/archive/bt>.

Morton K. (1991). Expectations for Assessment Techniques Applied to Speech Synthesis. Proceedings of the Institute of Acoustics vol. 13 Part 2. <http://wrangler.essex.ac.uk/speech/archive/assess/assess.html>.

Moulines E., Emerard F., Larreur D., Le Saint Milon J., Le Faucheur L., Marty F., Charpentier F., Sorin C. (1990). A Real-Time French Text-to-Speech System Generating High-Quality Synthetic Speech. Proceedings of ICASSP 90 (1): 309-312.

Moulines E., Laroche J. (1995). Non-Parametric Techniques for Pitch-Scale Modification of Speech. Speech Communication 16 (1995): 175-205.

MPEG Homepage (1998). <http://drogo.cselt.stet.it/mpeg/>

Murray I., Arnott J., Alm N., Newell A. (1991). A Communication System for the Disabled with Emotional Synthetic Speech Produced by Rule. Proceedings of Eurospeech 91 (1): 311-314.

Murray I., Arnott J. (1993). Toward the Simulation of Emotions in Synthetic Speech: A Review of the Literature on Human Vocal Emotion. Journal of the Acoustical Society of America, JASA vol. 93 (2): 1097-1108.

Murray I., Arnott J. (1996). Synthesizing Emotions in Speech: Is It Time to Get Excited? Proceedings of ICSLP 96 (3).

Möbius B., Schroeter J., Santen J., Sproat R., Olive J. (1996). Recent Advances in Multilingual Text-to-Speech Synthesis. Fortschritte der Akustik, DAGA-96.

Möbius B., Sproat R., Santen J., Olive J. (1997). The Bell Labs German Text-to-Speech System: An Overview. Proceedings of the European Conference on Speech Communication and Technology vol. 5: 2443-2446.

Neovius L., Raghavendra P. (1993). Comprehension of KTH Text-to-Speech with "Listening Speed" Program. Proceedings of Eurospeech 93 (3): 1687-1690.

Ohala J. (1996). Ethological Theory and the Expression of Emotion in the Voice. Proceedings of ICSLP 96 (3).

Olaszy G. (1989). MULTIVOX - A Flexible Text-to-Speech System for Hungarian, Finnish, German, Esperanto, Italian and Other Languages for IBM-PC. Proceedings of Eurospeech 89 (2): 525-528.

Olaszy G., Németh G. (1997). Prosody Generation for German CTS/TTS Systems (From Theoretical Intonation Patterns to Practical Realisation). Speech Communication, vol. 21 (1997): 37-60.

Oliveira L., Viana M., Trancoso I. (1992). A Rule Based Text-to-Speech System for Portuguese. Proceedings of ICASSP 92 (2): 73-76.

O'Shaughnessy D. (1987). Speech Communication - Human and Machine, Addison-Wesley.

Panasonic CyberTalk Homepage (1998). <http://www.research.panasonic.com/pti/stl/stl_web_demo/demo.html>.

Pavlovic C., Rossi M., Espesser R. (1990). Use of the Magnitude Estimation Technique for Assessing the Performance of Text-to-Speech Synthesis System. Journal of the Acoustical Society of America, JASA vol. 87 (1): 373-382.

Pfister B. (1995). The SVOX Text-to-Speech System. Computer Engineering and Networks Laboratory, Speech Processing Group, Swiss Federal Institute of Technology, Zurich. <http://www.tik.ee.ethz.ch/~spr/publications/Pfister:95d.ps>.

Pisoni D., Hunnicutt S. (1980). Perceptual Evaluation of MITalk: The MIT Unrestricted Text-to-Speech System. Proceedings of ICASSP 80 (3): 572-575.

Pols L. (1994). Voice Quality of Synthetic Speech: Representation and Evaluation. Proceedings of ICSLP 94 (3): 1443-1446.

Pols L., SAM-partners (1992). Multilingual Synthesis Evaluation Methods. Proceedings of ICSLP 92 (1): 181-184.

Portele T., Höfer F., Hess W. (1994). A Mixed Inventory Structure for German Concatenative Synthesis. University of Bonn.
<ftp://asl1.ikp.uni-bonn.de/pub/vm41/tpnpal94.ps.gz>.

Portele T., Krämer J. (1996). Adapting a TTS System to a Reading Machine for the Blind. Proceedings of ICSLP 96 (1).

Portele T., Steffan B., Preuss R., Hess W. (1991). German Text-to-Speech Synthesis by Concatenation of Non-Parametric Units. Proceedings of Eurospeech 91 (1): 317-320.

Portele T., Steffan B., Preuss R., Sendlmeier W., Hess W. (1992). HADIFIX - A Speech Synthesis System for German. Proceedings of ICSLP 92 (2): 1227-1230.

Rabiner L., Schafer R. (1978). Digital Processing of Speech Signals, Prentice-Hall.

Rahim M., Goodyear C., Kleijn B., Schroeter J., Sondhi M. (1993). On the Use of Neural Networks in Articulatory Speech Synthesis. Journal of the Acoustical Society of America, JASA vol. 93 (2): 1109-1121.

Rentzepopoulos P., Kokkinakis G. (1992). Multilingual Phoneme to Grapheme Conversion System Based on HMM. Proceedings of ICSLP 92 (2): 1191-1194.

Rossing T. (1990). The Science of Sound. Addison-Wesley.

Rutledge J., Cummings K., Lambert D., Clements M. (1995). Synthesized Styled Speech Using the KLATT Synthesizer. Proceedings of ICASSP 95: 648-651.

Sagisaka Y. (1990). Speech Synthesis from Text.

Salmensaari O. (1989). Puhesyntetisaattoritesti (Speech Synthesizer Test). HUT Acoustics Laboratory. Unpublished report. Espoo 1.12.1989.

Santen J. (1993). Timing in Text-to-Speech Systems. Proceedings of Eurospeech 93 (2): 1397-1404.

Santen J., Sproat R., Olive J., Hirschberg J. (editors) (1997). Progress in Speech Synthesis, Springer-Verlag New York Inc. (Includes CD-ROM).

Scherer K. (1996). Adding the Affective Dimension: A New Look in Speech Analysis and Synthesis. Proceedings of ICSLP 96 (3).

Schroeder M. (1993). A Brief History of Synthetic Speech. Speech Communication vol. 13: 231-237.

Scordilis M., Gowdy J. (1989). Neural Network Based Generation of Fundamental Frequency Contours. Proceedings of ICASSP 89 (1): 219-222.

Shiga Y., Hara Y., Nitta T. (1994). A Novel Segment-Concatenation Algorithm for a Cepstrum-Based Synthesizer. Proceedings of ICSLP 94 (4): 1783-1786.

SoftVoice, Inc. Homepage (1997). <http://www.text2speech.com/>.

Spiegel M. (1993). Using the ORATOR Synthesizer for a Public Reverse-Directory Service: Design, Lessons, and Recommendations. Proceedings of Eurospeech 93 (3): 1897-1900.

Sproat R. (1996). Multilingual Text Analysis for Text-to-Speech Synthesis. Proceedings of ICSLP 96 (3).

Sproat R., Taylor P., Tanenblatt M., Isard A. (1997). A Markup Language for Text-to-Speech Synthesis. Proceedings of Eurospeech 97.

SVOX Text-to-Speech Synthesis Homepage (1997).
<http://www.tik.ee.ethz.ch/cgi-bin/w3svox>.

Tatham M., Lewis E. (1992a). Prosodic Assignment in SPRUCE Text-to-Speech Synthesis. Proceedings of Institute of Acoustics, vol. 14, Part 6 (1992): 447-454.

Tatham M., Lewis E. (1992b). Prosodics in a Syllable-Based Text-to-Speech Synthesis System. Proceedings of ICSLP92 (2): 1179-1182.

Tatham M., Lewis E. (1995). Naturalness in a High-Level Synthetic Speech System. Proceedings of Eurospeech 95 (3): 1815-1818.

Tatham M., Lewis E. (1996). Improving Text-to-Speech Synthesis. Proceedings of ICSLP 96 (3).

Tatham M., Morton K. (1972). /p/ and /pp/ in Finnish: Duration of the Voiceless Phase in Intervocalic Context. Occasional Papers No 13/1972, Language Centre, University of Essex. <http://wrangler.essex.ac.uk/speech/archive/>.

Taylor P., Isard A. (1997). SSML: A Speech Synthesis Markup Language. Speech Communication vol. 21: 123-133.

Telia Promotor Home Page (1998). <http://www.infovox.se>.

Valbret H., Moulines E., Tubach J. (1991). Voice Transformation Using PSOLA Technique. Proceedings of Eurospeech 91 (1): 345-348.

Veldhuis R., Bogaert I., Lous N. (1995). Two-Mass Models for Speech Synthesis. Proceedings of Eurospeech 95 (3): 1853-1856.

Waters K., Levergood T. (1993). DECface: An Automatic Lip-Synchronization Algorithm for Synthetic Faces. DEC Technical Report Series, Cambridge Research Laboratory, CRL 93/4. <http://www.crl.research.digital.com/projects/facial/facialdoc.html>.

Witten I. (1982). Principles of Computer Speech, Academic Press Inc.
