[6b] Speech Recognition and Synthesis

Speech Recognition and Synthesis:

   John Allen, Sharon Hunnicut and Dennis H. Klatt, "From Text to Speech:
   The MITalk System", Cambridge University Press, 1987. [Synthesis,
   precursor of DECtalk.]

   Frank Fallside and William A. Woods (editors), "Computer Speech Processing"
   Prentice Hall, Englewood Cliffs, NJ, 1985. 

   X. D. Huang, Y. Ariki and M. A. Jack, "Hidden Markov Models for Speech
   Recognition", Edinburgh University Press, 1990. [Analysis]

   A. Nejat Ince (editor), "Digital Speech Processing: Speech Coding,
   Synthesis, and Recognition", Kluwer Academic Publishers, Boston,
   1992. [Analysis and Synthesis]

   Dennis H. Klatt, "Review of Text-To-Speech Conversion for English",
   Journal of the Acoustic Society of America (JASA), 82(3):737-793,
   September 1987. [Synthesis. Seminal article; biased toward formant
   synthesis.] 

   Kai-Fu Lee, "Automatic Speech Recognition: The Development of the
   SPHINX System", Kluwer Academic Publishers, Boston, MA, 1989. [Analysis]

   S. E. Levinson, L. R. Rabiner and M. M. Sondhi, "An Introduction to the
   Application of the Theory of Probabilistic Functions of a Markov Process
   to Automatic Speech Recognition" in Bell Syst. Tech. Journal
   62(4):1035-1074, April 1983.  [Analysis]

   R. P. Lippmann, "Review of Neural Networks for Speech Recognition", 
   Neural Computation, 1(1):1-38, 1989. [Analysis]

   Douglas O'Shaughnessy, "Speech Communication: Human and Machine"
   Addison-Wesley, MA, 1987. [Analysis and Synthesis]

   Lawrence R. Rabiner and Ronald W. Schafer, "Digital Processing of
   Speech Signals", Prentice Hall, Englewood Cliffs, NJ, 1978.
   [Analysis and Synthesis]

   Lawrence R. Rabiner and Biing-Hwang Juang, "Fundamentals of Speech
   Recognition", Prentice Hall, Englewood Cliffs, NJ, 1993.
   ISBN 0-13-015157-2. [Analysis]

   Ronald W. Schafer and John D. Markel (editors), "Speech Analysis",
   IEEE Press, New York, 1979. [Analysis]

   Alex Waibel and Kai-Fu Lee (editors), "Readings in Speech Recognition"
   Morgan Kaufmann Publishers, San Mateo, CA, 1990, 680 pages. 
   ISBN 1-55860-124-4, $49.95. [Analysis]

   Alex Waibel, "Prosody and Speech Recognition", Morgan Kaufmann
   Publishers, San Mateo, CA, 1988. [Analysis]

Speaker Recognition:

   B. S. Atal, "Automatic recognition of speakers from their voices",
   Proc. IEEE, 64:460-475, April 1976.

   A. E. Rosenberg, "Automatic speaker verification: A review", 
   Proc. IEEE, 64:475-487, April 1976. 

   G.R. Doddington, "Speaker recognition -- identifying people by their
   voices",  Proc. IEEE, 73:1651-1664, March 1985. 

   A.E. Rosenberg and F.K. Soong, "Recent research in automatic speaker
   recognition," in S. Furui and M. Sondhi, editors, Advances in Speech
   Sigmal Processing, 1991.
Go Back Up

Go To Previous

Go To Next