Book, English, 119 pages, paperback, format (W × H): 187 mm × 235 mm
Series: Synthesis Lectures on Algorithms and Software in Engineering
ISBN: 978-1-60845-387-0
Publisher: Morgan & Claypool Publishers
The advantages of integrating perceptual models into low-bit-rate speech coding depend on how accurately these models mimic human performance and, more importantly, on the achievable "coding gains" and the "computational overhead" associated with these physiological models. Even today, the methods that exploit the masking properties of the human ear in speech coding standards are largely based on concepts introduced by Schroeder and Atal in 1979. For example, a simple approach employed in speech coding standards is to use a perceptual weighting filter to shape the quantization noise according to the masking properties of the human ear. The second half of the book reviews some of the recent developments in perceptual modeling of speech (e.g., masking threshold, psychoacoustic models, auditory excitation pattern, and loudness) with the help of Matlab™ simulations. Supplementary material, including the Matlab™ programs and simulation examples presented in the book, is also available.
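As a rough illustration of the perceptual weighting filter mentioned above, the sketch below uses one commonly cited form, W(z) = A(z/γ1) / A(z/γ2), where A(z) is the linear-prediction inverse filter. It is not taken from the book or its Matlab™ material: the function names, the γ values, and the first-order LPC polynomial in the usage note are all assumptions chosen for illustration.

```python
def bandwidth_expand(lpc, gamma):
    """Coefficients of A(z/gamma): the k-th coefficient is scaled by gamma**k.

    `lpc` holds [1, a1, ..., ap] for A(z) = 1 + a1*z^-1 + ... + ap*z^-p.
    """
    return [a * gamma ** k for k, a in enumerate(lpc)]


def iir_filter(b, a, x):
    """Direct-form filter: a[0]*y[n] = sum_k b[k]*x[n-k] - sum_{k>=1} a[k]*y[n-k]."""
    y = []
    for n in range(len(x)):
        acc = sum(b[k] * x[n - k] for k in range(len(b)) if n - k >= 0)
        acc -= sum(a[k] * y[n - k] for k in range(1, len(a)) if n - k >= 0)
        y.append(acc / a[0])
    return y


def perceptual_weighting(lpc, x, gamma1=0.9, gamma2=0.6):
    """Filter x through W(z) = A(z/gamma1) / A(z/gamma2).

    gamma1 and gamma2 are illustrative defaults (assumed), not values
    quoted from the book or from any particular standard.
    """
    numerator = bandwidth_expand(lpc, gamma1)
    denominator = bandwidth_expand(lpc, gamma2)
    return iir_filter(numerator, denominator, x)
```

For instance, `perceptual_weighting([1.0, -0.9], signal)` would weight a hypothetical `signal` using an assumed first-order predictor. With γ2 < γ1, the filter de-emphasizes the error near the formant peaks, so more quantization noise is tolerated where the speech spectrum is strong enough to mask it.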
Further Information & Material
- Introduction
- Predictive Modeling of Speech
- Perceptual Modeling of Speech