Kluwer, 2003. — 426 p. — ISBN10: 1402073577, ISBN13: 978-1402073571.
The idea of this book came from creating and teaching a class for graduate students on Audio Coding at Stanford University's Computer Center for Research in Music and Acoustics (CCRMA). The subject of audio coding is a "hot topic" with students wanting to better understand the technology behind the MP3 files they are downloading over the internet, their audio choices on their DVDs, the digital radio proposals in the news, and the digital television offered by cable and satellite providers. Now in its sixth year, the class attracts a wide range of participants including music students, engineering students, and industrial professionals working in telecommunications, hardware design, and software product development.
In designing a course for such a diverse group, it is important to develop a shared vocabulary and understanding of the basic building blocks of a digital audio coder so that the choices made in any particular coder can be discussed using a commonly understood language. In the course, we first address the theory and implementation of each of the basic coder building blocks. We then show how the building blocks fit together into a full coder and how to judge the performance of such a coder. Finally, we discuss the features, choices, and performance of the main state-of-the-art coders in commercial use today.
The ultimate goal of the class, and now of this book, is to present the student and the reader with a solid enough understanding of the major issues in the theory and implementation of perceptual audio coders that they are able to build their own simple audio codec. MB is always very pleasantly surprised to hear the results of her student's work. As a final project for the class, they are able to design and implement perceptual audio coding schemes equivalent to audio coding schemes that were state-of-the-art only a few years ago. It is our hope that this book will allow advanced readers to achieve similar goals.
Audio Coding MethodsQuantization
Representation of Audio Signals
Time to Frequency Mapping Part I: The PQMF
Time to Frequency Mapping Part II: The MDCT
Introduction to Psychoacoustics
Psychoacoustic Models for Audio Coding
Bit Allocation Strategies
Building a Perceptual Audio Coder
Quality Measurement of Perceptual Audio Codecs
Audio Coding StandardsMPEG-1 Audio
MPEG-2 Audio
MPEG-2 AAC
Dolby AC-3
MPEG-4 Audio