CRC Press, 2000. — 247 p.
A comprehensive description of speech coding algorithms and methods, with implementation details of these algorithms in widely used speech codecs.
Speech Production
The Speech Chain
Articulation
Excitation
Vocal Tract
Phonemes
Source-Filter Model
Speech Analysis Techniques
Sampling the Speech Waveform
Systems and Filtering
Z-Transform
Fourier Transform
Discrete Fourier Transform
Fast Fourier Transform
Windowing Signal Segments
Linear Prediction Vocal Tract Modeling
Sound Propagation in the Vocal Tract
Multiple-Tube Model
Estimation of LP Parameters
Autocorrelation Method of Parameter Estimation
Covariance Method
Transformations of LP Parameters for Quantization
Log Area Ratios
Line Spectral Frequencies
Examples of LP Modeling
Pitch Extraction
Autocorrelation Pitch Estimation
Autocorrelation of Center-Clipped Speech
Cross Correlation
Energy Normalized Correlation
Cepstral Pitch Extraction
Frequency-Domain Error Minimization
Pitch Tracking
Median Smoothing
Dynamic Programming Tracking
Auditory Information Processing
The Basilar Membrane: A Spectrum Analyzer
Critical Bands
Thresholds of Audibility and Detectability
Monaural Masking
Simultaneous Masking in Frequency
Temporal Masking
Quantization and Waveform Coders
Uniform Quantization
Uniform Pulse Code Modulation (PCM)
Nonlinear Quantization
Nonuniform Pulse Code Modulation
Differential Waveform Coding
Predictive Differential Coding
Delta Modulation
Adaptive Quantization
Adaptive Delta Modulation
Adaptive Differential Pulse Code Modulation (ADPCM)
Vector Quantization
Distortion Measures
Codebook Training
Complexity Reduction Approaches
Predictive Vector Quantization
Quality Evaluation
Objective Measures
Signal-to-Noise Ratio
Spectral Distance
Subjective Measures
Intelligibility
Quality
Background Noise and Channel Conditions
Perceptual Objective Measures
Voice Coding Concepts
Channel Vocoder
Implementations of the Channel Vocoder
Formant Vocoder
The Sinusoidal Speech Coder
The Sinusoidal Model
Sinusoidal Parameter Analysis
Linear Prediction Vocoder
Federal Standard 1015, LPC-10e at 2.4 kbit/s
Linear Prediction Analysis by Synthesis
Analysis by Synthesis Estimation of Excitation
Multi-Pulse Linear Prediction Coder
Regular Pulse Excited LP Coder
ETSI GSM Full Rate RPE-LTP
Code Excited Linear Prediction Coder
CELP Concept
CELP Computational Efficiency Improvements
Adaptive Postfiltering
Federal Standard 1016, CELP at 4.8 kbit/s
ITU-T G.728 Low Delay CELP at 16 kbit/s
ITU-T G.723.1 Algebraic CELP/Multi-Pulse Coder at 5.3/6.3 kbit/s
ETSI GSM Enhanced Full Rate Algebraic CELP at 12.2 kbit/s
IS-641 EFR 7.4 kbit/s Algebraic CELP for IS-136 North American Digital Cellular
ETSI GSM Adaptive Multi-Rate Algebraic CELP from 4.75 to 12.2 kbit/s
Mixed Excitation Coding
Multi-Band Excitation Vocoder
Multi-Band Excitation Analysis
Multi-Band Excitation Synthesis
Implementations of the MBE Vocoder
Mixed Excitation Linear Prediction Coder
Federal Standard MELP Coder at 2.4 kbit/s
Improvements to MELP Coder
Split Band LPC Coder
Bit Allocations and Quality Results
Harmonic Vector Excitation Coder
HVXC Encoder
HVXC Decoder
HVXC Performance
Waveform Interpolation Coding
WI Coder and Decoder
Quantization of SEW and REW
Performance and Enhancements
Perceptual Speech Coding
Auditory Processing of Speech
General Perceptual Speech Coder
Frequency and Temporal Masking
Determining Masking Levels
Perceptual Coding Considerations
Limits on Time/Frequency Resolution
Sound Quality of Signal Components
MBE Model for Perceptual Coding
Research in Perceptual Speech Coding
Related Internet Sites
Information on Coding Standards
Technical Conferences