Abstract
In this paper, we present a novel audio coder using the discrete wavelet transform (DWT) and warped linear prediction (WLP). In contrast to conventional LP, WLP allows for the control of frequency resolution to closely match the response of the human auditory system. The structure of the system is similar to the transform coded excitation techniques used in wideband speech coding, where LP has been replaced with WLP, and the residual is analyzed by a wavelet filterbank designed to approximate the critical bands. The inherent shaping of the WLP synthesis filter, and a controlled bit allocation to the wavelet coefficients helps minimise the perceptually significant noise due to the quantization error in the residual. For monophonic signals sampled at 44.1 kHz, the coder achieves near transparent to transparent quality for a variety of speech and music signals at an average bitrate of about 64 kb/s. Tests also show that the coder (in its initial implementation) delivers superior quality to the MPEG layer III and comparable quality to the MPEG2-AAC codec when operating at the same bitrate
| Original language | English |
|---|---|
| Article number | 1709893 |
| Pages (from-to) | 2039-2048 |
| Number of pages | 10 |
| Journal | IEEE Transactions on Audio, Speech and Language Processing |
| Volume | 14 |
| Issue number | 6 |
| DOIs | |
| State | Published - Nov 2006 |
Bibliographical note
Funding Information:Manuscript received April 28, 2002; revised July 27, 2004. This work was supported by King Fahd University of Petroleum and Minerals (KFUPM) and King Abdul Aziz City for Science and Technology (KACST). The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Gerald Schuller.
Keywords
- Audio coding
- Subband coding
- Transform coding
- Warped linear prediction
ASJC Scopus subject areas
- Acoustics and Ultrasonics
- Electrical and Electronic Engineering