Embedded Audio Coding (EAC) With Implicit Auditory Masking
- Jin Li
ACM Multimedia 2002, Nice, France |
An embedded audio coder (EAC) is proposed with compression performance rivals the best available non-scalable audio coder. The key technology that empowers the EAC with high performance is the implicit auditory masking. Unlike the common practice, where an auditory masking threshold is derived from the input audio signal, transmitted to the decoder and used to quantize (modify) the transform coefficients; the EAC integrates the auditory masking process into the embedded entropy coding. The auditory masking threshold is derived from the encoded coefficients and used to change the order of coding. There is no need to store or send the auditory masking threshold in the EAC. By eliminating the overhead of the auditory mask, EAC greatly improves the compression efficiency, especially at low bitrate. Extensive experimental results demonstrate that the EAC coder substantially outperforms existing scalable audio coders and audio compression standards (MP3 and MPEG-4), and rivals the best available commercial audio coder. Yet the EAC compressed bitstream is fully scalable, in term of the coding bitrate, number of audio channels and audio sampling rate.
Copyright © 2002 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept, ACM Inc., fax +1 (212) 869-0481, or permissions@acm.org.