|
| Type of publication: | Inproceedings |
| Entered by: | ADM |
| Title |
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding |
| Bibtex cite ID | Motlicek_TSD2008_2008 |
| Booktitle | Text, Speech and Dialogue |
| Series | Series of Lecture Notes in Artificial Intelligence (LNAI) |
| Year published | 2008 |
| Volume | 5246 |
| Pages | 435-442 |
| Publisher | Springer-Verlag Berlin, Heidelberg |
| Location | Brno, Czech Republic |
| Keywords | Audio Coding,Frequency Domain Linear Prediction (FDLP),speech coding,IM2.VP,Report_VIII |
| Abstract | This paper describes employment of non-uniform QMF decomposition to increase the efficiency of a generic wide-band audio coding system based on Frequency Domain Linear Prediction (FDLP). The base line FDLP codec, operating at high bit-rates ( 136 kbps), exploits a uniform QMF decomposition into 64 sub-bands followed by sub-band processing based on FDLP. Here, we propose a non-uniform QMF decomposition into 32 frequency sub-bands obtained by merging 64 uni- form QMF bands. The merging operation is performed in such a way that bandwidths of the resulting critically sampled sub-bands emulate the characteristics of the critical band filters in the human auditory system. Such frequency decomposition, when employed in the FDLP audio codec, results in a bit-rate reduction of 40\% over the base line. We also describe the complete audio codec, which provides high-fidelity audio compression at 66 kbps. In subjective listening tests, the FDLP codec outperforms MPEG-1 Layer 3 (MP3) and achieves similar qualities as MPEG-4 HE-AAC codec. |
| Authors | |
Topics
| |
| BibTeX | BibTeX |
| RIS | RIS |
| Attachments |
pdf (main file) |
|
| Total mark: | 5 |
[Add a new note...] |
|
|