2014 | OriginalPaper | Chapter
Model of Auditory Filters and MPEG-7 Descriptors in Sound Recognition
Authors : Aneta Świercz, Jan Żera
Published in: Active Media Technology
Publisher: Springer International Publishing
Activate our intelligent search to find suitable subject content or patents.
Select sections of text to find matching patents with Artificial Intelligence. powered by
Select sections of text to find additional relevant content using AI-assisted search. powered by
It was examined whether applying a model of human auditory filter could improve the quality of sound recognition with the use of MPEG-7 standard audio descriptors. Modeling of filtering in the auditory system was with a bank of 38 gammatone filters closely spaced across the audible frequency range. The bank of filters was implemented as a low-level audio descriptor to replace the short-term Fourier transform (STFT) MPEG-7 audio descriptor. Sound recognition tests were conducted on a large set of sounds of nine musical instruments and speech of twelve speakers. The results showed that the proposed descriptor employing a bank of gammatone filters led to improved recognition of musical instruments and speakers as compared to the STFT-based original low-level MPEG-7 audio descriptor.