Frequency Filtering for a Highly Robust Audio Fingerprinting Scheme in a Real-Noise Environment

Mansoo PARK
Hoi-Rin KIM
Yong Man RO
Munchurl KIM

IEICE TRANSACTIONS on Information and Systems, Vol.E89-D, No.7, pp.2324-2327
Publication Date: 2006/07/01
Online ISSN: 1745-1361
DOI: 10.1093/ietisy/e89-d.7.2324
Print ISSN: 0916-8532
Type of Manuscript: LETTER
Category: Music Information Processing
Keyword: music information retrieval, content-based audio identification, audio fingerprint, frequency filtering


Noise robustness is one of the most important issues for an audio fingerprinting system used in music information retrieval by content-based audio identification. In a real environment, sound recordings are commonly distorted by channel and background noise. Recently, Philips published a robust and efficient audio fingerprinting system for audio identification. To extract the fingerprint, Philips applied the first derivative (differential) to the frequency-time sequence of the perceptual filter-bank energies. In practice, however, the noise robustness of Philips' scheme is still insufficient. In this paper, we introduce an extension of that audio fingerprinting scheme that enhances its noise robustness. As an alternative frequency filter, a type of band-pass filter is used instead of a high-pass filter to achieve robustness to background noise in real conditions. Our experimental results show that the proposed filter improves noise robustness in audio identification.
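The difference between the two filtering choices described in the abstract can be illustrated with a minimal sketch. It assumes a Philips-style bit derivation (filter the filter-bank energies along the frequency axis, take the first difference along time, keep the sign). The abstract does not give the filter coefficients, so both kernels below are assumptions: `HIGH_PASS` is the adjacent-band first difference, and `BAND_PASS` is a hypothetical symmetric difference across two bands, chosen only because it is a simple band-pass FIR filter. The function name `fingerprint_bits` and the array layout are likewise illustrative, not taken from the paper.

```python
import numpy as np

# Assumed kernels (not given in the abstract). With np.convolve in "valid" mode:
#   HIGH_PASS -> E(m) - E(m+1)      first difference across adjacent bands
#   BAND_PASS -> E(m+1) - E(m-1)    hypothetical symmetric difference (band-pass)
HIGH_PASS = np.array([-1.0, 1.0])
BAND_PASS = np.array([1.0, 0.0, -1.0])


def fingerprint_bits(energies, freq_kernel):
    """Derive binary fingerprint bits from filter-bank energies.

    energies:    array of shape (frames, bands)
    freq_kernel: 1-D FIR kernel applied along the band (frequency) axis
    Returns an array of shape (frames - 1, bands - len(freq_kernel) + 1).
    """
    energies = np.asarray(energies, dtype=float)
    # Filter every frame along the frequency axis, keeping only outputs
    # computed from fully overlapping bands.
    filtered = np.stack(
        [np.convolve(frame, freq_kernel, mode="valid") for frame in energies]
    )
    # First difference along time, then keep only the sign as one bit.
    time_diff = filtered[1:] - filtered[:-1]
    return (time_diff > 0).astype(np.uint8)


def bit_error_rate(bits_a, bits_b):
    """Fraction of differing bits between two equally shaped fingerprints."""
    return float(np.mean(np.asarray(bits_a) != np.asarray(bits_b)))
```

Under these assumptions, one way to compare the two filters would be to compute `bit_error_rate` between the fingerprint of a clean recording and that of a noisy copy, once with each kernel; a lower rate indicates bits that survive the distortion better. Note that the three-tap kernel yields one bit fewer per frame than the two-tap kernel for the same number of bands.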
