Robust and Fast Phonetic String Matching Method for Lyric Searching Based on Acoustic Distance

Xin XU  Tsuneo KATO  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E97-D   No.9   pp.2501-2509
Publication Date: 2014/09/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2013EDP7418
Type of Manuscript: PAPER
Category: Music Information Processing
Keyword: 
lyric search,  phonetic confusion matrix,  two-pass search,  dynamic programming,  

Full Text: PDF(977.7KB)
>>Buy this Article


Summary: 
This paper proposes a robust and fast lyric search method for music information retrieval (MIR). The effectiveness of lyric search systems based on full-text retrieval engines or web search engines is highly compromised when the queries of lyric phrases contain incorrect parts due to mishearing. To improve the robustness of the system, the authors introduce acoustic distance, which is computed based on a confusion matrix of an automatic speech recognition experiment, into Dynamic-Programming (DP)-based phonetic string matching to identify the songs that the misheard lyric phrases refer to. An evaluation experiment verified that the search accuracy is increased by 4.4% compared with the conventional method. Furthermore, in this paper a two-pass search algorithm is proposed to realize real-time execution. The algorithm pre-selects the probable candidates using a rapid index-based search in the first pass and executes a DP-based search process with an adaptive termination strategy in the second pass. Experimental results show that the proposed search method reduced processing time by more than 86.2% compared with the conventional methods for the same search accuracy.