Content-Based Video Indexing and Retrieval: A Natural Language Approach

Yeun-Bae KIM  Masahiro SHIBATA  

IEICE TRANSACTIONS on Information and Systems   Vol.E79-D   No.6   pp.695-705
Publication Date: 1996/06/25
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Issue on Multimedia Computing and Communications)
Keywords: content-based video indexing, natural language processing, multi-path tree structure, dependency structure, natural language interface

This paper describes methods in which natural language is used to describe video contents, knowledge of which is needed for intelligent video manipulation. The content encoded in natural language is extracted by a language analyzer in the form of subject-centered dependency structures, which are language-oriented structures, and these are combined incrementally into a single structure called a multi-path index tree. Content descriptors and their inter-relations are extracted from the index tree in order to provide high-speed retrieval and flexibility. The content-based video index is represented as a two-dimensional structure wherein the descriptors are mapped onto a component axis and temporal references (i.e., video segments aligned to the descriptors) are mapped onto a time axis. We implemented an experimental image retrieval system to illustrate that the proposed index structure 1) has retrieval capabilities superior to those of conventional methods, 2) can be generated by an automated procedure, and 3) has a compact and flexible structure that is easily expandable, making integration with vision processing possible.
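
The two-dimensional index described in the abstract (descriptors on a component axis, video segments on a time axis) can be illustrated with a minimal sketch. The abstract does not give the paper's actual data layout, so everything here is an assumption made for illustration: the class name `VideoIndex`, the methods `add`, `segments`, and `co_occurring`, and the representation of a temporal reference as a `(start, end)` pair in seconds are all hypothetical. The sketch also leaves out the language analyzer and the multi-path index tree from which the descriptors would be derived.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical representation of a temporal reference: (start_sec, end_sec).
Segment = Tuple[float, float]


class VideoIndex:
    """Toy two-dimensional index: one row per descriptor (component axis),
    each row holding the video segments aligned to it (time axis)."""

    def __init__(self) -> None:
        self._rows: Dict[str, List[Segment]] = defaultdict(list)

    def add(self, descriptor: str, start: float, end: float) -> None:
        """Align a video segment to a content descriptor."""
        if end <= start:
            raise ValueError("segment must have positive duration")
        self._rows[descriptor.lower()].append((start, end))

    def segments(self, descriptor: str) -> List[Segment]:
        """Return all segments for a descriptor, sorted by start time."""
        return sorted(self._rows.get(descriptor.lower(), []))

    def co_occurring(self, first: str, second: str) -> List[Segment]:
        """Return the time spans in which both descriptors are present."""
        overlaps = []
        for a_start, a_end in self.segments(first):
            for b_start, b_end in self.segments(second):
                start, end = max(a_start, b_start), min(a_end, b_end)
                if start < end:  # keep only non-empty intersections
                    overlaps.append((start, end))
        return sorted(overlaps)


# Example data is invented for illustration only.
index = VideoIndex()
index.add("lion", 0.0, 12.5)
index.add("zebra", 8.0, 20.0)
index.add("lion", 30.0, 41.0)

print(index.segments("lion"))               # [(0.0, 12.5), (30.0, 41.0)]
print(index.co_occurring("lion", "zebra"))  # [(8.0, 12.5)]
```

Keeping each descriptor as a row of time intervals means that a query over several descriptors reduces to interval intersection along the time axis. This is one plausible reading of how such a layout could support fast retrieval, not a description of the authors' implementation.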