Volltext-Downloads (blau) und Frontdoor-Views (grau)
(Leider keine statistischen Daten vom 26.05. – 18.06.2018)

Comparing segmentation strategies for efficient video passage retrieval

  • We compare the effect of different text segmentation strategies on speech based passage retrieval of video. Passage retrieval has mainly been studied to improve document retrieval and to enable question answering. In these domains best results were obtained using passages defined by the paragraph structure of the source documents or by using arbitrary overlapping passages. For the retrieval of relevant passages in a video, using speech transcripts, no author defined segmentation is available. We compare retrieval results from 4 different types of segments based on the speech channel of the video: fixed length segments, a sliding window, semantically coherent segments and prosodic segments. We evaluated the methods on the corpus of the MediaEval 2011 Rich Speech Retrieval task. Our main conclusion is that the retrieval results highly depend on the right choice for the segment length. However, results using the segmentation into semantically coherent parts depend much less on the segment length. Especially, the quality of fixed length and sliding window segmentation drops fast when the segment length increases, while quality of the semantically coherent segments is much more stable. Thus, if coherent segments are defined, longer segments can be used and consequently less segments have to be considered at retrieval time.

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Christian WartenaORCiDGND
URN:urn:nbn:de:bsz:960-opus-3983
DOI:https://doi.org/10.1109/CBMI.2012.6269850
Document Type:Working Paper
Language:English
Year of Completion:2012
Release Date:2012/09/07
Tag:Multimedien; Retrieval; Segmentierung; Spracherkennung
Multimedia; Multimedia Information Retrieval; Passage Retrieval; Segmentation; Speech Recognition
GND Keyword:Information Retrieval; Medieninformatik; Segmentierung
Source:10th International Workshop on Content-Based Multimedia Indexing (CBMI), 2012
Link to catalogue:727583255
Institutes:Fakultät III - Medien, Information und Design
DDC classes:004 Informatik
Licence (German):License LogoUrheberrechtlich geschützt