TY - RPRT U1 - Arbeitspapier A1 - Wartena, Christian T1 - Comparing segmentation strategies for efficient video passage retrieval N2 - We compare the effect of different text segmentation strategies on speech based passage retrieval of video. Passage retrieval has mainly been studied to improve document retrieval and to enable question answering. In these domains best results were obtained using passages defined by the paragraph structure of the source documents or by using arbitrary overlapping passages. For the retrieval of relevant passages in a video, using speech transcripts, no author defined segmentation is available. We compare retrieval results from 4 different types of segments based on the speech channel of the video: fixed length segments, a sliding window, semantically coherent segments and prosodic segments. We evaluated the methods on the corpus of the MediaEval 2011 Rich Speech Retrieval task. Our main conclusion is that the retrieval results highly depend on the right choice for the segment length. However, results using the segmentation into semantically coherent parts depend much less on the segment length. Especially, the quality of fixed length and sliding window segmentation drops fast when the segment length increases, while quality of the semantically coherent segments is much more stable. Thus, if coherent segments are defined, longer segments can be used and consequently less segments have to be considered at retrieval time. KW - Information Retrieval KW - Segmentierung KW - Medieninformatik KW - Multimedien KW - Segmentierung KW - Retrieval KW - Spracherkennung KW - Multimedia KW - Segmentation KW - Multimedia Information Retrieval KW - Passage Retrieval KW - Speech Recognition Y1 - 2012 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:960-opus-3983 UN - https://nbn-resolving.org/urn:nbn:de:bsz:960-opus-3983 ER -