1,721,012 research outputs found
Automated threshold selection for the detection of dissolves in Mpeg video
In this paper we show how the use of an automatic threshold selection technique can improve some critical video parsing operations. To this end, in the framework of an information-theoretic approach, we propose a technique based on entropy maximization. We also present an algorithm for detecting dissolves in Mpeg compressed videos, which demonstrates the reliability of the proposed technique. Experimental results are provided and discussed
Joint audio-video processing of MPEG encoded sequences
The current research efforts in the field of video parsing and analysis are focused on the use of pictorial information, while neglecting an important supplementary source of content information such as the embedded audio or soundtrack. In contrast, in this paper we address the issue of scene change detection with the use of video and audio information. We also discuss how joint exploitation of audio and video can be thoroughly performed on MPEG encoded video sequences. First experimental results are presented and discussed
System for parsing MPEG videos
In this paper we address the issue of scene change detection on MPEG encoded video sequences with the use of combined video and audio information. We present the architecture of a system which provides an integration framework for algorithms handling both kinds of information and we show how these can be combined in order to provide a suitable segmentation of the video content. Finally, we discuss the first steps for a distributed version of the proposed architecture
A Multi-Expert System for Shot Change Detection in MPEG Movies
Shot Change Detection (SCD) in MPEG coded videos is a complex and still open research problem whose interest is growing up more and more due to the diffusion of Video Databases and Digital Libraries. Techniques providing fully satisfactory performances on complex video domains are not yet available even if a number of proposals exist; such proposals show very often to be complementary in their results. In this context, the Authors investigated the use of Multi-Expert Systems (MES) for approaching the SCD problem. In the present paper, we propose and discuss a strategy to select the SCD techniques to be combined and a method for choosing an effective combining rule. In order to assess the performance of the proposed MES, we set up a database that is significantly wider than the ones commonly used in the field. Experimental results demonstrate that the proposed system performs better than each of the single SCD technique considered
Segmentation of news videos based on audio-video information
In this paper, we propose an innovative architecture to segment a news video into the so called ‘‘stories’’ by both using the included video and audio information. Segmentation of news into stories is one of the key issues for achieving efficient treatment of news-based digital libraries. While the relevance of this research problem is widely recognized in the scientific community, we are in presence of a few established solutions in the field. In our approach, the segmentation is performed in two steps: first, shots are classified by combining three different anchor shot detection algorithms using video information only. Then, the shot classification is improved by using a novel anchor shot detection method based on features extracted from the audio track. Tests on a large database confirm that the proposed system outperforms each single video-based method as well as their combination
A Graph-based Algorithm for Cluster Detection
In some Computer Vision applications there is the need for grouping, in one or more clusters, only a part of the whole dataset. This happens, for example, when samples of interest for the application at hand are present together with several noisy samples. In this paper we present a graph-based algorithm for cluster detection that is particularly suited for detecting clusters of any size and shape, without the need of specifying either the actual number of clusters or the other parameters. The algorithm has been tested on data coming from two different computer vision applications. A comparison with other four state-of-the-art graph-based algorithms was also provided, demonstrating the effectiveness of the proposed approach
Un Sistema Multimodale per la Segmentazione di Telegiornali basato sull’Individuazione Automatica degli Speaker
A Multi-Stage Approach for News Video Segmentation based on Automatic Anchorperson number detection
- …
