Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods

Alfaro A.; Sipiran I.

Keywords: unsupervised learning, proximal methods, Video summarization

Abstract

We present a regularized reconstruction model to address video summarization. We assume a video can be viewed as a subspace formed by a selected subset of frames, with frames represented as a sparse linear combination of these selected frames. Our method selects frames that contribute to the reconstruction of the entire video by leveraging both the structure and similarity between sparse codes. The structure is provided by groups of frames showing subtle or significant changes, while the similarity ensures a balanced contribution from the frames in these groups. We propose an optimization problem to produce a sparse representation capturing the relevance of each frame, solving this non-smooth problem using proximal gradient methods. We compared our method with state-of-the-art methods through experiments using a standard dataset and a new dataset for volleyball phase analysis. Our results demonstrate that our method produces effective summaries and outperforms existing methods. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Más información

Título según WOS: Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods
Título según SCOPUS: Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods
Título de la Revista: Lecture Notes in Computer Science
Editorial: Springer Science and Business Media Deutschland GmbH
Fecha de publicación: 2025
Página de inicio: 84
Página final: 99
Idioma: English
DOI:

10.1007/978-3-031-91585-7_6

Notas: ISI, SCOPUS