Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods
Keywords: unsupervised learning, proximal methods, Video summarization
Abstract
We present a regularized reconstruction model to address video summarization. We assume a video can be viewed as a subspace formed by a selected subset of frames, with frames represented as a sparse linear combination of these selected frames. Our method selects frames that contribute to the reconstruction of the entire video by leveraging both the structure and similarity between sparse codes. The structure is provided by groups of frames showing subtle or significant changes, while the similarity ensures a balanced contribution from the frames in these groups. We propose an optimization problem to produce a sparse representation capturing the relevance of each frame, solving this non-smooth problem using proximal gradient methods. We compared our method with state-of-the-art methods through experiments using a standard dataset and a new dataset for volleyball phase analysis. Our results demonstrate that our method produces effective summaries and outperforms existing methods. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
Más información
| Título según WOS: | Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods |
| Título según SCOPUS: | Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods |
| Título de la Revista: | Lecture Notes in Computer Science |
| Editorial: | Springer Science and Business Media Deutschland GmbH |
| Fecha de publicación: | 2025 |
| Página de inicio: | 84 |
| Página final: | 99 |
| Idioma: | English |
| DOI: |
10.1007/978-3-031-91585-7_6 |
| Notas: | ISI, SCOPUS |