Discriminative Hierarchical Modeling of Spatio-Temporally Composable Human Activities

Lillo, I.; Soto, A.; Niebles, J. C.

Abstract

This paper proposes a framework for recognizing complex human activities in videos. Our method describes human activities in a hierarchical discriminative model that operates at three semantic levels. At the lower level, body poses are encoded in a representative but discriminative pose dictionary. At the intermediate level, encoded poses span a space where simple human actions are composed. At the highest level, our model captures temporal and spatial compositions of actions into complex human activities. Our human activity classifier simultaneously models which body parts are relevant to the action of interest as well as their appearance and composition using a discriminative approach. By formulating model learning in a max-margin framework, our approach achieves powerful multiclass discrimination while providing useful annotations at the intermediate semantic level. We show how our hierarchical compositional model provides natural handling of occlusions. To evaluate the effectiveness of our proposed framework, we introduce a new dataset of composed human activities. We provide empirical evidence that our method achieves state-of-the-art activity classification performance on several benchmark datasets.
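
The three semantic levels described in the abstract can be pictured with a minimal sketch. This is not the authors' method: the paper learns all levels jointly in a max-margin framework and reasons about body parts and occlusions, whereas the sketch below assumes fixed, already-known weights, a single body region, fixed-length temporal segments, and greedy bottom-up inference. All function names, parameters, and data layouts (`encode_poses`, `histogram`, `classify`, `recognize`, `segment_len`) are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import dist


def encode_poses(frames, dictionary):
    """Level 1 (assumed form): map each frame's pose descriptor to the
    index of its nearest pose-dictionary entry (Euclidean distance)."""
    return [
        min(range(len(dictionary)), key=lambda k: dist(frame, dictionary[k]))
        for frame in frames
    ]


def histogram(labels, size):
    """Normalized histogram of integer labels in the range [0, size)."""
    counts = Counter(labels)
    total = len(labels) or 1
    return [counts[i] / total for i in range(size)]


def classify(features, weights):
    """Index of the highest-scoring linear classifier (one weight row per class)."""
    scores = [sum(w * f for w, f in zip(row, features)) for row in weights]
    return max(range(len(scores)), key=scores.__getitem__)


def recognize(frames, dictionary, action_weights, activity_weights, segment_len):
    """Greedy bottom-up pass over the three levels (an illustrative simplification)."""
    words = encode_poses(frames, dictionary)
    # Level 2: label each fixed-length segment with a simple action,
    # based on the histogram of pose words it contains.
    actions = [
        classify(histogram(words[i:i + segment_len], len(dictionary)), action_weights)
        for i in range(0, len(words), segment_len)
    ]
    # Level 3: label the whole video with a complex activity,
    # based on the histogram of actions it is composed of.
    activity = classify(histogram(actions, len(action_weights)), activity_weights)
    return activity, actions
```

In this simplified picture, the per-segment action labels returned alongside the activity label play the role of the intermediate-level annotations the abstract mentions.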

More information

Title according to WOS: Discriminative Hierarchical Modeling of Spatio-Temporally Composable Human Activities
Title according to SCOPUS: Discriminative hierarchical modeling of spatio-temporally composable human activities
Journal title: 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)
Publisher: IEEE
Publication date: 2014
Start page: 812
End page: 819
Language: English
DOI: 10.1109/CVPR.2014.109

Notes: ISI, SCOPUS