Locally time-invariant models of human activities using trajectories on the Grassmannian
Title | Locally time-invariant models of human activities using trajectories on the Grassmannian |
Publication Type | Conference Papers |
Year of Publication | 2009 |
Authors | Turaga P, Chellappa R |
Conference Name | IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2009)
Date Published | 2009/06
Keywords | Grassmann manifold; human activity analysis; human action recognition; activity-based summarization; time-varying linear dynamic system; local time-invariant model; parameter estimation; statistical method; statistical analysis; distance metrics; computer vision; image recognition; video surveillance; consumer content indexing |
Abstract | Human activity analysis is an important problem in computer vision, with applications in surveillance and in the summarization and indexing of consumer content. Complex human activities are characterized by non-linear dynamics that make learning, inference and recognition hard. In this paper, we consider the problem of modeling and recognizing complex activities which exhibit time-varying dynamics. To this end, we describe activities as outputs of linear dynamic systems (LDS) whose parameters vary with time, i.e., a time-varying linear dynamic system (TV-LDS). We discuss parameter estimation methods for this class of models by assuming that the parameters are locally time-invariant. We then represent the space of LDS models as a Grassmann manifold, so that the TV-LDS model becomes a trajectory on the Grassmann manifold. We show how trajectories on the Grassmannian can be characterized using appropriate distance metrics and statistical methods that reflect the underlying geometry of the manifold. This results in more expressive and powerful models of complex human activities. We demonstrate the strength of the framework for activity-based summarization of long videos and recognition of complex human actions on two datasets. |
DOI | 10.1109/CVPR.2009.5206710 |
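Illustrative sketch (not from the paper): the abstract describes fitting LDS models over local time windows, mapping each window's LDS to a point on a Grassmann manifold, and comparing the resulting trajectories with distances that respect the manifold geometry. The Python sketch below shows one plausible realization under assumptions of my own: features arrive as a matrix Y (feature dimension x time), each window's LDS is identified with a simple PCA/least-squares procedure, the Grassmann point is the column span of a truncated observability matrix, and trajectories are compared frame-wise via the principal-angle (geodesic) distance. Window length, state dimension, the identification step, and all function names are hypothetical and not the authors' exact algorithm.

```python
# Minimal sketch of a sliding-window TV-LDS as a trajectory on the Grassmannian.
# All parameter choices below (window, step, d, m) are illustrative assumptions.
import numpy as np

def fit_lds(Y, d):
    """PCA-style identification of an LDS x_{t+1} = A x_t, y_t = C x_t
    from a feature matrix Y (n_features x T); returns (A, C)."""
    U, S, Vt = np.linalg.svd(Y, full_matrices=False)
    C = U[:, :d]                                # observation matrix
    X = np.diag(S[:d]) @ Vt[:d, :]              # estimated state sequence
    A = X[:, 1:] @ np.linalg.pinv(X[:, :-1])    # least-squares dynamics
    return A, C

def grassmann_point(A, C, m):
    """Orthonormal basis of the span of the truncated observability matrix
    [C; CA; ...; CA^(m-1)] -- a point on the Grassmann manifold."""
    blocks, M = [], np.eye(A.shape[0])
    for _ in range(m):
        blocks.append(C @ M)
        M = A @ M
    Q, _ = np.linalg.qr(np.vstack(blocks))
    return Q

def subspace_distance(Q1, Q2):
    """Geodesic (arc-length) distance on the Grassmannian via principal angles."""
    sigma = np.clip(np.linalg.svd(Q1.T @ Q2, compute_uv=False), -1.0, 1.0)
    return np.linalg.norm(np.arccos(sigma))

def trajectory(Y, window=30, step=10, d=5, m=5):
    """Locally time-invariant modeling: one Grassmann point per sliding window."""
    return [grassmann_point(*fit_lds(Y[:, t:t + window], d), m)
            for t in range(0, Y.shape[1] - window + 1, step)]

def trajectory_distance(traj1, traj2):
    """Naive frame-wise comparison of two equal-length trajectories."""
    return np.mean([subspace_distance(p, q) for p, q in zip(traj1, traj2)])
```

For two feature sequences Y1 and Y2 of equal length, trajectory_distance(trajectory(Y1), trajectory(Y2)) gives a crude dissimilarity score; the paper goes further, using distance metrics and statistical models of such trajectories for recognition and for activity-based summarization.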