Language Models for Semantic Extraction and Filtering in Video Action Recognition
Title | Language Models for Semantic Extraction and Filtering in Video Action Recognition |
Publication Type | Conference Papers |
Year of Publication | 2011 |
Authors | Tzoukermann E, Neumann J, Kosecka J, Fermüller C, Perera I, Ferraro F, Sapp B, Chaudhry R, Singh G |
Conference Name | Workshops at the Twenty-Fifth AAAI Conference on Artificial Intelligence |
Date Published | 2011/08/24 |
Abstract | The paper addresses the following issues: (a) how to represent semantic information from natural language so that a vision model can utilize it? (b) how to extract the salient textual information relevant to vision? For a given domain, we present a new model of semantic extraction that takes into account word relatedness as well as word disambiguation in order to apply to a vision model. We automatically process the text transcripts and perform syntactic analysis to extract dependency relations. We then perform semantic extraction on the output to filter semantic entities related to actions. The resulting data are used to populate a matrix of co-occurrences utilized by the vision processing modules. Results show that explicitly modeling the co-occurrence of actions and tools significantly improved performance. |
URL | https://www.aaai.org/ocs/index.php/WS/AAAIW11/paper/viewPaper/3919 |
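
The abstract describes filtering dependency relations down to action-related entities and using them to populate a co-occurrence matrix for the vision modules. The following is a minimal sketch of that last step only, not the authors' implementation. The relation triples, the `ACTIONS` and `TOOLS` vocabularies, and the function names are hypothetical, chosen for illustration. It does not model the word relatedness or word disambiguation steps that the abstract mentions.

```python
from collections import Counter

# Hypothetical vocabularies; the paper's actual action and tool sets are not given here.
ACTIONS = ["cut", "stir", "pour"]
TOOLS = ["knife", "spoon", "bowl"]


def cooccurrence_matrix(relations, actions=ACTIONS, tools=TOOLS):
    """Count (action, tool) pairs among (head, relation, dependent) triples.

    Triples whose head is not a known action, or whose dependent is not a
    known tool, are filtered out. Rows follow `actions`, columns follow `tools`.
    """
    counts = Counter(
        (head, dep)
        for head, _rel, dep in relations
        if head in actions and dep in tools
    )
    return [[counts[(a, t)] for t in tools] for a in actions]


def row_normalize(matrix):
    """Turn each row of counts into relative frequencies (all-zero rows stay zero)."""
    normalized = []
    for row in matrix:
        total = sum(row)
        normalized.append([c / total if total else 0.0 for c in row])
    return normalized


# Illustrative triples such as a dependency parser might emit for a transcript.
# These are made up for the example and are not data from the paper.
relations = [
    ("cut", "prep_with", "knife"),
    ("cut", "prep_with", "knife"),
    ("stir", "prep_with", "spoon"),
    ("pour", "prep_into", "bowl"),
    ("stir", "prep_in", "bowl"),
    ("cut", "dobj", "onion"),   # dropped: "onion" is not in TOOLS
    ("say", "dobj", "hello"),   # dropped: "say" is not in ACTIONS
]

matrix = cooccurrence_matrix(relations)
probs = row_normalize(matrix)
```

Row-normalizing gives, for each action, the relative frequency of each tool in the text. This is one plausible form in which a vision module could consume such counts as a prior; the abstract does not specify the actual interface.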