Skip to main navigation Skip to search Skip to main content

MPEG CDVS Feature Trajectories for Action Recognition in Videos

  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

Visual Action Recognition on mobile phones is a challenging problem. Mobile and wearable devices deal with power, memory, computational and hardware constraints, which mandate robust and lightweight algorithmic implementations for sophisticated vision applications, like action recognition. Compact Descriptors for Visual Search (CDVS) is an MPEG7 standard for an accelerated visual search on mobiles. In our work, we propose a mobile action recognition framework which classifies actions by tracking CDVS feature trajectories of human subjects. The proposed method capitalizes on the sparse, salient and memory efficient properties of CDVS features. Although our recognition accuracies on standard action datasets KTH, UCF50, and HMDB is not superior to the CNN based methods, our work explores and proves the feasibility of using CDVS features for action recognition.

Original languageEnglish
Title of host publicationProceedings - IEEE 1st Conference on Multimedia Information Processing and Retrieval, MIPR 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages301-304
Number of pages4
ISBN (Electronic)9781538618578
DOIs
StatePublished - Jun 26 2018
Event1st IEEE Conference on Multimedia Information Processing and Retrieval, MIPR 2018 - Miami, United States
Duration: Apr 10 2018Apr 12 2018

Publication series

NameProceedings - IEEE 1st Conference on Multimedia Information Processing and Retrieval, MIPR 2018

Conference

Conference1st IEEE Conference on Multimedia Information Processing and Retrieval, MIPR 2018
Country/TerritoryUnited States
CityMiami
Period04/10/1804/12/18

Keywords

  • Action Recognition
  • Computer Vision
  • Visual Search

Fingerprint

Dive into the research topics of 'MPEG CDVS Feature Trajectories for Action Recognition in Videos'. Together they form a unique fingerprint.

Cite this