Skip to main navigation Skip to search Skip to main content

Semantic quantization of 3D human motion capture data through spatial-temporal feature extraction

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

3D motion capture is a form of multimedia data that is widely used in animation and medical fields (such as physical medicine and rehabilitation where body joint analysis is needed). These applications typically create large repositories of motion capture data and need efficient and accurate content-based retrieval techniques. 3D motion capture data is in the form of multi-dimensional time series data. To reduce the dimensions of human motion data while maintaining semantically important features, we quantize human motion data by extracting Spatial-Temporal Features through SVD and translate them onto a 1-dimensional sequential representation through our proposed sGMMEM (semantic Gaussian Mixture Modeling with EM). Thus, we achieve good classification accuracies for primitive human motion categories (walking 92.85%,run 91.42%,jump 94.11%) and even for subtle categories (dance 89.47%,laugh 83.33%,basketball signal 85.71%,golf putting 80.00%).

Original languageEnglish
Title of host publicationAdvances in Multimedia Modeling - 14th International Multimedia Modeling Conference, MMM 2008, Proceedings
PublisherSpringer Verlag
Pages318-328
Number of pages11
ISBN (Print)3540774076, 9783540774075
DOIs
StatePublished - 2008
Event14th International Multimedia Modeling Conference, MMM2008 - Kyoto, Japan
Duration: Jan 9 2008Jan 11 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4903 LNCS

Conference

Conference14th International Multimedia Modeling Conference, MMM2008
Country/TerritoryJapan
CityKyoto
Period01/9/0801/11/08

Fingerprint

Dive into the research topics of 'Semantic quantization of 3D human motion capture data through spatial-temporal feature extraction'. Together they form a unique fingerprint.

Cite this