Abstract
We address the task of recognizing the category of an ongoing human action from a video stream. This task is challenging because of the need to output categorization decisions based on partial evidence—the action has not finished and not all information about the action has been observed. This task is further complicated because the ongoing action is submerged in the stream of data and the start of the action is not given. Existing methods for early recognition usually ignore this issue, making unrealistic assumption about the availability of the starting point of the ongoing action. In this paper, we prove the importance of starting point detection and subsequently propose a method to determine the start of an ongoing action. Our method is based on a bidirectional recurrent neural network that computes the probability of a frame to be the starting point by comparing the dynamics of the actions before and after the frame. Experiments on three datasets show that our method can reliably detect the starting point of an ongoing action, improving the early recognition accuracy.
| Original language | English |
|---|---|
| Pages (from-to) | 24-31 |
| Number of pages | 8 |
| Journal | Computer Vision and Image Understanding |
| Volume | 175 |
| DOIs | |
| State | Published - Oct 2018 |
Keywords
- Action early recognition
- Event detection
- Online action detection
Fingerprint
Dive into the research topics of 'Back to the beginning: Starting point detection for early recognition of ongoing human actions'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver