RED: Reinforced Encoder-Decoder Networks for Action Anticipation
Jiyang Gao, Zhenheng Yang and Ram Nevatia
Abstract
Action anticipation aims to detect an action before it happens. Many real world
applications in robotics and surveillance are related to this predictive capability. Current
methods address this problem by first anticipating visual representations of future frames
and then categorizing the anticipated representations to actions. However, anticipation is
based on a single past frame’s representation, which ignores the history trend. Besides,
it can only anticipate a fixed future time. We propose a Reinforced Encoder-Decoder
(RED) network for action anticipation. RED takes multiple history representations as
input and learns to anticipate a sequence of future representations. One salient aspect of
RED is that a reinforcement module is adopted to provide sequence-level supervision;
the reward function is designed to encourage the system to make correct predictions as
early as possible.
Session
Orals - Action Recognition
Files
Paper (PDF)
DOI
10.5244/C.31.92
https://dx.doi.org/10.5244/C.31.92
Citation
Jiyang Gao, Zhenheng Yang and Ram Nevatia. RED: Reinforced Encoder-Decoder Networks for Action Anticipation. In T.K. Kim, S. Zafeiriou, G. Brostow and K. Mikolajczyk, editors, Proceedings of the British Machine Vision Conference (BMVC), pages 92.1-92.11. BMVA Press, September 2017.
Bibtex
@inproceedings{BMVC2017_92,
title={RED: Reinforced Encoder-Decoder Networks for Action Anticipation},
author={Jiyang Gao, Zhenheng Yang and Ram Nevatia},
year={2017},
month={September},
pages={92.1-92.11},
articleno={92},
numpages={11},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Tae-Kyun Kim, Stefanos Zafeiriou, Gabriel Brostow and Krystian Mikolajczyk},
doi={10.5244/C.31.92},
isbn={1-901725-60-X},
url={https://dx.doi.org/10.5244/C.31.92}
}