Siddhartha Chandra, Grigorios G. Chrysos and Iasonas Kokkinos
Abstract
Viewpoint variation is one of the main challenges for object-detection frameworks. In this work we describe strategies to improve object detection pipelines by introducing viewpoint based mixture components. We learn accurate mixtures of object detectors for RGB-Depth (RGBD) data using the latent SVM framework. Our contributions are threefold. First, we use surface-based object representations (3D mesh models) from available 3D object model repositories to learn strongly supervised viewpoint classifiers. These are used to guide the first stages of model learning, and help avoid inaccurate local minima of latent SVM training. Second, we develop a geometric dataset augmentation scheme that uses scene geometry to `take another look' at the training data, simulating the effect of camera viewpoint changes. Third, to better exploit depth information, we develop a novel depth-based dense feature extraction method that provides a robust statistical description of scene geometry. We evaluate our learned detectors on the NYU dataset, and demonstrate that each of our advances results in systematic performance improvements over the traditional HOG-based detection pipeline.
Siddhartha Chandra, Grigorios G. Chrysos and Iasonas Kokkinos. Surface Based Object Detection in RGBD Images. In Xianghua Xie, Mark W. Jones, and Gary K. L. Tam, editors, Proceedings of the British Machine Vision Conference (BMVC), pages 187.1-187.13. BMVA Press, September 2015.
Bibtex
@inproceedings{BMVC2015_187,
title={Surface Based Object Detection in RGBD Images},
author={Siddhartha Chandra and Grigorios G. Chrysos and Iasonas Kokkinos},
year={2015},
month={September},
pages={187.1-187.13},
articleno={187},
numpages={13},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Xianghua Xie, Mark W. Jones, and Gary K. L. Tam},
doi={10.5244/C.29.187},
isbn={1-901725-53-7},
url={https://dx.doi.org/10.5244/C.29.187}
}