PatchIt: Self-Supervised Network Weight Initialization for Fine-grained Recognition

Patrick Sudowe and Bastian Leibe

Abstract

ConvNet training is highly sensitive to the initialization of the weights. A widespread approach is to initialize the network with weights trained for a different, auxiliary task. The ImageNet-based ILSVRC classification task is a very popular choice for this, as it has been shown to produce powerful feature representations applicable to a wide variety of tasks. However, this creates a significant entry barrier to exploring non-standard architectures. In this paper, we propose a self-supervised pretraining task, the PatchTask, to obtain weight initializations for fine-grained recognition problems, such as person attribute recognition, pose estimation, or action recognition. Our pretraining allows us to leverage additional unlabeled data from the same source, such as detection bounding boxes, which is often readily available. We experimentally show that our method outperforms a standard random initialization by a considerable margin and closely matches the ImageNet-based initialization.

Session

Recognition and Physics-based vision

Files

Extended Abstract (PDF, 314K)
Paper (PDF, 526K)

DOI

10.5244/C.30.75
https://dx.doi.org/10.5244/C.30.75

Citation

Patrick Sudowe and Bastian Leibe. PatchIt: Self-Supervised Network Weight Initialization for Fine-grained Recognition. In Richard C. Wilson, Edwin R. Hancock and William A. P. Smith, editors, Proceedings of the British Machine Vision Conference (BMVC), pages 75.1-75.12. BMVA Press, September 2016.

Bibtex

        @inproceedings{BMVC2016_75,
        	title={PatchIt: Self-Supervised Network Weight Initialization for Fine-grained Recognition},
        	author={Patrick Sudowe and Bastian Leibe},
        	year={2016},
        	month={September},
        	pages={75.1-75.12},
        	articleno={75},
        	numpages={12},
        	booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
        	publisher={BMVA Press},
        	editor={Richard C. Wilson and Edwin R. Hancock and William A. P. Smith},
        	doi={10.5244/C.30.75},
        	isbn={1-901725-59-6},
        	url={https://dx.doi.org/10.5244/C.30.75}
        }