Global Deconvolutional Networks for Semantic Segmentation

Vladimir Nekrasov, Janghoon Ju and Jaesik Choi

Abstract

Semantic image segmentation is a principal problem in computer vision, where the aim is to correctly classify each individual pixel of an image into a semantic label. Its widespread use in many areas, including medical imaging and autonomous driving, has fostered extensive research in recent years. Empirical improvements in tackling this task have primarily been motivated by successful exploitation of Convolutional Neural Networks (CNNs) pre-trained for image classification and object recognition. However, the pixel-wise labelling with CNNs has its own unique challenges: (1) an accurate deconvolution, or upsampling, of low-resolution output into a higher-resolution segmentation mask and (2) an inclusion of global information, or context, within locally extracted features. To address these issues, we propose a novel architecture to conduct the equivalent of the deconvolution operation globally and acquire dense predictions. We demonstrate that it leads to improved performance of state-of-the-art semantic segmentation models on the PASCAL VOC 2012 benchmark, reaching 74.0% mean IU accuracy on the test set.

Session

Posters 2

Files

PDF iconExtended Abstract (PDF, 513K)
PDF iconPaper (PDF, 2M)

DOI

10.5244/C.30.124
https://dx.doi.org/10.5244/C.30.124

Citation

Vladimir Nekrasov, Janghoon Ju and Jaesik Choi. Global Deconvolutional Networks for Semantic Segmentation. In Richard C. Wilson, Edwin R. Hancock and William A. P. Smith, editors, Proceedings of the British Machine Vision Conference (BMVC), pages 124.1-124.14. BMVA Press, September 2016.

Bibtex

        @inproceedings{BMVC2016_124,
        	title={Global Deconvolutional Networks for Semantic Segmentation},
        	author={Vladimir Nekrasov, Janghoon Ju and Jaesik Choi},
        	year={2016},
        	month={September},
        	pages={124.1-124.14},
        	articleno={124},
        	numpages={14},
        	booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
        	publisher={BMVA Press},
        	editor={Richard C. Wilson, Edwin R. Hancock and William A. P. Smith},
        	doi={10.5244/C.30.124},
        	isbn={1-901725-59-6},
        	url={https://dx.doi.org/10.5244/C.30.124}
        }