Multi-task Relative Attribute Prediction by Incorporating Local Context and Global Style Information

Yuhang He, Long Chen and Jianda Chen

Abstract

Relative attribute represents the correlation degree of one attribute between an image pair\,(e.g. one car image has more seat number than the other car image). While appearance highly and directly correlated relative attribute is easy to predict, fine-grained or appearance insensitive relative attribute prediction still remains as a challenging task. To address this challenge, we propose a multi-task trainable deep neural networks by incorporating an object's both local context and global style information to infer the relative attribute. In particular, we leverage convolutional neural networks (CNNs) to extract feature, followed by a ranking network to score the image pair. In CNNs, we treat features arising from intermediate convolution layers and full connection layers in CNNs as local context and global style information, respectively. Our intuition is that local context corresponds to bottom-to-top localised visual difference and global style information records high-level global subtle difference from a top-to-bottom scope between an image pair. We concatenate them together to escalate overall performance of multi-task relative attribute prediction. Finally, experimental results on 5 publicly available datasets demonstrate that our proposed approach outperforms several other state of the art methods and further achieves comparable results when comparing to very deep networks, like 152-ResNet and inception-v3.

Session

Recognition

Files

PDF iconExtended Abstract (PDF, 357K)
PDF iconPaper (PDF, 8M)

DOI

10.5244/C.30.131
https://dx.doi.org/10.5244/C.30.131

Citation

Yuhang He, Long Chen and Jianda Chen. Multi-task Relative Attribute Prediction by Incorporating Local Context and Global Style Information. In Richard C. Wilson, Edwin R. Hancock and William A. P. Smith, editors, Proceedings of the British Machine Vision Conference (BMVC), pages 131.1-131.12. BMVA Press, September 2016.

Bibtex

        @inproceedings{BMVC2016_131,
        	title={Multi-task Relative Attribute Prediction by Incorporating Local Context and Global Style Information},
        	author={Yuhang He, Long Chen and Jianda Chen},
        	year={2016},
        	month={September},
        	pages={131.1-131.12},
        	articleno={131},
        	numpages={12},
        	booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
        	publisher={BMVA Press},
        	editor={Richard C. Wilson, Edwin R. Hancock and William A. P. Smith},
        	doi={10.5244/C.30.131},
        	isbn={1-901725-59-6},
        	url={https://dx.doi.org/10.5244/C.30.131}
        }