Deep Multi-task Attribute-driven Ranking for Fine-grained Sketch-based Image Retrieval
Jifei Song, Yi-Zhe Song, Tao Xiang, Timothy Hospedales and Xiang Ruan
Abstract
Fine-grained sketch-based image retrieval (SBIR) aims to go beyond conventional SBIR to perform instance-level cross-domain retrieval: finding the specific photo that matches an input sketch. Existing methods focus on designing/learning good features for cross-domain matching and/or learning cross-domain matching functions. However, they neglect the semantic aspect of retrieval, i.e., what meaningful object properties does a user try encode in her/his sketch? We propose a fine-grained SBIR model that exploits semantic attributes and deep feature learning in a complementary way. Specifically, we perform multi-task deep learning with three objectives, including: retrieval by fine-grained ranking on a learned representation, attribute prediction, and attribute-level ranking. Simultaneously predicting semantic attributes and using such predictions in the ranking procedure help retrieval results to be more semantically relevant. Importantly, the introduction of semantic attribute learning in the model allows for the elimination of the otherwise prohibitive cost of human annotations required for training a fine-grained deep ranking model. Experimental results demonstrate that our method outperforms the state-of-the-art on challenging fine-grained SBIR benchmarks while requiring less annotation.
Session
Recognition
Files
Extended Abstract (PDF, 166K)
Paper (PDF, 590K)
DOI
10.5244/C.30.132
https://dx.doi.org/10.5244/C.30.132
Citation
Jifei Song, Yi-Zhe Song, Tao Xiang, Timothy Hospedales and Xiang Ruan. Deep Multi-task Attribute-driven Ranking for Fine-grained Sketch-based Image Retrieval. In Richard C. Wilson, Edwin R. Hancock and William A. P. Smith, editors, Proceedings of the British Machine Vision Conference (BMVC), pages 132.1-132.11. BMVA Press, September 2016.
Bibtex
@inproceedings{BMVC2016_132,
title={Deep Multi-task Attribute-driven Ranking for Fine-grained Sketch-based Image Retrieval},
author={Jifei Song, Yi-Zhe Song, Tao Xiang, Timothy Hospedales and Xiang Ruan},
year={2016},
month={September},
pages={132.1-132.11},
articleno={132},
numpages={11},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Richard C. Wilson, Edwin R. Hancock and William A. P. Smith},
doi={10.5244/C.30.132},
isbn={1-901725-59-6},
url={https://dx.doi.org/10.5244/C.30.132}
}