Transductive Multi-label Zero-shot Learning

Yanwei Fu, Yongxin Yang, Tim Hospedales, Tao Xiang and Shaogang Gong

In Proceedings British Machine Vision Conference 2014
http://dx.doi.org/10.5244/C.28.7

Abstract

Zero-shot learning has received increasing interest as a means to alleviate the often prohibitive expense of annotating training data for large scale recognition problems. These methods have achieved great success via learning intermediate semantic representations in the form of attributes and more recently, semantic word vectors. However, they have thus far been constrained to the single-label case, in contrast to the growing popularity and importance of more realistic multi-label data. In this paper, for the first time, we investigate and formalise a general framework for multi-label zero-shot learning, addressing the unique challenge therein: how to exploit multi-label correlation at test time with no training data for those classes? In particular, we propose (1) a multi-output deep regression model to project an image into a semantic word space, which explicitly exploits the correlations in the intermediate semantic layer of word vectors; (2) a novel zero-shot learning algorithm for multi-label data that exploits the unique compositionality property of semantic word vector representations; and (3) a transductive learning strategy to enable the regression model learned from seen classes to generalise well to unseen classes. Our zero-shot learning experiments on a number of standard multi-label datasets demonstrate that our method outperforms a variety of baselines.

Session

Machine Learning

Files

Extended Abstract (PDF, 1 page, 350K)
Paper (PDF, 11 pages, 479K)
Bibtex File

Presentation

Citation

Yanwei Fu, Yongxin Yang, Tim Hospedales, Tao Xiang, and Shaogang Gong. Transductive Multi-label Zero-shot Learning. Proceedings of the British Machine Vision Conference. BMVA Press, September 2014.

BibTex

@inproceedings{BMVC.28.7
	title = {Transductive Multi-label Zero-shot Learning},
	author = {Fu, Yanwei and Yang, Yongxin and Hospedales, Tim and Xiang, Tao and Gong, Shaogang},
	year = {2014},
	booktitle = {Proceedings of the British Machine Vision Conference},
	publisher = {BMVA Press},
	editors = {Valstar, Michel and French, Andrew and Pridmore, Tony}
	doi = { http://dx.doi.org/10.5244/C.28.7 }
}