Factorized Binary Codes for Large-Scale Nearest Neighbor Search

Frederick Tung and James Little

Abstract

Hashing algorithms for fast large-scale nearest neighbor search transform data points into compact binary codes by applying a set of learned or randomly generated hash functions. Retrieval accuracy generally increases with the number of hash functions, but increasing the number of hash functions also increases the storage requirements of the resulting binary codes. We present a novel factorized binary codes approach that uses an approximate matrix factorization of the binary codes to increase the number of hash functions while maintaining the original storage requirements. The proposed approach does not assume a particular algorithm for generating the hash functions, and requires only that we can discover and take advantage of correlations among the hash functions. Experiments on publicly available datasets suggest that factorized binary codes work particularly well for locality-sensitive hashing algorithms.

Session

Posters 1

Files

PDF iconExtended Abstract (PDF, 151K)
PDF iconPaper (PDF, 1M)

DOI

10.5244/C.30.34
https://dx.doi.org/10.5244/C.30.34

Citation

Frederick Tung and James Little. Factorized Binary Codes for Large-Scale Nearest Neighbor Search. In Richard C. Wilson, Edwin R. Hancock and William A. P. Smith, editors, Proceedings of the British Machine Vision Conference (BMVC), pages 34.1-34.12. BMVA Press, September 2016.

Bibtex

        @inproceedings{BMVC2016_34,
        	title={Factorized Binary Codes for Large-Scale Nearest Neighbor Search},
        	author={Frederick Tung and James Little},
        	year={2016},
        	month={September},
        	pages={34.1-34.12},
        	articleno={34},
        	numpages={12},
        	booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
        	publisher={BMVA Press},
        	editor={Richard C. Wilson, Edwin R. Hancock and William A. P. Smith},
        	doi={10.5244/C.30.34},
        	isbn={1-901725-59-6},
        	url={https://dx.doi.org/10.5244/C.30.34}
        }