Distance Metric Learning by Optimization on the Stiefel Manifold

Ankita Shukla and Saket Anand

Abstract

Distance metric learning has proven to be very successful in various problem domains. Most techniques learn a global metric in the form of a nxn symmetric positive semidefinite (PSD) Mahalanobis distance matrix, which has O(n^2) unknowns. The PSD constraint makes solving the metric learning problem even harder making it computationally intractable for high dimensions. In this work, we propose a flexible formulation that can employ different regularization functions, while implicitly maintaining the positive semidefiniteness constraint. We achieve this by eigendecomposition of the rank p Mahalanobis distance matrix followed by a joint optimization on the Stiefel manifold S_{n,p} and the positive orthant R^p_+. The resulting nonconvex optimization problem is solved by employing an alternating strategy. We use a recently proposed projection free approach for efficient optimization over the Stiefel manifold. Even though the problem is nonconvex, we empirically show competitive classification accuracy on UCI and USPS digits datasets.

Session

Workshop: 1st International Workshop on DIFFerential Geometry in Computer Vision for Analysis of Shapes, Images and Trajectories (DIFF-CV)

Files

PDF iconPaper (PDF, 279K)

DOI

10.5244/C.29.DIFFCV.7
https://dx.doi.org/10.5244/C.29.DIFFCV.7

Citation

Ankita Shukla and Saket Anand. Distance Metric Learning by Optimization on the Stiefel Manifold. In H. Drira, S. Kurtek, and P. Turaga, editors, Proceedings of the 1st International Workshop on DIFFerential Geometry in Computer Vision for Analysis of Shapes, Images and Trajectories (DIFF-CV 2015), pages 7.1-7.10. BMVA Press, September 2015.

Bibtex

@inproceedings{DIFFCV2015_7,
	title={Distance Metric Learning by Optimization on the Stiefel Manifold},
	author={Ankita Shukla and Saket  Anand},
	year={2015},
	month={September},
	pages={7.1-7.10},
	articleno={7},
	numpages={10},
	booktitle={Proceedings of the 1st International Workshop on DIFFerential Geometry in Computer Vision for Analysis of Shapes, Images and Trajectories (DIFF-CV 2015)},
	publisher={BMVA Press},
	editor={H. Drira, S. Kurtek, and P. Turaga},
	doi={10.5244/C.29.DIFFCV.7},
	isbn={1-901725-56-1},
	url={https://dx.doi.org/10.5244/C.29.DIFFCV.7}
}