BMVA 
The British Machine Vision Association and Society for Pattern Recognition 

BibTeX entry

@PHDTHESIS{200610Philip_Tresadern,
  AUTHOR={Philip Tresadern},
  TITLE={Visual Analysis of Articulated Motion},
  SCHOOL={Oxford University},
  MONTH=Oct,
  YEAR=2006,
  URL={http://www.bmva.org/theses/2006/2006-tresadern.pdf},
}

Abstract

The ability of machines to recognise and interpret human action and gesture from standard video footage has wide-ranging applications for control, analysis and security. However, in many scenarios the use of commercial motion capture systems is undesirable or infeasible (eg intelligent surveillance). In particular, commercial systems are restricted by their dependence on markers and the use of multiple cameras that must be synchronized and calibrated by hand. It is the aim of this thesis to develop methods that relax these constraints in order to bring inexpensive, off-the-shelf motion capture several steps closer to a reality. In doing so, we demonstrate that image projections of important anatomical landmarks on the body (specifically, joint centre projections) can be recovered automatically from image data. One approach exploits geometric methods developed in the field of Structure From Motion (SFM), whereby point features on the surface of an articulated body impose constraints on the hidden joint locations, even for a single view. An alternative approach explores Machine Learning to employ context-specific knowledge about the problem in the form of a corpus of training data. In this case, joint locations are recovered from similar exemplars in the training set via searching, sampling or regression. Having recovered such points of interest in an image sequence, we demonstrate that they can be used to synchronize and calibrate a pair of cameras, rather than employing complex engineering solutions. We present a robust algorithm for synchronizing two sequences, of unknown and different frame rates, to sub-frame accuracy. Following synchronization, we recover affine structure using standard methods. The recovered affine structure is then upgraded to a Euclidean co-ordinate frame via a novel self-calibration procedure that is shown to be several times more efficient than existing methods without sacrificing accuracy. Throughout the thesis, methods are quantitatively evaluated on synthetic data for a ground truth comparison and qualitatively demonstrated on real examples.