Next:
3 Solid Objects
Up:
Visual Tracking of Solid
Previous:
1 Introduction
The method used to track an object silhouette in an image sequence is
based on
a framework for spatiotemporal control of the tracking of visual
contours
[
2
].
A contour is efficiently represented in parametric form as a closed
B-spline (
X
(
s
),
Y
(
s
)):
where
and
are the control point vectors and
is a vector of basis functions. In our tracking framework
are chosen
depending on the shape
of the contour, which is approximately known in advance, enabling
efficient contour tracking. More specifically, spline segments can be
linear or quadratic
for linear and curved contour parts, respectively. Furthermore, corners
in the silhouette arising from vertices at the object surface or from
T-junctions are modeled as corners in the spline
.
The contour is assumed to evolve over time with a uniform 2D motion,
where randomly varying forces acting on the contour are modeled as an
additive Gaussian noise process. In an augmented state space
,
representing the position and
the velocity, a simple dynamic model can be introduced as a stochastic
differential equation:
where
is a zero mean, temporally uncorrelated noise process. A similar
equation applies for
. The
and
processes are independent, assuming isotropic noise processes.
The B-spline contour represents the current estimate of the real silhouette, which can be measured in the images as contrast edges . Measurements of the visual feature are made in the normal direction along the B-spline contour at sample points. The measurements are then used to update the current B-spline contour.
A steady state Kalman filter is derived to enhance the tracking process according to the assumed underlying motion model of the contour. A detailed discussion of the spatio-temporal filtering can be found in [ 2 ] and [ 12 ].
The expected shape of the visual feature in the first frame is used to initialize the system state of the contour tracker. However, its influence decreases continuously, so that, in a steady state Kalman filter, no shape memory remains. For tracking silhouettes of 3D rigid objects it is desirable to assure, that the tracker retains some shape memory throughout the tracking process by means of a persistent template. A persistent template enforces additional smoothness on the B-splines. Furthermore, the tracker should not only be attracted to the initial template, but to a set generated by a certain group of transformations on the template, in order to account for the transformations a 3D rigid object (and hence its silhouette) might undergo in space.
The contour generator of a curved object is viewpoint dependent, which makes it impossible to regard it as a rigid space curve and utilizing the persistent template suggested in [ 10 ]. However, due to the availability of the 3D pose tracker we are able to predict the silhouette in the future. For small viewpoint changes we can then approximate the allowed silhouette changes by linear combinations of the current and future silhouettes. This can be integrated in the tracking framework by restricting the state space onto a subspace spanned by a few silhouettes.