Abstract
The objective evaluation of 2D-shape estimation results for moving
objects in a video sequence is still an open problem. First approaches
in the literature evaluate the spatial accuracy and the temporal
coherency of the estimated 2D object shape. Thereby, it is not
distinguished between several estimation errors located around the
object contour and a few, but larger, estimation errors. Both cases
would lead to similar evaluation results, although the 2D-shapes
would be visually very different. To overcome this problem, in this
paper, a new evaluation approach is proposed. In it, the evaluation of
the spatial accuracy and the temporal coherency is based on the mean
and the standard deviation of the 2D-shape estimation errors.