Determining the Similarity Between Two Arbitrary 2-D Shapes and Its Application to Biological Objects

doi:10.15344/2456-4451/2018/139

Graphy

Profile

New User

New to Graphy Publications? Create an account to get started today.

Create Account For:

Registered Users

Have an account? Sign in now.

Article Menu

Journal Menu

How to Cite | Author Information | Publication History | Funding Information

International Journal of Computer & Software Engineering Volume 3 (2018), Article ID 3:IJCSE-139, 12 pages
https://doi.org/10.15344/2456-4451/2018/139

Research Article

Determining the Similarity between Two Arbitrary 2-D Shapes and its Application to Biological Objects

Petra Perner

Institute of Computer Vision and applied Computer Sciences, IBaI, Leipzig, Germany

Corresponding Author Details: Prof. Dr. Petra Perner, Institute of Computer Vision and applied Computer Sciences, IBaI, Arno-Nitzsche-Str. 45, 04277 Leipzig, Germany; E-mail: pperner@ibai-institut.de

Received: 22 October 2018; Accepted: 14 November 2018; Published: 16 October 2018

Citation: Perner P (2018) Determining the Similarity between Two Arbitrary 2-D Shapes and its Application to Biological Objects. Int J Comput Softw Eng 3: 139. doi: https://doi.org/10.15344/2456-4451/2018/139

Funding: The project “Development of methods and techniques for the image-acquisition and computer-aided analysis of biologically dangerous substances BIOGEFA” is sponsored by the German Ministry of Economy BMWA under the grant number 16IN0147.

Copyright: © 2018 Perner. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Generalized shape models of objects are necessary to match and identify an object in an image. To acquire these kind of models special methods are necessary that allow to learn the similarity pair-wise similarity between shapes. They mainly concern is the establishment of point correspondences between two shapes and the detection of outlier. Known algorithm assume that the aligned shapes are quite similar in a way. But special problems arise if we have to align shapes that are very different, for example aligning concave to convex shapes. In such cases it is indispensable to take into account the order of the pointsets and to enforce legal sets of correspondences; otherwise the calculated distances are incorrect. We present our novel shape alignment algorithm which can also handle such cases. The algorithm establishes symmetric and legal one-to-one point correspondences between arbitrary shapes, represented as ordered sets of 2D-points and returns a distance measure which runs between 0 and 1.

1. Introduction

The analysis of shapes and shape variation is of great importance in a wide variety of disciplines. Initially in 1917 D’Arcy Thompson [1] studied the field of geometrical shape analysis from a biological point of view. Intuitively it is especially interesting for biologists since shape is one of the most concise features of an object class and may change over time due to growth or evolution. The problems of shape spaces and distances have been intensively studied by Kendall [2] and Bookstein [3] in a statistical theory of shape. We follow Kendall [2] who defined shape as all the geometrical information that remains when location, scale, and rotational effects are filtered out from the object. Sometimes it is also interesting to retain the size of the objects, but we will not consider this case here. In most applications this invariance is not given a-priori, thus it is indispensable to transform the acquired shapes into a common reference frame. Different terms are equivalently used for this operation, i.e. superimposition, registration, and alignment. Only if the shapes are aligned, it will be possible to compare them in order to describe deformations or to define a measure distance between them.

Bookstein geometrically analyzed shapes and measured their change in many biological applications, e.g. bee wings, skulls, and i.e. human schizophrenia brains [4]. To give an example, with his morphometric studies he found out that the region connecting the two hemispheres is narrower in schizophrenic brains as in normal brains. In digital image processing the statistical analysis of shape is a fundamental task in object recognition and classification. It concern applications in a wide variety of fields, e.g. [5].

2. The Problem of Alignment of 2-D Shapes

Consider two shape instances P and O defined by the point-sets p_i ε R², i= 1,2,...,N_p and o_k ε R², k=1,2,...,N₀ respectively. The basic task of aligning two shapes consists of transforming one of them (say P) so that it fits in some optimal way the other one (say O) (see Figure 1).

Figure 1: Alignment of two shapes instances with superimposition and similarity transformation.

Generally the shape instance P={p_i} is said to be aligned to the shape instance O={o_k} if a distance d(P,O) between the two shapes can not be decreased by applying a transformation ψ to P. Various alignment approaches are known [6-12]. They mainly differ in the kind of mapping (i.e. similarity [8], rigid [13], affine [14]) and the chosen distance measure. A survey of different distance measures used in the field of shape matching can be found in [15].

For calculating a distance between two shape instances the knowledge of corresponding points is required. If the shapes are defined by sets of landmarks [16,17] the knowledge of point correspondences is implicit. However at the beginning of many applications this condition is not hold and often it is hard or even impossible to assign landmarks to the acquired shapes. Then it is necessary to automatically determine point correspondences between the points of two aligned shapes P and O, see Figure 2.

Figure 2: Aligned shapes with established point correspondences.

One of the most essential demands on these approaches is symmetry. Symmetry means obtaining the same correspondences when mapping instance P to instance O and vise versa instance O to instance P. This requirement is often bound up with the condition to establish one-to-one correspondences. This means a point o_k in shape instance O has exactly one corresponding point p_k in shape instance P. If we compare point sets with unequal point numbers under the condition of one-to-one mapping then it is clear that some points will not have a correspondence in the other point set. These points are called outlier.

Special problems arise if we have to align shapes that are very different, for example aligning concave to convex shapes. In these cases it is indispensable to take into account the order of the point-sets and to enforce legal sets of correspondences, otherwise the calculated distances are incorrect. This paper presents our novel algorithm for aligning arbitrary 2D-shapes, represented as ordered point-sets. In our work natural shapes are acquired manually from real images. The object shapes can appear with varying orientation, position, and scale in the image. The shapes are arbitrary and there is nothing special about them. They might have a great natural variation. They also might be very similar or very dissimilar. They even might have concave or convex shapes. The algorithm establishes symmetric and legal one-to-one point correspondences between arbitrary shapes, represented as ordered sets of 2D-points and returns a distance measure which runs between 0 and 1.

3. Related Work

The problems of shape spaces and distances have been intensively studied [3,2] in a statistical theory of shape. The well-known Procrustes distance [8,16] between two point-sets P and O is defined as the sum of squared distances between corresponding points

d(P,O)= ∑ i=1 N PO ‖ ( p i − μ p ) σ P −R(θ) o i − μ o σ o ‖ 2 (1)

where R(θ) is the rotation matrix, μP and μ_O are the centroids of the object P and O respectively, σ_P and σ_O are the standard deviations of the distance of a point to the centroid of the shapes and N_PO is the number of point correspondences between the point-sets P and O. This example shows that the knowledge of correspondences is an important prerequisite for calculation of shape distances.

There has been done a lot of work which concerns the problem of automatic finding point correspondences between two unknown shapes. Hill et al. [14] presented an interesting framework for the determination of point correspondences. First, the algorithm calculates pseudo-landmarks based on a set of two-dimensional polygonal shapes. The polygon approximation is controlled by an automatic calculated threshold in order to identify a subset of points on each shape. In the next step they establish an initial estimate of correspondences based on the arc-path length of the polygons. Finally, the greedy algorithm is used as an iterative local optimization scheme to modify the correspondences in order to minimize the distance between both polygons. The complexity of the greedy algorithm is O(nlog n) for sorting the elements and O(n) for the selection. Brett and Taylor [12] presented the extension of this method to threedimensional surfaces. However, the algorithm was only applied to groups of objects from the same category. They assume that all acquired shapes are similar and compared them under non-rigid transformation.

Another popular approach in solving the correspondence problem is called Iterative Closest Point (ICP) developed by Besl and McKay [18] and further improved in [10,11,19]. Given a set of initial, estimated registrations the ICP automatic converges to the nearest local minimum of a mean squared distance metric. It establishes correspondences by mapping the points of one shape to their closest points on the other shape. In the original version of the ICP [18] the complexity of finding for each point p_k in P the closest point in the point-set O is O(N_pN_o) in worst case. Marte et al. [11] improved this complexity by applying a spatial subdivision of the points in the set O. They used clustering techniques to limit the search space of correspondences for a point p_k to points which are located within a defined range around p_k. But this can only be done because they assume that the point-sets are already in a close proximity, i.e. a reasonably good initial registration state is already given. In general, the ICP is a very simple algorithm which can also be applied to shapes with geometric data of different representations (e.g. point-sets, line segments, curves, surfaces). On the other hand it is not very robust with respect to noise and outlier. Fitzgibbon [10] replaced the closed-form inner loop of the ICP by the Levenberg-Marquardt algorithm, a non-linear optimization scheme. His results showed an increased robustness and a reduction of the dependence of the initial estimated registrations without a significant loss of speed.

The main problem of the ICP is that is does not guarantee to produce a legal set of correspondences. By a legal set of correspondences is meant that there are no inversions between successive pairs of correspondences (see figure 3). In detail, when starting from a reference pair of corresponding points and traveling successive around the complete boundaries to pairs of correspondences, the arc path lengths in relation to the reference points always have to be increasing.

Figure 3: Illegal and legal sets of correspondences.

An extension of the classical Procrustes alignment to point-sets of differing point counts is known as the Softassign Procrustes Matching algorithm [8]. It alternates between solving for correspondence, the spatial mapping, and the Procrustes rescaling. The Softassign Procrustes Matching algorithm is also an iterative process and uses deterministic annealing to solve the optimization problem. In deed it is a very timeconsuming and computationally expensive procedure. They applied the algorithm to dense point sets not to closed boundaries. One-toone correspondences between points were established and points without correspondences are rejected as outlier. The establishment of point correspondences is only held in a nearest neighbor framework so they do not guarantee to produce legal sets of correspondences.

Another solution of the correspondence problem was presented by Belongie et al. [20]. He added to each point in the set a descriptor called shape context. The shape context of a point is a histogram which contains information about the relative distribution of all other points in the set. The histogram can provide invariance to translation, scale, and rotation. The cost of mapping two points is calculated by comparing their histograms using the χ² test statistic. The χ² distances between all possible pairs of points between the two shapes have to be calculated which results in a square distance matrix. The best match is found where the sum of distances between all matched histograms has reached its minimum. To solve this square assignment problem they applied a shortest augmenting path algorithm for bipartite graph matching which has a time complexity of O(n³). The result is a set of one-to-one correspondences between points with similar shape contexts. The algorithm can also handle point-sets with different point counts by integrating dummy points. But it is also not guaranteed that legal sets of correspondences were established.

In none of these works is described the problem of aligning a convex to a concave shape. There arise special problems: Let us suppose that the concave shape representing the letter C is compared with the shape of the letter O (see figure 4). If the pair-wise correspondences were established between nearest neighbored points the resulting distance between both shape instances will be very small (see figure 4A). But intuitively we would say that these shapes are not very similar. In particular in such cases it is necessary to regard the order of point correspondences and to remove correspondences if they produce inversions. In addition to legality of correspondences it is important to take into account the complete contours of both shapes (see figure 4B). In result it can be seen that there arise big distances between corresponding points which leads to an increased distance measure.

Figure 4: Establishing correspondences while mapping a concave and convex shape.

4. Material Used for this Study

The materials we used for this study are fungal strains that are naturally 3-D objects but that are acquired in a 2-D image. These objects have a great variance in the appearance of the shape of the object because of their nature and the imaging constraints. Six fungal strains representing species with different spore types were used for the study. Figure 5 shows one of the acquired images for each analyzed fungal strain.

Figure 5: Images of six different fungi strains.

The strains were obtained from the fungal stock collection of the Institute of Microbiology, University of Jena / Germany and from the culture collection of JenaBios GmbH. All strains were cultured in Petri dishes on 2% malt extract agar (Merck) at 24°C in an incubation chamber for at least fourteen days. For microscopy fungal spores were scrapped off from the agar surface and placed on a microscopic slide in a drop of lactic acid. Naturally hyaline spores were additionally stained with lactophenol cotton blue (Merck). A database of images from the spores of these species was produced.

5. Acquisition of Shape Cases

5.1 Background

The acquisition of object shapes from real images is still an essential problem of image segmentation. For automated image segmentation often low-level methods, such as edge detection [8] and region growing [21,22] are used to extract the outline of objects from an image. Low-level methods yield good results if the objects have conspicuous boundaries and are not occluded. In the case of complex backgrounds and occluded or noisy objects, the shape acquisition may result in strong distorted and incorrect cases.

Therefore the acquisition is often performed manually at the cost of a very subjective, time-consuming procedure. In some studies it might be sufficient to reduce the shapes of objects to some characteristic points which are common features of the object class. Landmark coordinates [2,4,17,23] are manually assigned by an expert to some biologically or anatomically significant points of an organism. This has to be done for every single object separately. Additional landmarks can be automatic constructed using a combination of existing landmarks. The calculation of the position of these constructed landmarks has to by precisely defined, i.e. landmark C is located in the midpoint of the shortest line between landmark A and landmark B. Every single landmark is not only defined by its position but also by the specific and unique feature that it represents.

Landmarks might provide information about special features of an object contour but cannot capture the shape of an object because important characteristics might be lost. In many applications it is insufficient or impossible to describe the shape of an object only by landmarks. Then it is a common procedure to trace and capture the complete outlines of the objects in the images manually [14]. Indeed, manually image segmentation may be a very time-consuming and also inaccurate procedure. Therefore new semi-automatic approaches were developed [24,25] for interactive image segmentation. These approaches use live-wire segmentation algorithms which are based on a graph search to locate mathematically optimal boundaries in the image. If the user moves the mouse cursor in the proximity of an object edge, the labeled outline is automatically adjusted to the boundary. The manual acquisition of objects from real images with the help of semi-automatic segmentation approaches is faster and more precise.

As described in above we are actually studying the shape of airborne fungi strains. The great natural variance in the shape of these objects and the imaging constraints have the effect that a manual assignment of landmark coordinates is quite impossible. We also reject a fully automatic segmentation procedure because this might produce outlier shapes due to the objects might be touching and overlapping. Therefore we decided to use a manually labeling procedure in our application.

5.2 Acquisition of object contours from real images

As a shape we are considering the outline of an object but not the appearance of the object inside the contour. Therefore we want to elicit from the real image the object contour P represented by a set of N_P boundary points p_k, k=1,2,...,N_p. The user starts labeling the shape of the object at an arbitrary pixel p₁, of the contour P. In our application we are only interested in the complete and closed outline of the objects. So after having traced the object, the labeling should end at a pixel p_m, k=1,2,...,N_p in the 8-neighbourhood of P₁. Because it might be difficult to exactly meet this pixel manually, the contour will be closed automatic using the Bresenham [26] procedure. This algorithm connects two pixels in a raster graphic with a digital line. In addition to that we demand that two successive points may not be defined by exactly the same point coordinates in the image. In consequence we obtain an ordered, discrete, and closed sequence of points where the Euclidean distance between two successive points is either 1 or 2 . In Figure 6 the scheme of the acquisition of a shape is shown for illustration.

Figure 6: Scheme of the labeled object outline.

Figure 7 presents a screenshot from our developed program Case Acquisition and Case Mining - CACM while labeling a shape of the strain Ulocladium Botrytis with the point coordinates on the right side of the screenshot.

Figure 7: Labeled shape with coordinates.

In fact image digitization and human imprecision always implies an error during the acquisition of the object shapes. It might be very difficult to exactly determine and meet every boundary pixel of an object when manually labeling the contour of an object. Also the quantization of a continuous image constitutes a reduction in resolution which causes considerable image distortion (Moiré effect). Furthermore the contour of an object in a digitized image may be blurred which means the contour is extended over a set of pixels with decreasing grey values.

5.3 Approximation

The amount of acquired contour points N_P of a shape P depends on the resolution of the input image and on the area and the contour length of the object. To speed up the succeeding computation time of the following alignment process we introduced a polygonal approximation. The resulting number of points from a polygonal approximation of the shape will be influenced by the chosen order of the polygon and the allowed approximation error. We apply the approximation only to shapes which consist out of more than 200 points. This limit was introduced because the contour of very small objects in images might be defined by a few pixels only. A further reduction of information about the contour of these objects would be disadvantageous.

For the polygonal approximation, we used the approach based on the area/length ratio according to Wall and Daniellson [27] because it is a very fast and simple algorithm without time-consuming mathematic operations. Suppose the set of N_P points p₁,p₂,...,p_n, defining the contour of the object P, for which a polygonal approximation is desired. We use the first labeled point p₁, as the starting point for the first approximation. Next we virtually draw a line segment from this starting point to the successor point in P. The area A between this line and the corresponding contour segment of P is measured. If the area A divided by the length of the line L is smaller then a predefined threshold T, then the same process is repeated for the next successor point in the set P (see Figure 8).

Figure 8: Polygonal approximation based on the area/length ratio [27].

This procedure repeats until the ratio exceeds the threshold T. In that case the current point p_m, becomes the end point of the approximated line and the starting point for the next approximation. The same process is then repeated until the first point p₁, is reached. The result of the approximation is a subset of points of the contour P.

The ratio A L controls the maximal error of the approximation since A is the area and L the side length of a virtual rectangle. If the ratio is low then the other side of the virtual rectangle is small.

5.4 Normalization of shape cases

In our application we demand the distance measure between two shapes to be invariant under translation, scale, and rotation. Thus, in a preprocessing step we remove differences in translation and we rescale the shapes so that the maximum distance of each point from the centroid of the shape will not be larger than one. The invariance to rotation will be calculated during the alignment process.

The centroid μ ⇀ of a set of N_P points is given by:

x μ ⇀ = 1 N P ∑ i=1 N P x i y μ ⇀ = 1 N P ∑ i=1 N P y i (2)

To obtain invariance under translation we translate the shape so that its centroid is at the origin

x i ' = x i − x μ ⇀ i=1,2,... ,N P y i ' = y i − y μ ⇀ (3)

Furthermore for each shape we calculate the scale factor t_p:

t p = 1 max i x ' i 2 +y ' i 2 ; i=1,2,... ,N P (4)

This scale factor is applied to all points of the shape P to obtain the transformed shape instance P'. The set of acquired shapes are now superposed on the origin of a common coordinate system. The maximum Euclidean distance of a point to the origin of the coordinate system is one.

6. Shape Alignment and Distance Calculation

6.1 The alignment algorithm

In the Procrustes Distance (see Eq. 1) each of the shapes is rescaled in such a way that the standard deviation of all points to the centroid is identical (σ_P=σ_O). This normalization does not ensure that the resulting distance runs between 0 and 1. Since we are interested in comparing objects of different categories we rescale the shapes so that the maximum distance of a point to the centroid is not larger than 1. In comparison to Procrustes Distance we also do not need μ_P and μ_O because we have already removed the translation by transforming the objects into the origin.

The differences in rotation will be removed during our iterative alignment algorithm. In each iteration of this algorithm the first shape is rotated stepwise while the second shape is kept fixed. For every transformed point in the first shape we try to find a corresponding point on the second shape. Based on the distance between these corresponding points the alignment score is calculated for this specific iteration step. When the first shape is rotated once around its centroid, finally that rotation is selected and applied where the minimum alignment score was calculated.

We are regarding arbitrary shapes with varying orientations and different point counts and do not have any information about how the points of two shape instances have to be mapped onto each other. As already stated, we have to solve the correspondence problem before we are able to calculate the distance between two shape instances. In summary, for the establishment of point correspondences we demand the following facts:

Produce only legal sets of correspondences,
Produce one-to-one point correspondences,
Determine points without a correspondence as outlier, and
Produce a symmetric result, which is obtaining the same results when aligning instance P to instance O as when aligning instance O to P.

It was shown above that the establishment of legal sets of correspondences is an important fact to distinguish between concave and convex shapes. The drawback of this requirement is that the acquired shapes have to be ordered point-sets. The demand for establishing only legal sets of correspondences is also the main reason why we decided to extend the initial version of our alignment algorithm [28] that can only align concave with concave and convex with convex shapes. We extended our nearest neighbor-search algorithm presented there so that it can handle the correspondence problem while aligning concave with convex object shapes.

The input for our alignment algorithm is a pair of two normalized shapes P and O as described above. We define the shape instance P as the one which has less contour points then the second shape instance O. The instance with more points is always that one, which will be aligned to shape instance with less points. That means shape P will be kept fixed while shape O will be stepwise rotated to match P.

Before the iterative algorithm starts we define a range where to search for potential correspondences. This range is defined by a maximum deviation of the orientation according to the centroid (see Figure 9). This restriction will help us to produce legal sets of correspondences. The maximum permissible deviation of orientation γ_dev will be calculated in dependence of the amount of contour points n_O of the shape O, which is the instance that has more points than the other one. Our investigations showed that the following formula leads to a well-sized range

Figure 9: The permissible deviation of orientation for finding a point correspondent of the point p_k is illustrated. The line marks the orientation of the point p_k. The range where to search for a correspondence is shown.

γ dev =± 4π n o (5)

The outline of our iterative alignment algorithm is as follows:

6.2 Calculation of point correspondences

The most difficult and time-consuming part is the establishment of point-correspondences. For each point p_k with 1≤k≤n_p in the set P we try to establish exactly one corresponding point oj with 1≤j≤n_o in the set O. To create a list of potential correspondents we first select all points in O which are located in the permissible range of orientation (see Figure 9) and insert them into this list of potential correspondents {CorrList(p_k)} of p_k. Each of the selected points meets the following condition

( γ P k − γ dev )≤ γ o j ≤( γ pk + γ dev ) (6)

with γ_pk the angle of point p_k and γ_dev the permissible range of orientation around p_k.

In Figure 10 is shown a part of two aligned shape instances at this state of registration. Each point in shape P is connected by lines to all of its potential correspondences. It can also be seen that there are outlier on shape O. As an outlier we define all points where no correspondence could be established. Resulting points without correspondences are a logical consequence if we demand one-to-one correspondences when mapping shapes with unequal number of points. Such kind of points does not have any influence on the calculated distance.

Figure 10: Part from a screenshot made during the alignment of two shape cases. The outer dotted shape (P) is aligned to the inner dotted shape (O). Each point p_k in P is connected by lines with all of its potential correspondences that are stored in {CorrList(p_k)}.

If no point in O is located in the permissible range {CorrList(p_k)} is empty), p_k is marked as an outlier. Normally in case of aligning convex objects there is no point in P which has not at least one potential correspondence. The permissible range of orientation is chosen big enough anyway. Indeed, this kind of outliers occurs only in cases of aligning convex to concave objects. Each point p_k which has not a single potential corresponding point in O is included with the maximum distance of one when calculating the distance measure.

If there is at least one potential correspondent we calculate the squared Euclidean distances between p_k and each point in this list. Then the points in this list {CorrList(p_k)} were sorted with ascending distances in relation to p_k. In succession we check for each point in this list if there was already assigned a corresponding point in P. The first found point without a correspondence is selected and we establish a one-to-one correspondence between them. If all points in the list {CorrList(p_k)} have already a correspondence the point p_k is marked as an outlier. In contrast to those outliers in P which do not even have one potential correspondence in O, this kind of outlier will not be included in calculating the distance measure.

Let us assume we had gone through the complete set P and assigned to each point p_k either a correspondence in O or marked it as an outlier. In the next step we want to ensure to produce a legal set of correspondences. Therefore we go through the set P again and remove all one-to-one correspondences that are inversions (see figure 3). Suppose we have found a point p_k assigned to a correspondence o_g in O that produces an inversion with the point p_m which is assigned to the point o_f (k,m≤n_p; g,f≤n_o). First we try to remove the inversion by switching the correspondences, i.e. p_k will be assigned to of and p_m will be assigned to o_g. Otherwise we remove the correspondence of p_k and check in succession all the remaining points in the list {CorrList(p_k)} if it is possible to assign a correspondence with one of these points that does not produce an inversion. If we find a point in this list, we establish the one-to-one correspondence between these two points. Otherwise we have to mark p_k as an outlier.

6.3 Calculation of the distance measure

The distance between two shapes P and O is calculated based on the sum of Euclidean distances. If there was assigned a correspondence to the point p_k, 1≤k≤n_p in p, the Euclidean distance between this point and its corresponding point o_k in O is calculated by

d( P k , o k )= ( X p − X o ) 2 + ( y p − y o ) 2 (7)

If the point p_k is an outlier because there could not be assigned at least one potential correspondence and the list {CorrList(p_k)} is empty, the distance is set with the maximal value of one

d( p k , o k )=1 (8)

The sum of all pair-wise distances between two shapes P and O is defined by

ε(P,O)= ∑ k=1 n p d( P k , O k ) (9)

Due to the fact that we interested in obtaining a distance measure which runs between zero and one, we normalize the sum according to n_P, the number of points in P

ε ¯ (P,O)= 1 n p ε(P,O) (10)

The mean Euclidean distance alone is not a sufficiently satisfactory measure of distance. Particularly in cases of wavy or jagged contours important information get lost using the averaged distance. Therefore we also calculated the maximum distance between a pair of established correspondences

ε max (P,O)=max ∑ k=1 n p d( p k , o k ) (11)

The final similarity measure is the weighted sum of the mean Euclidean distance and the maximum Euclidean distance

Score(P,O)=α ε ¯ (P,O)+β ε max (P,O) (12)

The choice of the value for the weights α and β depends on the importance the user wants to respective distance measure. We chose for α and β a value of 0.5 which results in an equal influence of both distances to the overall distance.

6.4 Evaluation of our alignment algorithm

At first we will demonstrate that the algorithm is symmetric (see Figure 11), i.e. the same distance is calculated when aligning instance P to instance O as when aligning instance O to P. As described in above the instance with more contour points is always that one which will be aligned to shape instance with less points. Therefore, the order in which the cases are given as input into the algorithm doesn’t matter. A special case where the order matters is if both shape instances are defined by the same amount of contour points.

Figure 11: Evaluation of Symmetry.

The following table presents some results of pair-wise aligned shape cases. In the left column of the table the visual results are shown with connecting lines between corresponding points. The right column of the table presents the calculated scores and the number of outlier which are included in the alignment score with a distance value of one. The alignment score of value zero means identity and the value of 0.5 can be understood as neutral. Up to a value of one the shapes become more and more dissimilar.

Figure 12: Different shapes with corresponding distance measures and the alignment scores.

Again we take a closer look at the pair-wise alignment of a concave and a convex shape. Reconsider the example from above where the shape of the letter O has to be aligned to the shape of a letter C. Figure 13 presents a screenshot from our program where a similar situation occurred. There were twelve outlier determined on the convex shape which are included in the calculation of the alignment score. As more of these outliers are included as more dissimilar the two shapes become because each of them is assigned with the maximum distance of one.

Figure 13: Screenshot with outlier, marked as green dots, occurred during the alignment of a circle and a concave shape.

Under these circumstances the points on the inside of the concave shape may not have any influence on the resulting alignment score. This is due to the fact that they were not mapped onto opposite points of the contour of the convex shape (see figure 4B). In deed, this is an error of the algorithm but otherwise it may happen that the alignment score exceeds the value one.

7. Conclusions

We have proposed a method for the acquisition of shape instances and our novel algorithm for aligning arbitrary 2D-shapes, represented by ordered point-sets of varying size. Our algorithm aligns two shapes under similarity transformation; differences in rotation, scale, and translation are removed. It establishes one-to-one correspondences between pairs of shapes and ensures that the found correspondences are symmetric and legal. The method detects outlier points and can handle some amount of noise. We have evaluated that the algorithm also works well if the aligned shapes are very different, like i.e. the alignment of concave and convex shapes. A distance measure which runs between 0 and 1 is returned as result.

The methods are implemented in the program CACM Version 1.4 which runs on a Windows PC.

Competing Interests

The author declare that there is no competing interests regarding the publication of this article.

References

Thompson D (1917) On Growth and Form. Cambridge University Press, Cambridge, UK.
Kendall DG (1989) A Survey of the Statistical Theory of Shape. Statistical Science 4: 87-120. View
Bookstein FL (1986) Size and Shape Spaces for Landmark Data in Two Dimensions. Statistical Science 1: 181-242. View
Bookstein FL (1997) Landmark Methods for Forms without Landmarks: Morphometrics of Group Differences in Outline Shape. Med Image Anal 1: 225-244. View
Alon J, Athitsos V, Sclaroff S (2005) Online and Offline Character Recognition Using Alignment to Prototypes. IEEE Computer Society Press. View
Huttenlocher D, Klanderman G, Rucklidge W (1993) Comparing Images Using the Hausdorff Distance, IEEE Trans. Pattern Analysis and Machine Intelligence 15: 850-863. View
Alt H, Guibas LJ (1996) Discrete Geometric Shapes: Matching, Interpolation and Approximation A Survvey. Elsevier Science Publishers. View
Rangarajan A, Chui H, Bookstein FL (1997) The Softassign Procrustes Matching Algorithm, Proc. Information Processing in Medical Imaging. View
Sclaroff S, Pentland A (1995) Modal Matching for Correspondence and Recognition. IEEE Trans Pattern Analysis and Machine Intelligence 17: 545- 561. View
Fitzgibbon AW (2001) Robust Registration of 2D and 3D Point Sets. In Proc British Machine Vision Conference 2: 411-420. View
Marte OC, Marais P (2002) Model-Based Segmentation of CT Images. In South African Computer Journal 28: 54-59. View
Brett AD, Taylor CJ (1999) A Framework for Automated Landmark Generation for Automated 3D Statistical Model Construction. Proc Information Processing in Medical Imaging. View
Feldmar J, Ayache N (1996) Rigid, Affine and Locally Affine Registration of Free-Form Surfaces. The International Journal of Computer Vision 18: 99- 119. View
Hill A, Taylor CJ, Brett AD (2000) A Framework for Automatic Landmark Identification Using a New Method of Nonrigid Correspondence. IEEE Transactions on Pattern Analysis and Machine Intelligence 22: 241-251. View
Veltkamp RC (2001) Shape Matching: Similarity Measures and Algorithms. Shape Modelling International. View
Lele SR, Richtsmeier JT (2001) An Invariant Approach to Statistical Analysis of Shapes. Chapman & Hall / CRC.
Dryden IL, Mardia KV (1998) Statistical Shape Analysis. John Wiley & Sons Inc. View
Besl P, McKay N (1992) A Method for Registration of 3-D Shapes. IEEE Trans. Pattern Analysis and Machine Intelligence 14: 239-256. View
Aksenov P, Clark I, Grant D, Inman A, Vartikovski L, et al. (2003) 3D Thermography for Quantification of Heat Generation Resulting From Inflammation. Proc 8th 3D Modelling symposium, Paris, France. View
Belongie S, Malik J, Puzicha J (2002) Shape Matching and Object Recognition Using Shape Contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence 24: 509-522. View
Kass M, Witkin A, Terzopoulos D (1987) Snakes: Active contour models. In 1st International Conference on Computer Vision. View
Cheng DC, Schmidt-Trucksäss A, Cheng KS, Burkhardt H (2002) Using Snakes to Detect the Intimal and Aventitial Layers of the Common Carotid Artery Wall in Sonographic Images. Computer Methods and Programs in Biomedicine 67: 27-37. View
Cootes TF, Taylor CJ (1999) A Mixture Model for Representing Shape Variation. Image and Vision Computing 17: 567-574. View
Mortensen EN, Barrett WA (1995) Intelligent Scissors for Image Composition. In Computer Graphics Proceedings. View
Haenselmann T, Effelsberg W (2003) Wavelet-Based Semi-Automatic Live- Wire Segmentation. Proceedings of the SPIE Human Vision and Electronic Imaging 4662: 260-269. View
Bresenham JE (1965) Algorithm for Computer Control of a Digital Plotter. IBM Systems Journal 4: 25-30. View
Wall K, Daniellson PE (1984) A Fast Sequential Method For Polygonal Approximation of Digitized Curves. Comput Graph Image Process 28: 220- 227. View
Perner P, Jänichen S (2004) Learning of Form Models from Exemplars. Lisbon/Portugal, Springer Verlag lncs 3138: 153-161.

Best viewed in Mozilla Firefox, Google Chrome, Safari, Above IE 9.0 version