Image Outline Detection Algorithms

This article presents the four most common contour detection algorithms.

The first two, the square trace algorithm and Moore-neighbor tracing, are easy to implement and are therefore often used to find the contour of a given pattern. Unfortunately, both algorithms have several weaknesses that prevent them from detecting the contour of a large class of patterns because of the particular kind of connectivity those patterns have.

These algorithms ignore all “holes” in the pattern. For example, if we have a pattern like the one shown in Figure 1, the contour detected by the algorithms will look like the one shown in Figure 2 (the outline is marked with blue pixels). In some application areas this is perfectly acceptable, but in others, such as character recognition, the inner parts of the pattern must also be detected in order to find all the spaces that distinguish a particular character. (Figure 3 shows the “full” outline of the pattern.)

Therefore, to obtain a complete contour, you must first use the “hole search” algorithm that determines the holes in a given pattern, and then apply the contour detection algorithm to each hole.

What is connectivity?

In a binary digital image, a pixel has one of two values: 1 when it is part of the pattern, or 0 when it is part of the background; there are no shades of gray. (We will assume that pixels with value 1 are black and pixels with value 0 are white.)

To identify objects in a digital pattern, we need to find groups of black pixels that are “connected” to each other. In other words, the objects in a given digital pattern are the connected components of this pattern.

In the general case, a connected component is a set of black pixels P such that for each pair of pixels p_i and p_j in P there is a sequence of pixels p_i, ..., p_j such that:

a) all the pixels in the sequence are in the set P, i.e. are black, and

b) every two pixels that are adjacent in the sequence are “neighbors”.

An important question arises: when can we say that two pixels are “neighbors”? Since we use square pixels, the answer is not trivial: in a square tessellation, two pixels share an edge, share only a vertex, or have nothing in common. Each pixel has 8 pixels that share an edge or a vertex with it; these pixels make up the "Moore neighborhood" of that pixel. Should pixels that share only a vertex be considered “neighbors”? Or must two pixels share an edge to be considered “neighbors”?

This gives two types of connectivity: 4-connectivity and 8-connectivity.

4-connectivity

When can we say that a given set of black pixels is 4-connected? First, we need to define the concept of a 4-neighbor (also called a direct neighbor):

4-neighbor definition: A pixel Q is a 4-neighbor of a given pixel P if Q and P share an edge. The 4-neighbors of the pixel P (designated P2, P4, P6 and P8) are shown in Figure 2 below.

Definition of a 4-connected component: a set of black pixels P is a 4-connected component if for each pair of pixels p_i and p_j in P there is a sequence of pixels p_i, ..., p_j such that:

a) all the pixels in the sequence are in the set P, i.e. are black, and

b) every two pixels that are adjacent in the sequence are 4-neighbors.

Examples of 4-connected patterns

The diagrams below show examples of 4-connected patterns:

8-connectivity

When can we say that a given set of black pixels is 8-connected? First, we need to define the concept of an 8-neighbor (also called an indirect neighbor):

8-neighbor definition: A pixel Q is an 8-neighbor (or just a neighbor) of a given pixel P if Q and P share an edge or a vertex. The 8-neighbors of a given pixel P make up the Moore neighborhood of that pixel.

Definition of an 8-connected component: a set of black pixels P is an 8-connected component (or just a connected component) if for each pair of pixels p_i and p_j in P there is a sequence of pixels p_i, ..., p_j such that:

a) all the pixels in the sequence are in the set P, i.e. are black, and

b) every two pixels that are adjacent in the sequence are 8-neighbors.

Note: every 4-connected pattern is also 8-connected, i.e. the 4-connected patterns are a subset of the set of 8-connected patterns. On the other hand, an 8-connected pattern is not necessarily 4-connected.
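The two definitions can be compared with a short sketch. It assumes the image is stored as a list of rows holding 0 (white) and 1 (black); the names N4, N8 and count_components are made up for this illustration and do not come from the article.

```python
from collections import deque

# Neighbor offsets as (row, column) steps. Names are illustrative only.
N4 = [(-1, 0), (0, 1), (1, 0), (0, -1)]            # pixels sharing an edge
N8 = N4 + [(-1, -1), (-1, 1), (1, 1), (1, -1)]     # plus pixels sharing a vertex


def count_components(grid, offsets):
    """Count connected components of black (1) pixels for a neighbor relation."""
    rows, cols = len(grid), len(grid[0])
    seen = set()
    count = 0
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1 or (r, c) in seen:
                continue
            # New component: flood-fill it with a breadth-first search.
            count += 1
            seen.add((r, c))
            queue = deque([(r, c)])
            while queue:
                cr, cc = queue.popleft()
                for dr, dc in offsets:
                    nr, nc = cr + dr, cc + dc
                    inside = 0 <= nr < rows and 0 <= nc < cols
                    if inside and grid[nr][nc] == 1 and (nr, nc) not in seen:
                        seen.add((nr, nc))
                        queue.append((nr, nc))
    return count


diagonal = [
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1],
]
print(count_components(diagonal, N4))  # three separate 4-connected components
print(count_components(diagonal, N8))  # one 8-connected component
```

A pattern is 4-connected (or 8-connected) exactly when the corresponding count is 1, so the diagonal above is 8-connected but not 4-connected.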

Example of an 8-connected pattern

The diagram below shows a pattern that is 8-connected but not 4-connected:

Example of a pattern that is not 8-connected

The diagram below shows an example of a pattern that is not 8-connected, i.e. composed of more than one connected component (the diagram shows three connected components):

Square Trace Algorithm

Idea

The idea behind the square trace algorithm is very simple, perhaps because it was one of the first attempts at detecting the contour of a binary pattern.

To understand how it works, you need a little imagination ...

Suppose we have a digital pattern, i.e. a group of black pixels on a background of white pixels (the grid). Find a black pixel and declare it the “initial” pixel. (The “initial” pixel can be found in many different ways; we will start from the lower left corner of the grid and scan each column of pixels from bottom to top, from the leftmost column to the rightmost, until we come across a black pixel. We declare that pixel the “initial” pixel.)
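The column scan described above might look as follows. This is only a sketch: it assumes the image is a list of rows of 0 and 1 with row 0 at the top, and find_start is a hypothetical helper name.

```python
def find_start(grid):
    """Return (row, col) of the first black pixel found by the column scan.

    Columns are visited from left to right and each column is scanned from
    the bottom row up, as in the text. Returns None for an all-white grid.
    """
    rows, cols = len(grid), len(grid[0])
    for c in range(cols):
        for r in range(rows - 1, -1, -1):   # bottom row first
            if grid[r][c] == 1:
                return (r, c)
    return None


pattern = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
print(find_start(pattern))  # lowest black pixel of the leftmost non-empty column
```

Because of the scan order, the pixel directly below the initial pixel and every pixel in the columns to its left are white, and the scan reaches the initial pixel while moving upward.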

Now imagine that you are a ladybug standing on the starting pixel, as shown in Figure 1 below. To get the outline of a pattern, you need to do the following:

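Every time you find yourself standing on a black pixel, turn left and step forward; every time you find yourself standing on a white pixel, turn right and step forward. Keep going until you arrive at the starting pixel again.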

The black pixels you walked over form the outline of the pattern.

An important aspect of the square trace algorithm is the “sense of direction”. Turns to the left and right are performed relative to the current location, which depends on how you got to the current pixel. Therefore, in order to make the right moves, you need to track your direction.

Algorithm

The following is a formal description of the square trace algorithm:

Input: a square tessellation T containing a connected component P of black cells.

Output: a sequence B (b₁, b₂, ..., b_k) of boundary pixels, i.e. the contour.

Begin

Set B to be empty.
Scan the cells of T from bottom to top and left to right until a black pixel s of P is found.
Insert s into B.
Set the current pixel p to be the starting pixel s.
Turn left, i.e. go to the pixel adjacent to p on the left.
Update p, i.e. set it to be the current pixel.
While p is not equal to s do

If the current pixel p is black
- insert p into B and turn left (go to the pixel adjacent to p on the left).
- Update p, i.e. set it to be the current pixel.
otherwise
- turn right (go to the pixel adjacent to p on the right).
- Update p, i.e. set it to be the current pixel.
End While

End

Note: the concepts of “left” and “right” should be considered not with respect to the page or reader, but with respect to the direction of entry into the “current” pixel during scanning.
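The formal description can be turned into a small program. The sketch below is one possible reading of it, under several assumptions that are not in the article: the image is a list of rows of 0 and 1 with row 0 at the top, everything outside the grid counts as white, the starting pixel is entered heading upward (because the column scan moves upward), and a pixel visited more than once is recorded only once. The function names are hypothetical.

```python
def find_start(grid):
    """Scan columns left to right, each one bottom to top; first black pixel."""
    rows, cols = len(grid), len(grid[0])
    for c in range(cols):
        for r in range(rows - 1, -1, -1):
            if grid[r][c] == 1:
                return (r, c)
    return None


def square_trace(grid):
    """Square tracing with the original rule: stop on reaching the start again."""
    rows, cols = len(grid), len(grid[0])

    def is_black(r, c):
        # Assumption: pixels outside the grid are white background.
        return 0 <= r < rows and 0 <= c < cols and grid[r][c] == 1

    start = find_start(grid)
    if start is None:
        return []
    boundary = [start]
    # Heading is a (row step, col step) pair; (-1, 0) means "up".
    dr, dc = -1, 0
    # The start pixel is black, so turn left and step.
    dr, dc = -dc, dr
    r, c = start[0] + dr, start[1] + dc
    while (r, c) != start:
        if is_black(r, c):
            if (r, c) not in boundary:
                boundary.append((r, c))
            dr, dc = -dc, dr        # turn left relative to the heading
        else:
            dr, dc = dc, -dr        # turn right relative to the heading
        r, c = r + dr, c + dc
    return boundary


square = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
print(square_trace(square))
```

For the 2x2 block the walk visits the four black pixels clockwise, starting from the lower left one. Keeping the heading as a pair of steps makes "left" and "right" relative to the direction of travel, which is exactly the point of the note above.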

Demonstration

The following is an animated demonstration of how the square trace algorithm detects the outline of a pattern. Remember that the ladybug moves from pixel to pixel; notice how its direction changes when it turns left and right. Left and right turns are made relative to the current heading, i.e. the ladybug's orientation.

Analysis

It turns out that the capabilities of the square trace algorithm are very limited. It is unable to detect the contours of a large family of patterns that often arise in real-world applications.

This is mainly because left and right turns do not take into account the pixels located diagonally from the current pixel.

Let's look at the different patterns with different connectivity and see why the square trace algorithm fails. In addition, we will study ways to improve the capabilities of the algorithm and make it work even with patterns that have a special kind of connectivity.

Stopping criterion

One of the weaknesses of the algorithm is the choice of the stopping criterion. In other words, when does an algorithm stop executing?

In the original description of the square trace algorithm, the condition for completion is to hit the initial pixel a second time. It turns out that if the algorithm depends on such a criterion, then it will not be able to detect the contours of a large family of patterns.

The following is an animated demo explaining how the algorithm cannot detect the exact contour of the pattern due to the selection of a bad stopping criterion:

As you can see, improving the stopping criterion is a good first step toward improving the overall performance of the algorithm. There are two effective alternatives to the existing stopping criterion:

a) Stop only after visiting the starting pixel n times, where n is at least 2, OR

b) Stop after entering the starting pixel a second time in the same way we entered it initially, i.e. with the same direction of travel.

The second criterion was proposed by Jacob Eliosoff, so we will call it Jacob's stopping criterion.
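The difference between the original criterion and Jacob's stopping criterion can be seen in a short sketch. It makes the same assumptions as before (rows of 0 and 1, row 0 at the top, white outside the grid, the start pixel entered heading upward); the function name and the jacob flag are invented for this illustration.

```python
def square_trace(grid, start, jacob=True):
    """Square tracing from `start`, which is assumed to be entered heading up.

    jacob=False: stop as soon as the start pixel is reached again.
    jacob=True:  stop only when the start pixel is re-entered with the
                 initial heading (Jacob's stopping criterion).
    """
    rows, cols = len(grid), len(grid[0])

    def is_black(r, c):
        # Assumption: pixels outside the grid are white background.
        return 0 <= r < rows and 0 <= c < cols and grid[r][c] == 1

    initial_heading = (-1, 0)            # (row step, col step): up
    pos, heading = start, initial_heading
    boundary = []
    while True:
        if is_black(*pos):
            if pos not in boundary:
                boundary.append(pos)
            heading = (-heading[1], heading[0])    # turn left
        else:
            heading = (heading[1], -heading[0])    # turn right
        pos = (pos[0] + heading[0], pos[1] + heading[1])
        if pos == start and (not jacob or heading == initial_heading):
            return boundary


corner = [
    [0, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
start = (2, 1)  # found by the bottom-to-top, left-to-right column scan
print(square_trace(corner, start, jacob=False))  # stops after a single pixel
print(square_trace(corner, start, jacob=True))   # traces all three pixels
```

On this small corner-shaped pattern the walk returns to the starting pixel from above after only four steps, so the original criterion reports a one-pixel contour. With Jacob's criterion the walk continues until it re-enters the starting pixel heading upward, by which time every black pixel has been visited.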

Changing the stopping criterion generally improves the performance of the square trace algorithm, but it does not overcome the other weaknesses the algorithm has with patterns that have particular kinds of connectivity.

The square trace algorithm fails to detect the contour of a family of patterns that are 8-connected but NOT 4-connected.

The following is an animated demonstration of how the square trace algorithm (with Jacob's stopping criterion) fails to detect the correct outline of a pattern that is 8-connected but not 4-connected:

Is this algorithm completely useless?

After reading the analysis above, you probably think that the square trace algorithm fails to detect the outlines of most patterns. It turns out, however, that there is a special family of patterns whose contour the square trace algorithm detects completely.

Let P be a 4-connected set of black pixels on the grid, and let the white pixels of the grid, i.e. the background pixels W, also be 4-connected. Under these conditions on the pattern and its background, it can be proved that the square trace algorithm (with Jacob's stopping criterion) always succeeds in determining the contour.
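Whether a given image meets these two conditions can be checked mechanically. The sketch below is an illustration rather than part of the original article: it assumes a list-of-rows image of 0 and 1, pads it with a one-pixel white border so that the background outside the image is taken into account, and uses hypothetical helper names.

```python
from collections import deque

N4 = [(-1, 0), (0, 1), (1, 0), (0, -1)]  # edge-sharing neighbors


def is_4_connected(grid, value):
    """True if the cells equal to `value` form exactly one 4-connected set."""
    rows, cols = len(grid), len(grid[0])
    cells = {(r, c) for r in range(rows) for c in range(cols)
             if grid[r][c] == value}
    if not cells:
        return False
    first = next(iter(cells))
    seen = {first}
    queue = deque([first])
    while queue:
        r, c = queue.popleft()
        for dr, dc in N4:
            n = (r + dr, c + dc)
            if n in cells and n not in seen:
                seen.add(n)
                queue.append(n)
    return len(seen) == len(cells)


def pad(grid):
    """Surround the image with a one-pixel white border."""
    width = len(grid[0]) + 2
    return [[0] * width] + [[0] + list(row) + [0] for row in grid] + [[0] * width]


def pattern_and_background_4_connected(grid):
    padded = pad(grid)
    return is_4_connected(padded, 1) and is_4_connected(padded, 0)


corner = [[0, 1], [1, 1]]                      # no holes, no diagonal-only contact
ring = [[1, 1, 1], [1, 0, 1], [1, 1, 1]]       # the white center is cut off
diagonal = [[1, 0], [0, 1]]                    # black pixels touch only at a vertex
print(pattern_and_background_4_connected(corner))
print(pattern_and_background_4_connected(ring))
print(pattern_and_background_4_connected(diagonal))
```

Only the first pattern satisfies both conditions; the ring fails because its background is split in two, and the diagonal fails because the pattern itself is not 4-connected.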

Below is a proof that when both the pattern and the background pixels are 4-connected, the square trace algorithm with Jacob's stopping criterion correctly determines the contour.

Proof

Given: a pattern P such that both the pattern pixels (i.e. black) and the background pixels W (i.e. white) are 4-connected.

First observation

Since the set of white pixels W is 4-connected, there cannot be any “holes” in the pattern (informally, by “holes” we mean groups of white pixels completely surrounded by black pixels of the pattern).

The presence of any “hole” in the pattern would separate a group of white pixels from the remaining white pixels, and the set of white pixels would then lose its 4-connectivity.

Figure 2 and Figure 3 below show two types of “holes” that can occur in a 4-connected pattern:

Second observation

Any two black pixels of a pattern MUST have one common side.

Suppose two black pixels share only a vertex. Then, to satisfy the 4-connectedness of the pattern, there must be a path connecting these two pixels such that every two adjacent pixels on the path are 4-neighbors. But this gives us a pattern similar to the one in Figure 3. In other words, it separates white pixels from the rest of the background. Figure 4 below shows a typical pattern satisfying the assumption that the pattern and the background are both 4-connected, i.e. there are no “holes” and every two black pixels share a side:

It is useful to represent such patterns as follows:

First we consider the boundary pixels, i.e. the outline of the pattern. If we think of each boundary pixel as having 4 edges of unit length, we see that some of these edges are shared with adjacent white pixels. We call such edges boundary edges.

These boundary edges can be viewed as the edges of a polygon. Figure 5 below illustrates this idea with the polygon corresponding to the pattern from Figure 4 above:

If we consider all the possible “configurations” of boundary pixels that may occur in such patterns, then we will see that there are two simple cases, shown in Figure 6 and Figure 7 below.

The boundary pixels may be multiples of these cases or different positionings, i.e. rotations, of these two cases. The boundary edges are marked in blue as E1, E2, E3 and E4.

Third observation

For the two cases discussed above, no matter which starting pixel we choose and from whichever direction we enter it, the square trace algorithm will never “go back” (backtrack), will never “pass through” a boundary edge twice (unless it traces the border a second time), and will never miss a boundary edge. Check it out!

Two concepts need to be clarified here:

a) the algorithm “goes back” when, before tracing the entire border, it returns to a pixel it has already visited, and

b) for each boundary edge there are two ways to “pass through it”, namely “inward” and “outward” (where “inward” means moving toward the inside of the corresponding polygon and “outward” means moving toward the outside of the polygon).

In addition, when the algorithm passes “inward” through one boundary edge, it passes “outward” through the next boundary edge, i.e. the square trace algorithm cannot pass through two consecutive boundary edges in the same way.

Last observation

Each pattern has an even number of boundary edges .

If you look at the polygon from Figure 5, you can see the following.

If we start from the vertex S marked in the diagram and follow the boundary edges until we reach S again, we traverse an even number of boundary edges. Each boundary edge can be thought of as a “step” in a particular direction. For each “step” to the right there must be a corresponding “step” to the left if we are to return to the starting position. The same applies to vertical “steps”. The “steps” therefore come in matching pairs, which explains why each of these patterns has an even number of boundary edges.

Therefore, when the square trace algorithm passes through the initial boundary edge (of the starting pixel) a second time, it does so in the same direction as the first time.

The reason is that there are two ways to pass through a boundary edge, the algorithm alternates between “inward” and “outward”, and the number of boundary edges is even, so the algorithm passes through the initial boundary edge the second time in the same way as the first.
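The pairing of “steps” can be checked numerically on any pattern. The sketch below counts boundary edges by the side of the black pixel they lie on; it assumes a list-of-rows image of 0 and 1 in which pixels outside the grid are white, and count_boundary_edges is a hypothetical name.

```python
def count_boundary_edges(grid):
    """Count edges shared by a black pixel and a white pixel, per side."""
    rows, cols = len(grid), len(grid[0])
    sides = {"up": (-1, 0), "right": (0, 1), "down": (1, 0), "left": (0, -1)}
    counts = {name: 0 for name in sides}
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1:
                continue
            for name, (dr, dc) in sides.items():
                nr, nc = r + dr, c + dc
                inside = 0 <= nr < rows and 0 <= nc < cols
                # Assumption: anything outside the grid is white background.
                if not inside or grid[nr][nc] == 0:
                    counts[name] += 1
    return counts


corner = [
    [0, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
edges = count_boundary_edges(corner)
print(edges)
print(sum(edges.values()))
```

For this pattern each side contributes two edges, eight in total. Edges on the upper side of a pixel are horizontal “steps” in one direction and edges on the lower side are “steps” in the opposite direction, so their counts match, and likewise for the left and right sides; the total is therefore even.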

Conclusion

When both the pattern and the background are 4-connected, the square trace algorithm traces the entire border, i.e. the contour, of the pattern and stops after tracing it once. It does not trace the border again, because when it reaches the initial boundary edge for the second time, it enters it in the same way as the first time. Therefore, the square trace algorithm with Jacob's stopping criterion correctly determines the contour of any pattern, provided that both the pattern and the background are 4-connected.

Moore-Neighbor Tracing

Idea

The idea behind Moore-neighbor tracing is simple, but before explaining it we need to introduce an important concept: the Moore neighborhood of a pixel.

Moore neighborhood

The Moore neighborhood of a pixel P is the set of 8 pixels that share a vertex or an edge with that pixel. These pixels, namely P1, P2, P3, P4, P5, P6, P7 and P8, are shown in Figure 1.

The Moore neighborhood (also called the 8-neighbors or indirect neighbors) is an important concept that is often referred to in the literature.
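In code, the Moore neighborhood is usually a table of eight offsets. The figure with the numbering is not reproduced here, so the layout below is an assumption: P1 is taken to be the upper left neighbor and the numbers increase clockwise, which agrees with the earlier statement that P2, P4, P6 and P8 are the 4-neighbors. Offsets are (row step, column step) with row 0 at the top.

```python
# Assumed numbering: P1 at the upper left, then clockwise around the pixel.
MOORE = {
    "P1": (-1, -1), "P2": (-1, 0), "P3": (-1, 1),
    "P4": (0, 1),
    "P5": (1, 1), "P6": (1, 0), "P7": (1, -1),
    "P8": (0, -1),
}


def moore_neighborhood(r, c):
    """Coordinates of the 8 pixels sharing an edge or a vertex with (r, c)."""
    return [(r + dr, c + dc) for dr, dc in MOORE.values()]


# The 4-neighbors are the offsets that change only one coordinate.
four_neighbors = [name for name, (dr, dc) in MOORE.items() if dr == 0 or dc == 0]
print(four_neighbors)
print(moore_neighborhood(1, 1))
```

Listing the offsets in a fixed clockwise order is convenient for tracing, because the algorithm walks around a pixel through its neighbors in exactly that order.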

Now we are ready to look at the idea behind Moore-neighbor tracing.

Let there be a digital pattern, i.e. a group of black pixels on a background of white pixels (the grid). Find a black pixel and declare it the “initial” pixel. (There are several ways to find the “initial” pixel; as before, we will start from the lower left corner and scan the columns of pixels in order, each from bottom to top, until we find the first black pixel, which we declare the “initial” pixel.)

Once again, imagine that you are a ladybug standing on the starting pixel, as shown in Figure 2 below. Without loss of generality, we will detect the outline by moving around the pattern clockwise. (It does not matter which direction we choose, as long as we use it consistently throughout the algorithm.)

The general idea is this: every time we land on a black pixel P, we backtrack, i.e. return to the white pixel we were standing on before. Then we go around pixel P clockwise, visiting every pixel in its Moore neighborhood, until we land on a black pixel. The algorithm terminates when the starting pixel is visited a second time.

Those black pixels that the algorithm visited will be the outline of the pattern.
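The idea can be sketched in a few lines. This is an illustrative reading of the description above, not a reference implementation: it assumes a list-of-rows image of 0 and 1 with row 0 at the top, treats pixels outside the grid as white, records each contour pixel once, uses the simple stopping rule (stop when the starting pixel is reached again), and adds an iteration cap as a safety measure. All names are hypothetical.

```python
# Offsets (row step, col step) around a pixel, clockwise, starting from "up".
CLOCKWISE = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]


def find_start(grid):
    """Scan columns left to right, each one bottom to top; first black pixel."""
    rows, cols = len(grid), len(grid[0])
    for c in range(cols):
        for r in range(rows - 1, -1, -1):
            if grid[r][c] == 1:
                return (r, c)
    return None


def moore_trace(grid):
    rows, cols = len(grid), len(grid[0])

    def is_black(r, c):
        # Assumption: pixels outside the grid are white background.
        return 0 <= r < rows and 0 <= c < cols and grid[r][c] == 1

    start = find_start(grid)
    if start is None:
        return []
    boundary = [start]
    p = start
    b = (start[0] + 1, start[1])   # backtrack pixel: the white pixel below start
    for _ in range(8 * rows * cols):          # safety cap on the number of moves
        k = CLOCKWISE.index((b[0] - p[0], b[1] - p[1]))
        found = None
        for step in range(1, 8):              # walk clockwise around p, after b
            dr, dc = CLOCKWISE[(k + step) % 8]
            candidate = (p[0] + dr, p[1] + dc)
            if is_black(*candidate):
                found = candidate
                break
            b = candidate                     # last white pixel seen so far
        if found is None or found == start:   # isolated pixel, or back at start
            break
        p = found
        if p not in boundary:
            boundary.append(p)
    return boundary


square = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
diagonal = [
    [0, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
print(moore_trace(square))
print(moore_trace(diagonal))
```

Because the walk around each pixel includes the diagonal neighbors, the second pattern, whose two pixels touch only at a vertex, is traced completely.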