ORB Feature

ORB features are composed of two parts: key points and descriptor.

Two steps:

Key point: Oriented FAST. Compared with the original FAST, the main direction of the feature points is calculated
Descriptor: BRIEF (Binary Robust Independent Elementary Feature) . Describe the surrounding image area where the feature points were extracted in the previous step. ORB has made some improvements to BRIEF, mainly referring to utilizing the previously calculated direction

1. FAST Key points

FAST is a kind of corner point, which mainly detects the obvious grayscale changes locally, and is known for its fast speed.

1.1 Main Idea

If a pixel is very different from the neighboring pixels (too bright or too dark), then it is more likely to be a corner points.

Compared with other corner detection algorithms, FAST only needs to compare the brightness of the pixels, which is very fast.

1.2 Entire Procedure

Select pixel $p$ in the image assuming its brightness as $I_p$
Set a threshold $T$ (for example, 20% of $I_p$ )
Take the pixel $p$ as the center, and select the 16 pixels on a circle with a radius of 3.
If there are consecutive N points on the selected circle whose brightness is greater than $I_p + T$ or less than $I_p - T$ , then the central pixel $p$ can be considered as a feature point ( $N$ usually takes 12, which is FAST-12).
Iterate though the above four steps on each pixel.

1.3 Good

In the FAST-12 algorithm, to speed up, checking the brightness of the 1, 5, 9, and 13-th pixels on the circle for each pixel can quickly exclude a lot of pixels that are not corner points. Only when 3 out of 4 pixels are greater than $I_p + T$ or less than $I_p - T$ , may the current pixel be a corner points, otherwise it should be excluded directly. Such 'pre-processing' operation greatly accelerates FAST corner detection.
Non-maximal suppression: the original FAST corners are often ”clustered”, meaning a lot of FAST corners present in the same area. Therefore, after the initial detection, non-maximal suppression is required. Only corner points with maximum response in a certain area will be retained to avoid the corners concentrating.

1.4 Bad

The calculation of FAST feature points only compares the brightness difference between pixels, thus the speed is very fast, but it suffers from bad repeatability and uneven distribution.
FAST corner points do not include direction information.
It fixed the radius of circle as 3, there is also a scaling problem: a place that looks like a corner from a distance may not be a corner when it comes close.

2. Oriented FAST Key points

ORB adds the description of scale and rotation. The scale invariance is achieved by the image pyramid, and detect corner points on each layer of the pyramid. The rotation of features is realized by the Intensity Centroid method.

2.1 Image Pyramid

Pyramid is a common approach in computer vision. The bottom of the pyramid is the original image. For each layer up, the image is scaled with a fixed ratio, so that we have images of different resolutions. The smaller image can be seen as a scene viewed from a distance. In the feature matching algorithm, we can match images on different layers to achieve scale invariance.

2.2 Intensity Centroid

In terms of rotation, we calculate the gray centroid of the image near the feature point. The so-called centroid refers to the gray value of the image block as the center of weight. The specific steps are as follows

In a small image block $B$ , define the moment of the image block as

$m_{pq}=\sum_{x,y \in B}x^{p}y^{q}I(x,y), \quad p, q = \{0,1\}.$

Calculate the centroid of the image block by the moment:

$C=\left(\frac{m_{10}}{m_{00}},\frac{m_{01}}{m_{00}}\right).$

Connect the geometric center $O$ and the centroid $C$ of the image block to get a direction vector $\overrightarrow{OC}$ , so the direction of the feature point can be defined as

$\theta = \arctan(m_{01}/m_{10}).$

3. BRIEF Descriptor

After extracting the Oriented FAST key points, we calculate the descriptor for each point. ORB uses an improved BRIEF descriptor.

3.1 BRIEF

BRIEF is a binary descriptor. Its description vector consists of many 0s and 1s, where 0s and 1s encode the size relationship between two random pixels near the key point (such as $p$ and $q$ ): If $p$ is greater than $q$ , then take 1, otherwise take 0. If wetake 128 such p, q pairs, we will finally get a 128-dimensional vector consisting of 0s and 1s.

Good:

BRIEF implements the comparison of randomly selected points, which is very fast, and since it expresses in binary, it is also very convenient to store and suitable for real-time image matching.

Bad:

The original BRIEF descriptor does not have rotation invariance, so it is easy to get lost when the image is rotated.

3.2 Improved BRIEF

The ORB calculates the direction of the key points in the FAST feature point extraction stage, so the direction information can be used to calculate the ”Steer BRIEF” feature after the rotation, so that the ORB descriptor has better rotation invariance.

ORB Feature

ORB Feature

1. FAST Key points

1.1 Main Idea

1.2 Entire Procedure

1.3 Good

1.4 Bad

2. Oriented FAST Key points

2.1 Image Pyramid

2.2 Intensity Centroid

3. BRIEF Descriptor

3.1 BRIEF

3.2 Improved BRIEF

results matching ""

No results matching ""