CS 180: Project 1

The purpose of this project is to take black-and-white images in which each color channel is stored as a separate image, align the channels, and combine them into a single color image.

Overview

Base Approach:

To start, we took the following approach (which you can find in misc/base_approach.py):

Split the image into three equal-height chunks along the vertical axis. These become the blue, green, and red channels, respectively.
For a range of vertical and horizontal shifts, compute the normalized cross-correlation between channel pairs.
Apply the offsets to the red and green channels that maximize the normalized cross-correlation with the blue channel.
Save and plot the resulting three-channel image.
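The steps above can be sketched roughly as follows. This is a minimal illustration and not the contents of misc/base_approach.py: the function names, the `max_shift=15` search window, and the use of `np.roll` (which wraps pixels around the border instead of cropping them) are all assumptions made for brevity.

```python
import numpy as np


def split_channels(plate):
    """Split a stacked plate into blue, green, red thirds (top to bottom)."""
    h = plate.shape[0] // 3
    return plate[:h], plate[h:2 * h], plate[2 * h:3 * h]


def ncc(a, b):
    """Normalized cross-correlation of two equally sized arrays."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float((a * b).sum() / denom) if denom > 0 else 0.0


def best_offset(channel, reference, max_shift=15):
    """Exhaustively search (dy, dx) shifts in [-max_shift, max_shift]."""
    best, best_score = (0, 0), -np.inf
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            # np.roll wraps around the border; a simplification (assumption).
            shifted = np.roll(channel, (dy, dx), axis=(0, 1))
            score = ncc(shifted, reference)
            if score > best_score:
                best, best_score = (dy, dx), score
    return best


def align_plate(plate, max_shift=15):
    """Align green and red to blue and return an RGB stack plus the offsets."""
    b, g, r = split_channels(plate.astype(np.float64))
    g_off = best_offset(g, b, max_shift)
    r_off = best_offset(r, b, max_shift)
    g = np.roll(g, g_off, axis=(0, 1))
    r = np.roll(r, r_off, axis=(0, 1))
    return np.dstack([r, g, b]), g_off, r_off
```

The cost of this search grows with both the image area and the square of the search window, which is consistent with the slow runtimes on large images noted below.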

Plotting the normalized cross correlation performance metric, we can see that there is a peak at the “optimal” offset:

cc plot cathedral

And this is the result produced:

cathedral aligned basic

Issues:

Slow runtimes on large images.
Poor reliability on images with large differences between channels.

Optimizations:

Pyramid Search:

To address the performance issues, we implement pyramid search as follows:

1. Compute the ideal offset over a large range on a scaled-down image and apply it to the full-size image.
2. Double the image size.
3. Compute a new ideal offset over a small range.
4. Repeat steps 2 and 3 until the image is full size.

The first pass is fast because the image is small, and every subsequent pass only needs to check a small range of offsets because the images are already roughly aligned. This balances the tradeoff between accuracy and speed: small images are fast to align but give only coarse offsets, while large images give precise offsets but are slow to search.
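A coarse-to-fine search of this kind might look like the sketch below. It is only an illustration under stated assumptions: the helper names, the 2x2 block-average downscaling, the `min_size`, `coarse_radius`, and `fine_radius` defaults, and the wrap-around shifting via `np.roll` are hypothetical choices, not the project's actual code.

```python
import numpy as np


def ncc(a, b):
    """Normalized cross-correlation of two equally sized arrays."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float((a * b).sum() / denom) if denom > 0 else 0.0


def search(channel, reference, center, radius):
    """Best (dy, dx) within `radius` of `center`, scored by NCC."""
    best, best_score = center, -np.inf
    for dy in range(center[0] - radius, center[0] + radius + 1):
        for dx in range(center[1] - radius, center[1] + radius + 1):
            score = ncc(np.roll(channel, (dy, dx), axis=(0, 1)), reference)
            if score > best_score:
                best, best_score = (dy, dx), score
    return best


def halve(img):
    """Downscale by 2 using 2x2 block averaging (odd edges are cropped)."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    return img[:h, :w].reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))


def pyramid_offset(channel, reference, min_size=32, coarse_radius=8, fine_radius=2):
    """Estimate the (dy, dx) shift that aligns `channel` to `reference`."""
    # Build the pyramid, full resolution first, halving until min_size.
    levels = [(channel.astype(np.float64), reference.astype(np.float64))]
    while min(levels[-1][0].shape) // 2 >= min_size:
        c, r = levels[-1]
        levels.append((halve(c), halve(r)))
    # Coarsest level: wide search centered on zero shift.
    offset = search(*levels[-1], center=(0, 0), radius=coarse_radius)
    # Finer levels: double the estimate, then refine in a small window.
    for c, r in reversed(levels[:-1]):
        offset = search(c, r, center=(2 * offset[0], 2 * offset[1]), radius=fine_radius)
    return offset
```

With these defaults, each level evaluates at most 17 x 17 candidate shifts at the coarsest scale and 5 x 5 at every finer scale, so the expensive full-resolution pass touches only 25 offsets.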

For sculpture.tif, we get the following steps for alignment:

Pyramid Steps

Pyramid search improves the run time on sculpture.tif from more than 10 minutes to 6.916 s.

Canny edge detection:

To improve the reliability of alignment, we apply Canny edge detection to each channel using OpenCV before aligning the images. Canny edge detection finds edges in an image by locating intensity gradients that fall within a given range and filtering the results to keep only well-connected edges.

Applying Canny edge detection makes alignment more reliable for two main reasons. First, because only the edges are considered for alignment, the actual intensities within those edges are ignored. This is especially beneficial in noisy scenes and when the color channels deviate more from one another. Second, an edge detector produces a sharper peak at the optimal alignment.

For cathedral.jpg, you can see that the peak becomes sharper when Canny edge detection is applied.

cathedral canny cc effects

You can also see a noticeable improvement in quality of results on harvesters.tif.

Qualitative impact of Canny on alignment of harvesters.tif