Update proj1.html

cjxthecoder · web-flow · commit 44e6f705dd5d · 2025-09-12T16:16:53.000-07:00
diff --git a/project-1/proj1.html b/project-1/proj1.html
@@ -55,19 +55,19 @@ <h2>Naive Search</h2>
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">cathedralNaive.jpg</figcaption>
 <img src="images/cathedralNaive.jpg" alt="cathedralNaive.jpg" width="50%">
-<figcaption>Best shift: (-2, 336), (1, -334)<br>5.070914s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-2, 336), (1, -334)<br>5.070914s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">monasteryNaive.jpg</figcaption>
 <img src="images/monasteryNaive.jpg" alt="monasteryNaive.jpg" width="50%">
-<figcaption>Best shift: (-2, 344), (1, -335)<br>5.107604s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-2, 344), (1, -335)<br>5.107604s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">tobolskNaive.jpg</figcaption>
 <img src="images/tobolskNaive.jpg" alt="tobolskNaive.jpg" width="50%">
-<figcaption>Best shift: (-3, 338), (1, -337)<br>5.857907s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-3, 338), (1, -337)<br>5.857907s</figcaption>
 </figure>
 
 </div>
@@ -95,100 +95,101 @@ <h2>Image Pyramid</h2>
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">cathedral.jpg</figcaption>
 <img src="images/cathedral.jpg" alt="cathedral.jpg" width="50%">
-<figcaption>Best shift: (-2, 336), (1, -334)<br>0.033945s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-2, 336), (1, -334)<br>0.033945s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">church.tif</figcaption>
 <img src="images/church.jpg" alt="church.jpg" width="50%">
-<figcaption>Best shift: (-4, 3177), (-8, -3169)<br>3.423309s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-4, 3177), (-8, -3169)<br>3.423309s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">emir.tif</figcaption>
 <img src="images/emir.jpg" alt="emir.jpg" width="50%">
-<figcaption>Best shift: (-24, 3160), (17, -3152)<br>3.36728s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-24, 3160), (17, -3152)<br>3.36728s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">harvesters.tif</figcaption>
 <img src="images/harvesters.jpg" alt="harvesters.jpg" width="50%">
-<figcaption>Best shift: (-17, 3159), (-3, -3153)<br>3.393377s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-17, 3159), (-3, -3153)<br>3.393377s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">icon.tif</figcaption>
 <img src="images/icon.jpg" alt="icon.jpg" width="50%">
-<figcaption>Best shift: (-17, 3204), (5, -3196)<br>3.401831s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-17, 3204), (5, -3196)<br>3.401831s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">italil.tif</figcaption>
 <img src="images/italil.jpg" alt="italil.jpg" width="50%">
-<figcaption>Best shift: (-21, 3193), (15, -3192)<br>3.3279s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-21, 3193), (15, -3192)<br>3.3279s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">lastochikino.tif</figcaption>
 <img src="images/lastochikino.jpg" alt="lastochikino.jpg" width="50%">
-<figcaption>Best shift: (2, 3244), (-7, -3163)<br>3.270373s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (2, 3244), (-7, -3163)<br>3.270373s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">lugano.tif</figcaption>
 <img src="images/lugano.jpg" alt="lugano.jpg" width="50%">
-<figcaption>Best shift: (16, 3203), (-13, -3192)<br>3.317115s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (16, 3203), (-13, -3192)<br>3.317115s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">melons.tif</figcaption>
 <img src="images/melons.jpg" alt="melons.jpg" width="50%">
-<figcaption>Best shift: (-11, 3159), (3, -3145)<br>3.387267s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-11, 3159), (3, -3145)<br>3.387267s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">monastery.jpg</figcaption>
 <img src="images/monastery.jpg" alt="monastery.jpg" width="50%">
-<figcaption>Best shift: (-2, 344), (1, -335)<br>0.032784s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-2, 344), (1, -335)<br>0.032784s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">self_portrait.tif</figcaption>
 <img src="images/self_portrait.jpg" alt="self_portrait.jpg" width="50%">
-<figcaption>Best shift: (-29, 3172), (8, -3153)<br>3.446271s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-29, 3172), (8, -3153)<br>3.446271s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">siren.tif</figcaption>
 <img src="images/siren.jpg" alt="siren.jpg" width="50%">
-<figcaption>Best shift: (6, 3201), (-18, -3203)<br>3.37681s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (6, 3201), (-18, -3203)<br>3.37681s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">three_generations.tif</figcaption>
 <img src="images/three_generations.jpg" alt="three_generations.jpg" width="50%">
-<figcaption>Best shift: (-14, 3157), (-3, -3150)<br>3.285443s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-14, 3157), (-3, -3150)<br>3.285443s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;">tobolsk.jpg</figcaption>
 <img src="images/tobolsk.jpg" alt="tobolsk.jpg" width="50%">
-<figcaption>Best shift: (-3, 338), (1, -337)<br>0.031668s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-3, 338), (1, -337)<br>0.031668s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;"><a href="https://www.loc.gov/item/2018681257">canal.tif</a></figcaption>
 <img src="images/canal.jpg" alt="canal.jpg" width="50%">
-<figcaption>Best shift: (-18, 3156), (6, -3147)<br>3.386601s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (-18, 3156), (6, -3147)<br>3.386601s</figcaption>
 </figure>
 
 <figure style="flex: 1 0 20%; margin:12px;">
 <figcaption style="margin-bottom:6px;"><a href="https://www.loc.gov/item/2018679143">capri.tif</a></figcaption>
 <img src="images/capri.jpg" alt="capri.jpg" width="50%">
-<figcaption>Best shift: (16, 3206), (-8, -3192)<br>3.325347s</figcaption>
+<figcaption style="margin-bottom:6px;">Best shift: (16, 3206), (-8, -3192)<br>3.325347s</figcaption>
 </figure>
 
 </div>
+
 <p>
 Even for small images, the pyramid method achieved a speedup of over 100x because it reduces both the size and the number of NCC computations, while for large images the computation is still faster than naive search on a single small image. More concretely, because the image is downscaled by 2x at each layer of the pyramid, the total image area across all layers is a truncated geometric series: 1 + (1/2)<sup>2</sup> + (1/4)<sup>2</sup> + (1/8)<sup>2</sup> + ... = 1 + 1/4 + 1/16 + 1/64 + ... &lt; 4/3 of the original. Since the search at the lowest scale is limited to W &leq; 72, the total time complexity is only <i>4(3W)W / 3</i> + smaller terms = <i>O(W<sup>2</sup>)</i>, which is consistent with the observed factor of ~100 between the alignment times of .jpg and .tif files.
 </p>
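The area bound in the paragraph above can be checked numerically. The following is a minimal sketch, assuming each pyramid layer halves both image dimensions; the function name `pyramid_area_ratio` is hypothetical and not part of the project code.

```python
def pyramid_area_ratio(levels: int) -> float:
    """Total pixel area of a pyramid with `levels` layers,
    expressed as a multiple of the full-resolution image area.

    Assumes every layer halves both width and height, so layer k
    covers (1/2)**(2*k) = (1/4)**k of the original area.
    """
    if levels < 1:
        raise ValueError("a pyramid needs at least one layer")
    return sum(0.25 ** k for k in range(levels))


if __name__ == "__main__":
    for n in (1, 2, 4, 8):
        # Each partial sum stays below the limit 4/3.
        print(n, pyramid_area_ratio(n))
```

Under these assumptions, adding more layers never costs more than a third of the full-resolution work, which is why the coarse-to-fine search dominates the naive one.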
@@ -197,39 +198,61 @@ <h2>Image Pyramid</h2>
 <!-- Section 5 -->
 <h2>Cropping with Sobel</h2>
 <p>
-Although having a fixed cropping dimension worked for aligning the images, because the final result is simply the intersection of the translations, artifacts from the border still remain. Is there a way to automatically crop the borders? The answer is yes. We can start by applying the sobel operator
+Although a fixed cropping dimension worked for aligning the images, artifacts from the borders still remain because the final result is simply the intersection of the translations. Is there a way to crop the borders automatically? The answer is yes. We can start by convolving the Sobel operator
 </p>
 
 <div style="display:flex; flex-wrap:wrap; justify-content:center; text-align:center;">
 
-<figure style="flex: 1 0 40%; margin:12px;">
+<figure style="flex: 1 0 20%; margin:12px;">
 <img src="images/equationX.png" alt="equationX.png" width="50%">
 </figure>
 
-<figure style="flex: 1 0 40%; margin:12px;">
+<figure style="flex: 1 0 20%; margin:12px;">
 <img src="images/equationY.png" alt="equationY.png" width="50%">
 </figure>
 
 </div>
 
 <p>
-with convolution to the base image, and add the results together. This will produce a composite of the detected edges along the <i>x</i>- and <i>y</i>-axis. The next step is to reconstruct the borders of each plate from the image. Because Sobel is distance-invariant, we can convolve the kernel on only the valid patches, resulting in the new image having dimensions <i>(W - 1)</i> &times; <i>(H - 1)</i>.
+with the base image and adding the results together. This produces a composite of the detected edges along the <i>x</i>- and <i>y</i>-axes. The next step is to reconstruct the borders of each plate from the image. Because Sobel is translation-invariant, we can convolve the kernel on only the valid patches, resulting in the new image having dimensions <i>(W - 1)</i> &times; <i>(H - 1)</i>.
 </p>
-
 <div align="center">
 <img src="images/edges" alt="edges.png" width="50%">
 <figcaption style="margin-bottom:6px;">sobelX &starf; img + sobelY &starf; img</figcaption>
 </div>
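The edge composite described above can be sketched in plain Python. This is only an illustration under stated assumptions: it uses the standard 3&times;3 Sobel kernels (the kernels shown in equationX.png and equationY.png may differ), and the helper names `convolve_valid` and `sobel_edge_map` are hypothetical. With a 3&times;3 kernel, the valid region is two pixels smaller than the input along each axis.

```python
# Standard 3x3 Sobel kernels (an assumption; the page's equation images may differ).
SOBEL_X = [[1, 0, -1], [2, 0, -2], [1, 0, -1]]
SOBEL_Y = [[1, 2, 1], [0, 0, 0], [-1, -2, -1]]


def convolve_valid(img, kernel):
    """2D convolution evaluated only where the kernel fits entirely inside img."""
    kh, kw = len(kernel), len(kernel[0])
    h, w = len(img), len(img[0])
    out = []
    for i in range(h - kh + 1):
        row = []
        for j in range(w - kw + 1):
            acc = 0.0
            for a in range(kh):
                for b in range(kw):
                    # True convolution flips the kernel along both axes.
                    acc += kernel[kh - 1 - a][kw - 1 - b] * img[i + a][j + b]
            row.append(acc)
        out.append(row)
    return out


def sobel_edge_map(img):
    """Signed sum of the x- and y-direction Sobel responses."""
    gx = convolve_valid(img, SOBEL_X)
    gy = convolve_valid(img, SOBEL_Y)
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(gx, gy)]


if __name__ == "__main__":
    # Toy image: dark left half, bright right half (a single vertical edge).
    toy = [[1.0 if c >= 4 else 0.0 for c in range(8)] for _ in range(6)]
    for row in sobel_edge_map(toy):
        print(row)
```

The sum is kept signed rather than converted to a magnitude, since the later step relies on both the highest and the lowest row averages. In practice a vectorized library routine would replace the nested loops.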
-
 <p>
 Now, we can take the average of each row, and those with the lowest and highest values will be the edges. By splitting the resulting array into 4 subarrays [0 : <i>H/8</i>], [<i>H/8</i> : <i>H/2</i>], [<i>H/2</i> : <i>7H/8</i>], and [<i>7H/8</i> : <i>H</i>], we can find the argmax and argmin in each subarray, which will be used as the horizontal borders of each plate image.
 </p>
-
 <div align="center">
 <img src="images/y.png" alt="y.png" width="50%">
 </div>
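The row-average step can be sketched as follows. This is a rough illustration rather than the project's implementation: the function names `row_means` and `border_candidates` are hypothetical, integer division is assumed for the split points, and how the argmin/argmax candidates are finally assigned to plate borders is not specified in the text.

```python
def row_means(edges):
    """Average of each row of a 2D edge map given as a list of lists."""
    return [sum(row) / len(row) for row in edges]


def border_candidates(profile):
    """Return (argmin, argmax) for each of the four sub-ranges
    [0 : H/8], [H/8 : H/2], [H/2 : 7H/8], [7H/8 : H] of a row profile.

    Indices are reported relative to the full profile, not the sub-range.
    """
    h = len(profile)
    if h < 8:
        raise ValueError("profile must have at least 8 rows")
    cuts = [0, h // 8, h // 2, 7 * h // 8, h]
    result = []
    for lo, hi in zip(cuts, cuts[1:]):
        segment = profile[lo:hi]
        i_min = lo + min(range(len(segment)), key=segment.__getitem__)
        i_max = lo + max(range(len(segment)), key=segment.__getitem__)
        result.append((i_min, i_max))
    return result


if __name__ == "__main__":
    # Synthetic 16-row profile with a few strong positive and negative rows.
    profile = [0.0] * 16
    for index, value in {1: 5, 3: -4, 6: 3, 9: 7, 12: -2, 14: 1, 15: -6}.items():
        profile[index] = float(value)
    print(border_candidates(profile))
```

Restricting each search to its own sub-range keeps a strong edge in one plate from masking the weaker border of another.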
+<p>
+The vertical borders can be computed using a similar strategy, except that we need to split the column averages into 3 subsections based on the values found in the previous part. This will ensure that we are able to find the vertical edges of each image plate. Although the center/green plate is not used in the alignment process, its borders will be needed to compute the final crop of the composited image.
+</p>
+
+<div style="display:flex; flex-wrap:wrap; justify-content:center; text-align:center;">
 
-The vertical borders can be estimated using a similar strategy, except that we need to split the column averages into 3 subsections based on the values found in the previous part. This will ensure that we are able to find the vertical edges of each image plate. Although the center plate
+<figure style="flex: 1 0 20%; margin:12px;">
+<img src="images/xb.png" alt="xb.png" width="50%">
+<figcaption style="margin-bottom:6px;">Blue plate</figcaption>
+</figure>
+
+<figure style="flex: 1 0 20%; margin:12px;">
+<img src="images/xg.png" alt="xg.png" width="50%">
+<figcaption style="margin-bottom:6px;">Green plate</figcaption>
+</figure>
+
+<figure style="flex: 1 0 20%; margin:12px;">
+<img src="images/xr.png" alt="xr.png" width="50%">
+<figcaption style="margin-bottom:6px;">Red plate</figcaption>
+</figure>
+
+</div>
+
+<div align="center">
+<img src="images/borders.png" alt="borders.png" width="50%">
+<figcaption style="margin-bottom:6px;">All computed borders &amp; subarray boundaries</figcaption>
+</div>
 
 <hr>