de-pixelate youtube video gaV-O6NPWrI "Who pays $450 for 2TB?"

first off, huge thanks to jeff for this challenge.
i was always under the impression that this should work,
but never had the opportunity to do it.

input video
full video

saw the video, looked doable.. did it.
around 4 hours of coding, code quality: "proof of concept".

Explanation

when i saw the video it was immediately clear to me that you might be able to restore the original content.

the window has (assumed) static content
while moving the window around, the "pixelated" content changed
the "pixelated" content moved differently than the window

basically it is like a fence with small holes:
if the fence moves, or the content behind the fence moves,
you can see more parts of the content behind it.

V1

extracting the frames with ffmpeg

ffmpeg -i "$video_file" -filter_complex "select=bitor(gt(scene\,0.01)\,eq(n\,0))" -vsync drop frames/%04d.png

this gave me ~56 frames, i looked through the frames and deleted the frames without the window.

finding the window position

since it was only 56 pictures, i did it by hand. i used this template.
i just put a red rectangle around the window.

extracting the window

pretty simple, just find the first red pixel, from the "finding the window position" step.
i measured the window size in gimp: 1842px × 930px
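a minimal sketch of this step, not the actual code from this repo. it assumes the marker is pure red (255, 0, 0), that the first red pixel in row-major scan order is the top-left corner of the window, and that an image is a plain list of rows of (r, g, b) tuples so the example has no dependencies. the names `find_first_red` and `crop_window` are made up for illustration.

```python
# hypothetical helpers; an image is a list of rows of (r, g, b) tuples
WINDOW_W, WINDOW_H = 1842, 930  # window size measured in gimp (from the text)
RED = (255, 0, 0)               # assumption: the rectangle is drawn in pure red


def find_first_red(image, red=RED):
    """Return (x, y) of the first red pixel in row-major order, or None."""
    for y, row in enumerate(image):
        for x, pixel in enumerate(row):
            if pixel == red:
                return x, y
    return None


def crop_window(image, width=WINDOW_W, height=WINDOW_H, red=RED):
    """Crop a width x height region whose top-left corner is the first red pixel."""
    corner = find_first_red(image, red)
    if corner is None:
        raise ValueError("no red marker pixel found in this frame")
    x0, y0 = corner
    return [row[x0:x0 + width] for row in image[y0:y0 + height]]
```

with a real frame you would load the png into this shape first (or do the same thing with array slicing in an image library); the crop itself is just two slices.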

extracting the cells (pixels)

i measured the cell size in gimp and it should be 25px.
but after some testing it just didn't work..
the video might be scaled or the pixelated content might be scaled.
anyway, i just counted the cells, measured the total span, and calculated the grid size: 730px/29 × 1763px/70 (about 25.17px × 25.19px per cell)
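the arithmetic behind those numbers, as a small sketch. the measurements (730px over 29 cells, 1763px over 70 cells) are from the text; the helper `cell_centers` and its `start` parameter are assumptions for illustration. it also shows how far a plain 25px guess drifts over 70 cells, which is probably why 25px didn't work.

```python
# cell pitch from the counted cells and the measured spans (values from the text)
pitch_29 = 730 / 29    # about 25.17 px per cell
pitch_70 = 1763 / 70   # about 25.19 px per cell

# drift of a plain 25px grid after 70 cells: half a cell off at the far end
drift_70 = 1763 - 70 * 25   # 13 px


def cell_centers(start, pitch, count):
    """Centers of `count` cells of size `pitch`, the first cell beginning at `start`."""
    return [start + (i + 0.5) * pitch for i in range(count)]
```

keeping the pitch as a float and rounding only when indexing into the image avoids accumulating the rounding error from cell to cell.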

to find the cell edges, i simply measured the difference between neighboring pixels in the x and y directions (edge detection),
and then used the magic value 4, found by testing, as the threshold for an edge.
then i took the first edge index in each of the x and y directions as the starting coordinates of the grid.
(i had some other plans here, but that's the approach i ended up with)
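a rough sketch of this kind of edge detection, under assumptions: the window is already converted to grayscale (a list of rows of ints), the difference is averaged over the whole row or column before comparing against the threshold, and the function names are invented here. how the differences were actually aggregated in the real code may differ.

```python
THRESHOLD = 4  # the "magic value" from the text


def column_edges(gray, threshold=THRESHOLD):
    """x indices where the mean |gray[y][x] - gray[y][x-1]| over all rows exceeds threshold."""
    height, width = len(gray), len(gray[0])
    edges = []
    for x in range(1, width):
        total = sum(abs(row[x] - row[x - 1]) for row in gray)
        if total / height > threshold:
            edges.append(x)
    return edges


def row_edges(gray, threshold=THRESHOLD):
    """Same as column_edges, but along y: transpose and reuse the column version."""
    transposed = [list(col) for col in zip(*gray)]
    return column_edges(transposed, threshold)


def grid_origin(gray, threshold=THRESHOLD):
    """(x, y) of the first detected edge in each direction, or None if there is none."""
    xs, ys = column_edges(gray, threshold), row_edges(gray, threshold)
    if not xs or not ys:
        return None
    return xs[0], ys[0]
```

together with the cell pitch, the origin returned here is enough to compute every cell center in the window.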

blue lines are from the edge detection, green dots are the centers of the cells

accumulate the pixels

basically i just take the pixels from the window only at the centers of the cells
and put them into a picture where they are accumulated.
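a sketch of the accumulation, with assumptions stated up front: the canvas has the same size as the window, `None` stands for a transparent pixel, a later frame simply overwrites an earlier sample at the same position, and the cell centers are given in window coordinates. `new_canvas` and `accumulate` are hypothetical names.

```python
import math


def new_canvas(width, height):
    """A fully transparent canvas; None means 'no sample seen here yet'."""
    return [[None] * width for _ in range(height)]


def accumulate(canvas, window, centers_x, centers_y):
    """Copy the window's pixels at the cell centers into the canvas, in place."""
    height, width = len(window), len(window[0])
    for cy in centers_y:
        for cx in centers_x:
            x, y = math.floor(cx), math.floor(cy)
            if 0 <= x < width and 0 <= y < height:
                canvas[y][x] = window[y][x]  # assumption: last sample wins
    return canvas
```

because the grid sits at a different offset relative to the window in every frame, each frame fills in a different set of positions, which is the "fence with holes" effect from the explanation.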

which resulted in this:

.. hard to read, so i added a fill algorithm: basically i grow all pixels in size until there are no transparent pixels left

which results in something like this:

which then resulted in this picture for the window
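one way the fill step described above could look, as a sketch only. it assumes `None` marks a transparent pixel and grows every filled pixel by one step into its 4 neighbors per pass until nothing is transparent anymore; the real code may grow the pixels differently (for example as squares). `grow_fill` is a made-up name.

```python
def grow_fill(canvas):
    """Return a copy of canvas where filled pixels grow until no None is left."""
    height, width = len(canvas), len(canvas[0])
    current = [row[:] for row in canvas]
    while True:
        grown = [row[:] for row in current]
        changed = False
        for y in range(height):
            for x in range(width):
                if current[y][x] is not None:
                    continue
                # take the color of the first filled neighbor from the previous pass
                for dx, dy in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    nx, ny = x + dx, y + dy
                    if 0 <= nx < width and 0 <= ny < height and current[ny][nx] is not None:
                        grown[y][x] = current[ny][nx]
                        changed = True
                        break
        current = grown
        if not changed:  # everything filled, or nothing to grow from
            return current
```

reading from the previous pass and writing into a copy keeps the growth even in all directions; growing in place would smear colors along the scan direction.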

result video

this looked very promising, i was pretty sure i could extract the content with more frames.
but doing "finding the window position" by hand for that many frames would be too much work