Reduced QuickDecode memory consumption by 128,884x #420
Conversation
It looks like your second commit is a "fixup" of the first. Can you squash and rebase your PR?
The second commit was a whole different optimization. The PR is already up to date with the master branch, and I've also updated the description of the PR to explain the changes.
If it makes semantic sense to keep them separated, could you at least rework the commit messages to better explain these changes? At the moment, they are just "reduced quickdecode memory usage by 8x" and "tiny table". It is not clear from these what technically changed in these commits. I know it is described here in the PR, but could you please summarise these changes on a technical level in the commit messages, so that it is easier for someone reading the git history to understand what changed and why?
Force-pushed from 7995110 to 244866d
All I did was rename the commit; I don't understand why this test failed.
Allows them to be stored in CPU registers and avoids memcpy-ing the struct (cherry picked from commit e504236)
…into opti-quickdecode
I think I messed up a bit; I'm not really familiar with rebase. Should I just open up a new branch and PR with just this optimization? @christian-rauch
Pigeonhole-principle based quick decode table
Problem Statement
The previous AprilTag decoding implementation relied on exhaustive precomputation of error permutations to achieve O(1) lookups. While effective for small tag families (e.g., 16h5), this approach is computationally and spatially prohibitive for larger families such as 52h13.
For the 52h13 family, precomputing all variations with up to 2-bit errors results in approximately 67 million hash map entries, which amounts to roughly 3 GB of RAM for this one check. Extending this to 3-bit errors would grow the required space combinatorially to approximately 54 GB, making the approach essentially unusable. Even for smaller tag families, dedicating this many resources to a single check is not possible in many small robotics applications.
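As a rough sanity check of these figures (the ~48,714 codes of the standard 52h13 family, a hash table load factor of 3, and 16-byte entries are assumptions inferred from the numbers in this PR, not values stated here):

$$
1 + 52 + \binom{52}{2} = 1{,}379 \ \text{variations per code}, \qquad 1{,}379 \times 48{,}714 \approx 67 \ \text{million entries} \ \Rightarrow\ 67 \times 10^6 \times 3 \times 16\ \text{B} \approx 3.2\ \text{GB},
$$

$$
1 + 52 + \binom{52}{2} + \binom{52}{3} = 23{,}479 \ \text{variations per code} \ \Rightarrow\ 23{,}479 \times 48{,}714 \times 3 \times 16\ \text{B} \approx 55\ \text{GB}.
$$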
Proposed Solution
This PR replaces the combinatorial precomputation strategy with a search-based algorithm utilizing the Pigeonhole Principle.
Instead of storing every possible error permutation, the new implementation indexes the valid tags by splitting the code into 4 discrete chunks (e.g., 13 bits each for 52h13). Given a maximum tolerance of 3-bit errors, the Pigeonhole Principle guarantees that at least one of the four chunks of an observed tag must match the corresponding chunk of the valid tag perfectly.
The decoding process is updated to:
1. Split the observed code into the same 4 chunks used to index the valid tags.
2. Look up each chunk in its per-chunk index to collect the candidate codes that match that chunk exactly.
3. Verify each candidate by computing the full Hamming distance against the observed code, accepting it only if it falls within the configured error tolerance.

A minimal sketch of this scheme follows.
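Below is a minimal C sketch of the idea, assuming 52-bit codes split into four 13-bit chunks. The identifiers (`chunk_index_t`, `chunk_index_build`, `chunk_index_lookup`) and the CSR-style bucket layout are illustrative choices, not the actual code added by this PR; rotation handling and the id/hamming bookkeeping of the real quick decode path are omitted:

```c
/* Sketch of a pigeonhole-based quick decode table (illustrative only). */
#include <stdint.h>
#include <stdlib.h>

#define NBITS      52
#define NCHUNKS    4
#define CHUNK_BITS (NBITS / NCHUNKS)        /* 13 bits per chunk */
#define NBUCKETS   (1u << CHUNK_BITS)       /* 8192 possible chunk values */

typedef struct {
    /* CSR-style buckets: for chunk position p, the indices of codes whose
     * chunk equals v are items[p][start[p][v] .. start[p][v+1]-1]. */
    uint32_t *start[NCHUNKS];               /* NBUCKETS+1 offsets each */
    uint32_t *items[NCHUNKS];               /* ncodes entries each */
    const uint64_t *codes;                  /* the valid codes of the family */
    uint32_t ncodes;
} chunk_index_t;

static inline uint32_t chunk_of(uint64_t code, int pos)
{
    return (uint32_t)((code >> (pos * CHUNK_BITS)) & (NBUCKETS - 1));
}

/* Index the valid codes with one counting-sort pass per chunk position. */
static void chunk_index_build(chunk_index_t *idx, const uint64_t *codes, uint32_t ncodes)
{
    idx->codes = codes;
    idx->ncodes = ncodes;
    for (int pos = 0; pos < NCHUNKS; pos++) {
        idx->start[pos] = calloc(NBUCKETS + 1, sizeof(uint32_t));
        idx->items[pos] = malloc(ncodes * sizeof(uint32_t));
        uint32_t *fill = calloc(NBUCKETS, sizeof(uint32_t));
        for (uint32_t c = 0; c < ncodes; c++)
            idx->start[pos][chunk_of(codes[c], pos) + 1]++;
        for (uint32_t v = 0; v < NBUCKETS; v++)
            idx->start[pos][v + 1] += idx->start[pos][v];
        for (uint32_t c = 0; c < ncodes; c++) {
            uint32_t v = chunk_of(codes[c], pos);
            idx->items[pos][idx->start[pos][v] + fill[v]++] = c;
        }
        free(fill);
    }
}

/* Look up an observed code. Since <= 3 errors spread over 4 chunks leave at
 * least one chunk untouched, scanning the bucket of each chunk visits every
 * valid code within 3 bits of `observed`. Returns the index of a match within
 * `max_hamming` (<= NCHUNKS-1) errors, or -1 if none is found. */
static int32_t chunk_index_lookup(const chunk_index_t *idx, uint64_t observed, int max_hamming)
{
    for (int pos = 0; pos < NCHUNKS; pos++) {
        uint32_t v = chunk_of(observed, pos);
        for (uint32_t i = idx->start[pos][v]; i < idx->start[pos][v + 1]; i++) {
            uint32_t cand = idx->items[pos][i];
            /* Full verification: Hamming distance over all 52 bits
             * (__builtin_popcountll is a GCC/Clang builtin). */
            if (__builtin_popcountll(observed ^ idx->codes[cand]) <= max_hamming)
                return (int32_t)cand;
        }
    }
    return -1;
}
```

With roughly 48k codes spread over 8,192 bucket values per chunk position, each lookup only scans a handful of candidates (on the order of 6 per bucket on average) instead of consulting a table of millions of precomputed permutations.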
Memory Complexity
The legacy implementation required storing every error permutation ($1 + N + \binom{N}{2} + \binom{N}{3}$ for an $N$-bit code), multiplied by a load factor of 3 to resolve collisions. For the 52h13 family, this necessitated allocating over 69,000 slots per tag (approx. 3 billion total slots). The new implementation eliminates this combinatorial explosion, storing exactly 4 references per tag. The 52h13 memory footprint went from ~54 GB to ~450 KB, which is nearly a 128,884x reduction.
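Plugging $N = 52$ into the formula above (again assuming the standard 52h13 family's ~48,714 codes, which is not stated in this PR), the slot counts work out roughly as:

$$
\left(1 + 52 + \binom{52}{2} + \binom{52}{3}\right) \times 3 = 23{,}479 \times 3 = 70{,}437 \ \text{slots per tag}, \qquad 70{,}437 \times 48{,}714 \approx 3.4 \times 10^9 \ \text{slots in total},
$$

versus $4 \times 48{,}714 \approx 195{,}000$ chunk references in the new table.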
In summary -