Optimize FlatText.get/3 memory usage and speed by preciz · Pull Request #664 · philss/floki

preciz · 2026-03-17T11:07:22Z

Optimizes shallow text extraction in Floki.FlatText by replacing Enum.reduce/3 with a direct tail-recursive approach.

Performance Impact:
Across various Benchee tests (Small/Medium/Large HTML, and highly nested lists), this approach yields:

~10% to 25% faster execution speeds.
~35% to 45% reduction in memory allocations.

Tests:
Adds a comprehensive set of edge-case tests to flat_text_test.exs (covering deeply nested nodes, mixed tuples, and sequential text) to guarantee 100% behavioral parity with the previous implementation and protect against regressions.

Replaced Enum.reduce/3 with an optimal tail-recursive approach in FlatText. By traversing lists manually, we avoid the overhead of the Enumerable protocol and the allocation of anonymous closures per reduction iteration. Benchmark results when running Floki.FlatText.get on a large document: ```text Name ips average deviation median 99th % Tail recursion 3.21 M 311.16 ns ±23057.88% 50 ns 160 ns Enum.reduce 2.76 M 361.71 ns ±20499.20% 50 ns 180 ns Comparison: Tail recursion 3.21 M Enum.reduce 2.76 M - 1.16x slower +50.55 ns Memory usage statistics: Name Memory usage Tail recursion 136 B Enum.reduce 208 B - 1.53x memory usage +72 B ```

preciz added 3 commits March 17, 2026 11:21

Add tests for FlatText edge cases

076685b

Run formatter

f21263e

philss merged commit f50d7f2 into philss:main Mar 17, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize FlatText.get/3 memory usage and speed#664

Optimize FlatText.get/3 memory usage and speed#664
philss merged 3 commits intophilss:mainfrom
preciz:perf/optimize-flat-text

preciz commented Mar 17, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

preciz commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

preciz commented Mar 17, 2026 •

edited

Loading