Implement ANTLR4 grammar parsing and migration from regex focus (refs #147) by justinfx · Pull Request #149 · justinfx/fileseq

justinfx · 2026-02-11T03:21:53Z

#147

Migrate to ANTLR4 grammar-based parsing (v3)

Summary

Replaces regex-based parsing with the shared ANTLR4 grammar that the Go and C++ implementations also use. All 181 tests pass.

Breaking Changes

Removed API:

FileSequence.SPLIT_RE, FileSequence.DISK_RE class variables
constants.SPLIT_PATTERN, constants.SPLIT_RE, constants.SPLIT_SUB_PATTERN, constants.SPLIT_SUB_RE

Removed Files:

setup.py → replaced with pyproject.toml
src/fileseq/__version__.py → automatic versioning via setuptools-scm

Behavior:

Auto-padding now only applies to single-frame files (foo.100.exr)
Explicit padding preserved (foo.1@@@@.exr keeps 4 chars)
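
To illustrate the explicit-padding rule above, here is a minimal standalone sketch. It does not call fileseq; it assumes the default padding convention in which `#` stands for 4 digits and `@` for 1 digit, and the names `PAD_WIDTHS`, `pad_width`, and `format_frame` are hypothetical.

```python
# Sketch only: assumes '#' means 4 digits and '@' means 1 digit
# (the default padding convention); these names are not fileseq's API.
PAD_WIDTHS = {"#": 4, "@": 1}


def pad_width(pad_chars: str) -> int:
    """Return the zero-padded width implied by a run of padding characters."""
    return sum(PAD_WIDTHS[c] for c in pad_chars)


def format_frame(frame: int, pad_chars: str) -> str:
    """Zero-pad a frame number to the width implied by pad_chars."""
    return str(frame).zfill(pad_width(pad_chars))


# Explicit padding of four '@' characters keeps a width of 4.
print(format_frame(1, "@@@@"))
```

Under these assumptions, `@@@@` and `#` describe the same width, which is why an explicit `@@@@` can be preserved as written instead of being re-derived from the frame number.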

New Features

Decimal frame ranges: foo.1-5x0.25#.exr
Subframe sequences: foo.#.#.exr, foo.1-5#.10-20@@.exr
Fixed hidden file parsing: .bar1000.exr → basename=.bar, frame=1000, ext=.exr
Cross-platform path handling (both / and \\)
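
As a rough illustration of what a decimal frame range such as `1-5x0.25` expands to, the sketch below uses `decimal.Decimal` to avoid floating-point drift. It is a standalone approximation, not the grammar-based parser in this PR, and `expand_decimal_range` is a hypothetical helper.

```python
from decimal import Decimal


def expand_decimal_range(start: str, end: str, step: str) -> list:
    """Expand an inclusive start-end range with a positive decimal step."""
    lo, hi, inc = Decimal(start), Decimal(end), Decimal(step)
    if inc <= 0:
        raise ValueError("step must be positive")
    if hi < lo:
        return []
    # Number of whole steps that fit between lo and hi, plus the start frame.
    count = int((hi - lo) / inc) + 1
    return [lo + inc * i for i in range(count)]


frames = expand_decimal_range("1", "5", "0.25")
print(len(frames), frames[0], frames[1], frames[-1])
```

Taking the bounds as strings keeps the decimal values exact; building them from floats would reintroduce rounding error in the subframe values.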

Implementation

Grammar: grammar/fileseq.g4 (shared with Go/C++)
Parser generator: hatch run generate or python src/fileseq/grammar/generate.py
Modern packaging: PEP 517/518 with pyproject.toml
CI: Grammar validation + version verification on deploy

Performance

Zero regression vs v2.x regex parsing:

Simple patterns: ~240 μs
Complex patterns: ~445 μs
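
For readers who want to reproduce per-call timings of this kind, one possible harness is sketched below with `timeit`. The `parse` function is a placeholder regex, not fileseq's parser, and `mean_microseconds` is a hypothetical helper; swap in the real call under test. Results will vary by machine.

```python
import re
import timeit

# Placeholder parser for illustration only; replace with the call under test.
_PATTERN = re.compile(
    r"^(?P<base>.*?)(?P<range>[\d,x:-]+)(?P<pad>[#@]+)(?P<ext>\.\w+)$"
)


def parse(path):
    """Split a sequence string into rough components, or return None."""
    m = _PATTERN.match(path)
    return m.groupdict() if m else None


def mean_microseconds(func, arg, number=10_000):
    """Return the mean wall-clock time per call, in microseconds."""
    total = timeit.timeit(lambda: func(arg), number=number)
    return total / number * 1e6


print(f"{mean_microseconds(parse, 'foo.1-100#.exr'):.2f} us per call")
```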


Replace fully-expanded _items (frozenset) and _order (tuple) with a compact list of Range objects. Memory reduction of 99.9%+ for typical ranges (100k frames: 7.8MB -> ~536 bytes).

Bug fixes:

- isConsecutive(): rewrite as O(n) range-based algorithm; fixes incorrect True for interleaved ranges and IndexError on empty FrameSet
- hasSubFrames(): correctly returns True for decimal notation like 1.0-5.0 where normalizeFrame collapses values to integers before storage
- Stagger modifier: deduplicate frames across stagger iterations
- MAX_FRAME_SIZE check: calculate size mathematically for x and plain ranges instead of materializing all frames

API compatibility: no breaking changes. .items and .order remain public with DeprecationWarning. All 181 existing tests pass.

Refactor FrameSet to use range-based storage (fixes #148)
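
The idea behind range-based storage can be sketched with Python's built-in `range` objects. This is not the FrameSet implementation from the commit: `CompactFrames` is a hypothetical stand-in, and returning False for an empty set in `is_consecutive` is an assumption of this sketch.

```python
import sys


class CompactFrames:
    """Hypothetical range-backed frame container (not fileseq's FrameSet)."""

    def __init__(self, ranges):
        # Each entry is a built-in range; nothing is expanded up front.
        self._ranges = list(ranges)

    def __len__(self):
        # Lengths are computed arithmetically, without materializing frames.
        return sum(len(r) for r in self._ranges)

    def __iter__(self):
        for r in self._ranges:
            yield from r

    def is_consecutive(self):
        """True if the frames form one unbroken run with a step of 1.

        Runs in time proportional to the number of ranges, not frames.
        """
        prev_last = None
        for r in self._ranges:
            if len(r) == 0:
                continue
            if len(r) > 1 and r.step != 1:
                return False
            if prev_last is not None and r[0] != prev_last + 1:
                return False
            prev_last = r[-1]
        # Assumption: an empty set is reported as not consecutive.
        return prev_last is not None


compact = CompactFrames([range(1, 100_001)])
expanded = frozenset(range(1, 100_001))
print(len(compact), sys.getsizeof(compact._ranges[0]), sys.getsizeof(expanded))
```

Exact byte counts depend on the interpreter, but the container's size no longer grows with the number of frames, only with the number of ranges.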

Implement ANTLR4 grammar parsing and migration from regex focus (refs #147)

080207d

justinfx added this to the v3 milestone Feb 11, 2026

justinfx self-assigned this Feb 11, 2026

justinfx added the v3 label Feb 11, 2026

justinfx added 11 commits February 12, 2026 08:37

Fix mypy errors

2a8f77b

Fix regression on normalizing path sep to most common

41a9efb

Ignore more antlr symbols from mypy

770810a

More fixes to path normalization

459885d

docs: fix bug referencing old __version__.py file during sphinx build

5f5da10

Add doc building tool commands and update README

5c225ce

Reformat docs to include grammar, and reorganize license location

cc12f8b

Fix python 3.8 issues

fe94925

Update fuzz tests after refactor, and fix uncovered frameset bugs

c3a6136

mypy type checking fixes

2a124fd

justinfx mentioned this pull request Feb 20, 2026

Multiple step sizes lost when calling intersection() #54

Closed

justinfx added 5 commits February 22, 2026 09:20

Update CHANGES

ceac963

Update CHANGES

ea754f2

Update CHANGES

e26f7b3

Merge pull request #150 from justinfx/v3_ranges

d265e14

Refactor FrameSet to use range-based storage (fixes #148)

Fix CHANGES typo

cf4004e

justinfx merged commit 7d5a8c7 into master Feb 21, 2026
10 checks passed

justinfx deleted the v3 branch February 21, 2026 21:01

justinfx merged 17 commits into master from v3
