Enhance code quality and robustness: fixes for pattern context, parser loop, namespace validation, week validation, and bitflags by mundanevision20 · Pull Request #287 · facelessuser/soupsieve

mundanevision20 · 2026-01-14T21:34:30Z

Hi @facelessuser ,
thank you for maintaining soupsieve; I appreciate your work.

Suggested changes

Fixes for pattern-context offsets, parser loop termination, namespace validation, week validation, and bitflags assignment.

The targeted fixes informed by reproduced edge cases and investigation notes to address incorrect offsets, potential parser non-termination on malformed input, namespace validation gaps, week validation edge cases, and an incorrect bitflag assignment. Changes are minimal and focused on root causes.

I'm happy to split this PR into smaller PRs or add a changelog entry if preferred.

facelessuser · 2026-01-14T21:51:16Z

I probably would like to see these broken up, but additionally, tests defined that show the use cases you are fixing to help justify the changes.

facelessuser · 2026-01-14T23:07:59Z

Just taking a quick look:

The change related to calendar week makes sense and was a missed case; this should be easily testable, demonstrating edge cases.
The CustomSelector and Namespaces validator change seems to be a valid change (honestly, I don't know why I wasn't validating the keys of the dictionary). Should be testable.
I'm curious if you were able to reproduce the infinite loop case, but the idea of handling a failed case makes sense. I'm curious if the issue can be produced or if you are just protecting against a theoretical case. Through development, I've never run into such a case as things "should" always be valid, but, obviously, the real-world may prove different 🙂.
I'm not sure if the flag case is fixing a real case or a perceived case. I'd like to see a test that fails before the fix that is now working after.
The index case, I'm also curious to see if there is a real case. I hadn't noticed issues with the indexing, but that doesn't mean there isn't one.

facelessuser · 2026-01-14T23:10:01Z

Also, changelog entries would be appreciated.

facelessuser · 2026-01-15T00:55:19Z

It looks like the changes break a few week :in-range and :out-of-range tests. It also seems to break a debug output case.

These changes need to be fully tested, and if an existing test needs to be changed, we need to be very sure the original case was wrong. From what I can tell, none of the failing tests should be failing. It seems the proposed changes actually introduce some regressions. So breaking these changes up and validating separately is definitely my preferred approach. I think some of these changes may not be desired.

facelessuser · 2026-01-15T02:45:16Z

It seems our :in-range and :out-of-range tests were wrong. The tests break because your change fixes the issue. It's likey I implemented this before browsers supported things as right now I can only get Chrome to show results.

facelessuser · 2026-01-15T02:48:07Z

I'm fairly certain the failure TestSyntaxErrorReporting.test_syntax_error_with_multiple_lines is a regression, but the week tests are actually an improvement.

facelessuser · 2026-01-15T03:04:10Z

The infinite loop, I suspect, doesn't actually occur in the wild. I'm thinking, if it does, it should likely throw an exception.

facelessuser

Since I started digging into this, I think we can address everything in this PR without splitting. I hadn't planned to do a deep dive on this, but I did.

soupsieve/css_match.py

facelessuser · 2026-01-15T03:11:34Z

soupsieve/css_parser.py

        # Some patterns require additional logic, such as default. We try to make these the
        # last pattern, and append the appropriate flag to that selector which communicates
        # to the matcher what additional logic is required.
+        # Preserve any flags that were set during parsing (e.g. :empty, :root)


I would need to see evidence via new test(s), showing why this is needed.

soupsieve/css_types.py

soupsieve/pretty.py

facelessuser · 2026-01-15T03:18:20Z

soupsieve/util.py

            col = index - last + 1
        elif last <= index < m.end(0):
            indent = '--> '
-            offset = (-1 if index > m.start(0) else 0) + 3


I'm not convinced this fixes anything, but tests show it regresses one of our tests. I'm open to making a change if a real case can be demonstrated, but it seems like the logic would have to be altered to keep the existing tests working and solve the new test case, if one was shown to need fixing.

facelessuser · 2026-01-15T14:31:48Z

I've actually picked out the verified fixes from this PR in #288. I've added appropriate tests and such and will merge them separately.

That leaves us just with two things:

Proving there is an issue that requires the flag fix.
Proving there is an issue that requires an indexing fix, and if so, providing a proper fix that doesn't break current tests.

mundanevision20 · 2026-01-16T11:30:38Z

Thanks @facelessuser for your valuable feedback on this PR. I'm very happy to see that you already merged some of the suggested changes. I'm very grateful for your time spent on this project. I'll need a few days for a deep dive to answer your questions.

Thanks again for your support!

facelessuser · 2026-01-18T16:14:56Z

@mundanevision20 I'm going to close this PR as it is now out of date. If your investigations into the other two issues end up turning up real-world problems, feel free to open up a new PR or issue so we can evaluate a path forward. I will probably go ahead and publish a release just to ensure the fix for the infinite loop problem gets out there, as it is the more serious issue.

mundanevision20 added 5 commits January 14, 2026 21:21

Fix bug in bitflags assignment

413deab

Fix bug in validate_week

c4ebe28

Enhance namespace validation

38a5012

Fix bug to prevent never ending loop

12a0b80

Fix offset in pattern context output

1876e43

gir-bot added S: needs-review Needs to be reviewed and/or approved. C: css-matching Related to CSS matching. C: css-parsing Related to CSS parsing. C: source Related to source code. labels Jan 14, 2026

facelessuser requested changes Jan 15, 2026

View reviewed changes

facelessuser closed this Jan 18, 2026

Uh oh!

Conversation

mundanevision20 commented Jan 14, 2026

Suggested changes

Uh oh!

facelessuser commented Jan 14, 2026

Uh oh!

facelessuser commented Jan 14, 2026

Uh oh!

facelessuser commented Jan 14, 2026

Uh oh!

facelessuser commented Jan 15, 2026

Uh oh!

facelessuser commented Jan 15, 2026

Uh oh!

facelessuser commented Jan 15, 2026

Uh oh!

facelessuser commented Jan 15, 2026

Uh oh!

facelessuser left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

facelessuser Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

facelessuser Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

facelessuser commented Jan 15, 2026

Uh oh!

mundanevision20 commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facelessuser commented Jan 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mundanevision20 commented Jan 16, 2026 •

edited

Loading