Skip to content

Regex with the backreference to a non-greedy group fails unexpectedly #127291

@Frimaire

Description

@Frimaire

Bug report

Bug description:

Hello,

There is a problem in the regex module. The regular expression with the backreference to a non-greedy group fails (matches nothing) on some inputs unexpectedly.

For example:

import re

r1 = re.compile('(a+)+\\1')
print(r1.search('a' * 28))
# OK, matches the whole string

r2 = re.compile('(a+)+?\\1')
print(r2.search('a' * 28))
# runs very slowly and eventually returns None

This problem seems to exist even in Python 2.7, which has been fixed in a version of Python 3.11. Since version 3.9 is still under maintenance, would this problem be fixed in the following release? (or is the pull request about this problem accepted?)

Thank You!

CPython versions tested on:

3.9

Operating systems tested on:

Linux, Windows

Metadata

Metadata

Assignees

No one assigned

    Labels

    pendingThe issue will be closed if no feedback is providedtype-bugAn unexpected behavior, bug, or error

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions