[mypyc] feat: further optimize equality check with string literals [1/1] #19883

BobTheBuidler · 2025-09-18T18:15:53Z

This PR further optimizes string equality checks against literals by getting rid of the PyUnicode_GET_LENGTH call against the literal value, which is not necessary since the value is known at compile-time

I think this optimization will be helpful in cases where the non-literal string DOES match but is actually a subtype of string (actual strings instances that match would be caught by the identity check), or in cases where an exact string does NOT match. But we can also extend this implementation to use c-strings in certain cases where we know at compile-time that the literal value is compact ascii. Actually, maybe I should do that now? Thoughts?

for more information, see https://pre-commit.ci

mypyc/lib-rt/str_ops.c

BobTheBuidler · 2025-10-01T13:47:29Z

mypyc/irbuild/ll_builder.py

            return self.primitive_op(str_eq, [lhs, rhs], line)
        elif op == "!=":
-            eq = self.primitive_op(str_eq, [lhs, rhs], line)
+            if is_string_literal(lhs):


looking at this again, I think we can just refactor this whole block for "!=" into:

return self.add(ComparisonOp(compare_strings(lhs, rhs, line), self.false(), ComparisonOp.EQ, line)

for more information, see https://pre-commit.ci

BobTheBuidler · 2025-10-13T20:02:04Z

mypyc/lib-rt/str_ops.c

+    Py_ssize_t str1_length = PyUnicode_GET_LENGTH(str1);
+    if (str1_length != str2_length)
        return 0;
    int kind = PyUnicode_KIND(str1);


Can we deduce a literal's kind at compile time as well?

Looks like there isn't a good way to reliably do this

JukkaL · 2025-10-14T09:53:47Z

mypyc/irbuild/ll_builder.py

+            if is_string_literal(lhs):
+                if is_string_literal(rhs):
+                    # we can optimize out the check entirely in some constant-folded cases
+                    return self.true() if lhs.value == rhs.value else self.false()


Add a irbuild test cases for constant folding.

JukkaL · 2025-10-14T09:54:42Z

mypyc/irbuild/ll_builder.py

+                    return self.true() if lhs.value == rhs.value else self.false()
+
+                # if lhs argument is string literal, switch sides to match specializer C api
+                lhs, rhs = rhs, lhs


Add irbuild test case for string literal as the lhs.

BobTheBuidler and others added 15 commits September 18, 2025 18:14

feat: optimize equality check with string literals

f095d5f

[pre-commit.ci] auto fixes from pre-commit.com hooks

c3c04a5

for more information, see https://pre-commit.ci

refactor

6528245

[pre-commit.ci] auto fixes from pre-commit.com hooks

d781dde

for more information, see https://pre-commit.ci

Update ll_builder.py

2580b10

Update ll_builder.py

fea651d

[pre-commit.ci] auto fixes from pre-commit.com hooks

9ed369c

for more information, see https://pre-commit.ci

fix: missing ;

fb21187

fix name err

a0d36ec

Update CPy.h

577cd74

Update irbuild-dict.test

9bec581

Update irbuild-unreachable.test

bdee878

Merge branch 'master' into str-eq-literal

45f4885

Update irbuild-classes.test

613f644

Update ll_builder.py

e99864d

BobTheBuidler commented Oct 1, 2025

View reviewed changes

mypyc/lib-rt/str_ops.c Outdated Show resolved Hide resolved

BobTheBuidler commented Oct 1, 2025

View reviewed changes

BobTheBuidler and others added 7 commits October 1, 2025 09:54

refactor

0014212

refactor

4f8786f

[pre-commit.ci] auto fixes from pre-commit.com hooks

996c4d6

for more information, see https://pre-commit.ci

Update ll_builder.py

4054151

[pre-commit.ci] auto fixes from pre-commit.com hooks

f22d8ac

for more information, see https://pre-commit.ci

missing import

b95facd

Update ll_builder.py

e27c716

BobTheBuidler changed the title ~~[mypyc] feat: further optimize equality check with string literals~~ [mypyc] feat: further optimize equality check with string literals [1/1] Oct 1, 2025

BobTheBuidler added 2 commits October 2, 2025 03:14

Merge branch 'master' into str-eq-literal

8a66a0f

Merge branch 'master' into str-eq-literal

e3045d5

BobTheBuidler mentioned this pull request Oct 10, 2025

1.19 Release Planning #19964

Open

BobTheBuidler added 2 commits October 10, 2025 12:57

Merge branch 'master' into str-eq-literal

b5a4bf3

Update irbuild-classes.test

28c6510

Update str_ops.c

b957e14

BobTheBuidler commented Oct 13, 2025

View reviewed changes

Merge branch 'master' into str-eq-literal

4d491c1

JukkaL reviewed Oct 14, 2025

View reviewed changes

BobTheBuidler added 3 commits October 14, 2025 06:09

Update irbuild-str.test

ef0a46e

Update irbuild-str.test

60c093f

Update irbuild-str.test

e494f0c

JukkaL approved these changes Oct 14, 2025

View reviewed changes

JukkaL merged commit b8f57fd into python:master Oct 14, 2025
13 checks passed

BobTheBuidler deleted the str-eq-literal branch October 14, 2025 13:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[mypyc] feat: further optimize equality check with string literals [1/1] #19883

[mypyc] feat: further optimize equality check with string literals [1/1] #19883

Uh oh!

BobTheBuidler commented Sep 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

BobTheBuidler Oct 1, 2025

Uh oh!

BobTheBuidler Oct 13, 2025

Uh oh!

BobTheBuidler Oct 13, 2025

Uh oh!

JukkaL Oct 14, 2025

Uh oh!

JukkaL Oct 14, 2025

Uh oh!

BobTheBuidler Oct 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[mypyc] feat: further optimize equality check with string literals [1/1] #19883

[mypyc] feat: further optimize equality check with string literals [1/1] #19883

Uh oh!

Conversation

BobTheBuidler commented Sep 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

BobTheBuidler Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

BobTheBuidler Oct 13, 2025

Choose a reason for hiding this comment

Uh oh!

BobTheBuidler Oct 13, 2025

Choose a reason for hiding this comment

Uh oh!

JukkaL Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

JukkaL Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

BobTheBuidler Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

BobTheBuidler commented Sep 18, 2025 •

edited

Loading