Commit e442e7b

Merge branch 'main' into patch-1
2 parents 871a471 + 3f23888 commit e442e7b

79 files changed (+949, -474 lines)

.pre-commit-config.yaml

Lines changed: 4 additions & 0 deletions
@@ -14,6 +14,10 @@ repos:
         name: Run Ruff (lint) on Tools/build/
         args: [--exit-non-zero-on-fix, --config=Tools/build/.ruff.toml]
         files: ^Tools/build/
+      - id: ruff
+        name: Run Ruff (lint) on Tools/i18n/
+        args: [--exit-non-zero-on-fix, --config=Tools/i18n/.ruff.toml]
+        files: ^Tools/i18n/
       - id: ruff
         name: Run Ruff (lint) on Argument Clinic
         args: [--exit-non-zero-on-fix, --config=Tools/clinic/.ruff.toml]

Doc/howto/descriptor.rst

Lines changed: 1 addition & 1 deletion
@@ -420,7 +420,7 @@ Here are three practical data validation utilities:

         def validate(self, value):
             if not isinstance(value, str):
-                raise TypeError(f'Expected {value!r} to be an str')
+                raise TypeError(f'Expected {value!r} to be a str')
             if self.minsize is not None and len(value) < self.minsize:
                 raise ValueError(
                     f'Expected {value!r} to be no smaller than {self.minsize!r}'
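
The hunk above only changes the wording of an error message. As a rough illustration of where that message surfaces, here is a condensed sketch of a string-validating descriptor. The names `StringField`, `Component` and `try_component` are hypothetical, and the code is a simplified stand-in rather than the howto's full listing.

```python
class StringField:
    """Hypothetical, simplified descriptor that only accepts str values."""

    def __init__(self, minsize=None):
        self.minsize = minsize

    def __set_name__(self, owner, name):
        # Store the value on the instance under a private attribute name.
        self.private_name = '_' + name

    def __get__(self, obj, objtype=None):
        if obj is None:
            return self
        return getattr(obj, self.private_name)

    def __set__(self, obj, value):
        self.validate(value)
        setattr(obj, self.private_name, value)

    def validate(self, value):
        if not isinstance(value, str):
            raise TypeError(f'Expected {value!r} to be a str')
        if self.minsize is not None and len(value) < self.minsize:
            raise ValueError(
                f'Expected {value!r} to be no smaller than {self.minsize!r}'
            )


class Component:
    name = StringField(minsize=3)

    def __init__(self, name):
        self.name = name  # triggers StringField.__set__


def try_component(name):
    """Return 'ok' or a short description of the validation error."""
    try:
        Component(name)
    except (TypeError, ValueError) as exc:
        return f'{type(exc).__name__}: {exc}'
    return 'ok'
```

Calling `try_component(42)` reports the corrected `TypeError` text, while `try_component('ab')` reports the unchanged `ValueError` text.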

Doc/library/unicodedata.rst

Lines changed: 68 additions & 32 deletions
@@ -25,80 +25,133 @@ Standard Annex #44, `"Unicode Character Database"
 <https://www.unicode.org/reports/tr44/>`_. It defines the
 following functions:

+.. seealso::
+
+   The :ref:`unicode-howto` for more information about Unicode and how to use
+   this module.
+

 .. function:: lookup(name)

    Look up character by name. If a character with the given name is found, return
    the corresponding character. If not found, :exc:`KeyError` is raised.
+   For example::
+
+      >>> unicodedata.lookup('LEFT CURLY BRACKET')
+      '{'
+
+   The characters returned by this function are the same as those produced by
+   the ``\N`` escape sequence in string literals. For example::
+
+      >>> unicodedata.lookup('MIDDLE DOT') == '\N{MIDDLE DOT}'
+      True

    .. versionchanged:: 3.3
       Support for name aliases [#]_ and named sequences [#]_ has been added.


-.. function:: name(chr[, default])
+.. function:: name(chr, default=None, /)

    Returns the name assigned to the character *chr* as a string. If no
    name is defined, *default* is returned, or, if not given, :exc:`ValueError` is
-   raised.
+   raised. For example::
+
+      >>> unicodedata.name('½')
+      'VULGAR FRACTION ONE HALF'
+      >>> unicodedata.name('\uFFFF', 'fallback')
+      'fallback'


-.. function:: decimal(chr[, default])
+.. function:: decimal(chr, default=None, /)

    Returns the decimal value assigned to the character *chr* as integer.
    If no such value is defined, *default* is returned, or, if not given,
-   :exc:`ValueError` is raised.
+   :exc:`ValueError` is raised. For example::

+      >>> unicodedata.decimal('\N{ARABIC-INDIC DIGIT NINE}')
+      9
+      >>> unicodedata.decimal('\N{SUPERSCRIPT NINE}', -1)
+      -1

-.. function:: digit(chr[, default])
+
+.. function:: digit(chr, default=None, /)

    Returns the digit value assigned to the character *chr* as integer.
    If no such value is defined, *default* is returned, or, if not given,
-   :exc:`ValueError` is raised.
+   :exc:`ValueError` is raised::
+
+      >>> unicodedata.digit('\N{SUPERSCRIPT NINE}')
+      9


-.. function:: numeric(chr[, default])
+.. function:: numeric(chr, default=None, /)

    Returns the numeric value assigned to the character *chr* as float.
    If no such value is defined, *default* is returned, or, if not given,
-   :exc:`ValueError` is raised.
+   :exc:`ValueError` is raised::
+
+      >>> unicodedata.numeric('½')
+      0.5


 .. function:: category(chr)

    Returns the general category assigned to the character *chr* as
-   string.
+   string. General category names consist of two letters.
+   See the `General Category Values section of the Unicode Character
+   Database documentation <https://www.unicode.org/reports/tr44/#General_Category_Values>`_
+   for a list of category codes. For example::
+
+      >>> unicodedata.category('A') # 'L'etter, 'u'ppercase
+      'Lu'


 .. function:: bidirectional(chr)

    Returns the bidirectional class assigned to the character *chr* as
    string. If no such value is defined, an empty string is returned.
+   See the `Bidirectional Class Values section of the Unicode Character
+   Database <https://www.unicode.org/reports/tr44/#Bidi_Class_Values>`_
+   documentation for a list of bidirectional codes. For example::
+
+      >>> unicodedata.bidirectional('\N{ARABIC-INDIC DIGIT SEVEN}') # 'A'rabic, 'N'umber
+      'AN'


 .. function:: combining(chr)

    Returns the canonical combining class assigned to the character *chr*
    as integer. Returns ``0`` if no combining class is defined.
+   See the `Canonical Combining Class Values section of the Unicode Character
+   Database <https://www.unicode.org/reports/tr44/#Canonical_Combining_Class_Values>`_
+   for more information.


 .. function:: east_asian_width(chr)

    Returns the east asian width assigned to the character *chr* as
-   string.
+   string. For a list of widths and for more information, see the
+   `Unicode Standard Annex #11 <https://www.unicode.org/reports/tr11/>`_.


 .. function:: mirrored(chr)

    Returns the mirrored property assigned to the character *chr* as
    integer. Returns ``1`` if the character has been identified as a "mirrored"
-   character in bidirectional text, ``0`` otherwise.
+   character in bidirectional text, ``0`` otherwise. For example::
+
+      >>> unicodedata.mirrored('>')
+      1


 .. function:: decomposition(chr)

    Returns the character decomposition mapping assigned to the character
    *chr* as string. An empty string is returned in case no such mapping is
-   defined.
+   defined. For example::
+
+      >>> unicodedata.decomposition('Ã')
+      '0041 0303'


 .. function:: normalize(form, unistr)
@@ -122,9 +175,9 @@ following functions:
    normally would be unified with other characters. For example, U+2160 (ROMAN
    NUMERAL ONE) is really the same thing as U+0049 (LATIN CAPITAL LETTER I).
    However, it is supported in Unicode for compatibility with existing character
-   sets (e.g. gb2312).
+   sets (for example, gb2312).

-   The normal form KD (NFKD) will apply the compatibility decomposition, i.e.
+   The normal form KD (NFKD) will apply the compatibility decomposition, that is,
    replace all compatibility characters with their equivalents. The normal form KC
    (NFKC) first applies the compatibility decomposition, followed by the canonical
    composition.
@@ -133,6 +186,7 @@ following functions:
    a human reader, if one has combining characters and the other
    doesn't, they may not compare equal.

+
 .. function:: is_normalized(form, unistr)

    Return whether the Unicode string *unistr* is in the normal form *form*. Valid
@@ -154,24 +208,6 @@ In addition, the module exposes the following constant:
    Unicode database version 3.2 instead, for applications that require this
    specific version of the Unicode database (such as IDNA).

-Examples:
-
-   >>> import unicodedata
-   >>> unicodedata.lookup('LEFT CURLY BRACKET')
-   '{'
-   >>> unicodedata.name('/')
-   'SOLIDUS'
-   >>> unicodedata.decimal('9')
-   9
-   >>> unicodedata.decimal('a')
-   Traceback (most recent call last):
-     File "<stdin>", line 1, in <module>
-   ValueError: not a decimal
-   >>> unicodedata.category('A') # 'L'etter, 'u'ppercase
-   'Lu'
-   >>> unicodedata.bidirectional('\u0660') # 'A'rabic, 'N'umber
-   'AN'
-

 .. rubric:: Footnotes
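
The per-function examples added in this file can be gathered into one script. The sketch below reuses the calls shown in the diff and adds one `normalize()` call for the U+2160 case described in the NFKD paragraph. The variable names are illustrative only.

```python
import unicodedata

# Name <-> character lookups.
brace = unicodedata.lookup('LEFT CURLY BRACKET')
half_name = unicodedata.name('\u00bd')            # the character '½'
no_name = unicodedata.name('\uffff', 'fallback')  # U+FFFF has no name

# decimal() is stricter than digit(): SUPERSCRIPT NINE only has a digit value,
# so decimal() falls back to the supplied default.
nine_decimal = unicodedata.decimal('\N{SUPERSCRIPT NINE}', -1)
nine_digit = unicodedata.digit('\N{SUPERSCRIPT NINE}')
half_value = unicodedata.numeric('\u00bd')

# Property lookups.
category = unicodedata.category('A')
bidi = unicodedata.bidirectional('\N{ARABIC-INDIC DIGIT SEVEN}')
is_mirrored = unicodedata.mirrored('>')
decomp = unicodedata.decomposition('\u00c3')      # the character 'Ã'

# NFKD replaces the compatibility character ROMAN NUMERAL ONE (U+2160)
# with LATIN CAPITAL LETTER I (U+0049).
roman_one = unicodedata.normalize('NFKD', '\u2160')
```

Passing a *default* avoids the `ValueError` that `name()`, `decimal()`, `digit()` and `numeric()` raise when the property is undefined for the character.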

Doc/library/zipimport.rst

Lines changed: 3 additions & 0 deletions
@@ -30,6 +30,9 @@ Any files may be present in the ZIP archive, but importers are only invoked for
 corresponding :file:`.pyc` file, meaning that if a ZIP archive
 doesn't contain :file:`.pyc` files, importing may be rather slow.

+.. versionchanged:: next
+   Zstandard (*zstd*) compressed zip file entries are supported.
+
 .. versionchanged:: 3.13
    ZIP64 is supported

3538

Doc/reference/lexical_analysis.rst

Lines changed: 2 additions & 2 deletions
@@ -628,10 +628,10 @@ to indicate that an ending quote ends the literal.
    STRING: [`stringprefix`] (`stringcontent`)
    stringprefix: <("r" | "u" | "b" | "br" | "rb"), case-insensitive>
    stringcontent:
-      | "'" ( !"'" `stringitem`)* "'"
-      | '"' ( !'"' `stringitem`)* '"'
       | "'''" ( !"'''" `longstringitem`)* "'''"
       | '"""' ( !'"""' `longstringitem`)* '"""'
+      | "'" ( !"'" `stringitem`)* "'"
+      | '"' ( !'"' `stringitem`)* '"'
    stringitem: `stringchar` | `stringescapeseq`
    stringchar: <any `source_character`, except backslash and newline>
    longstringitem: `stringitem` | newline
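
The diff does not say why the alternatives were reordered. One plausible reading, stated here as an assumption, is that alternatives are tried in order, so the triple-quoted forms must come first or `'''abc'''` would start by matching the empty string `''`. The sketch below uses Python's `re` alternation, which is also ordered, as a stand-in for the grammar. The patterns `SHORT` and `LONG` are simplified illustrations that ignore prefixes and escape sequences.

```python
import re

# Simplified stand-ins for the single-quoted and triple-quoted alternatives.
SHORT = r"'[^'\\\n]*'"
LONG = r"'''[\s\S]*?'''"

# Alternation in `re` tries the left-hand alternative first.
short_first = re.compile(f"{SHORT}|{LONG}")
long_first = re.compile(f"{LONG}|{SHORT}")

source = "'''abc'''"
old_order_match = short_first.match(source).group()
new_order_match = long_first.match(source).group()

# A plain single-quoted string still matches under the new order.
plain_match = long_first.match("'ab'").group()
```

With the short alternative first, only the leading `''` is consumed. With the long alternative first, the whole triple-quoted literal is matched.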

Doc/using/cmdline.rst

Lines changed: 2 additions & 2 deletions
@@ -369,8 +369,8 @@ Miscellaneous options
 .. option:: -R

    Turn on hash randomization. This option only has an effect if the
-   :envvar:`PYTHONHASHSEED` environment variable is set to ``0``, since hash
-   randomization is enabled by default.
+   :envvar:`PYTHONHASHSEED` environment variable is set to anything other
+   than ``random``, since hash randomization is enabled by default.

    On previous versions of Python, this option turns on hash randomization,
    so that the :meth:`~object.__hash__` values of str and bytes objects
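
The interaction between `-R` and `PYTHONHASHSEED` can be observed by hashing the same string in fresh interpreters. This is a minimal sketch; the helper `child_hash` is a hypothetical name, and it assumes `sys.executable` points at a working interpreter that can be launched as a subprocess.

```python
import os
import subprocess
import sys


def child_hash(seed, *interpreter_args):
    """Return hash('abc') as computed by a freshly started interpreter.

    `seed` is the value assigned to PYTHONHASHSEED in the child's environment;
    `interpreter_args` are extra command-line options such as '-R'.
    """
    env = dict(os.environ, PYTHONHASHSEED=seed)
    result = subprocess.run(
        [sys.executable, *interpreter_args, '-c', "print(hash('abc'))"],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return int(result.stdout)


# PYTHONHASHSEED=0 disables randomization, so two runs agree.
fixed_a = child_hash('0')
fixed_b = child_hash('0')

# Adding -R is expected to turn randomization back on despite the fixed seed,
# so these values will usually differ from run to run.
randomized = [child_hash('0', '-R') for _ in range(2)]
```

Only the fixed-seed case is deterministic; the `-R` results are shown for comparison and should not be relied on to differ in any particular run.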

Include/cpython/object.h

Lines changed: 4 additions & 0 deletions
@@ -491,3 +491,7 @@ PyAPI_FUNC(int) PyUnstable_TryIncRef(PyObject *);
 PyAPI_FUNC(void) PyUnstable_EnableTryIncRef(PyObject *);

 PyAPI_FUNC(int) PyUnstable_Object_IsUniquelyReferenced(PyObject *);
+
+/* Utility for the tp_traverse slot of mutable heap types that have no other
+ * references. */
+PyAPI_FUNC(int) _PyObject_VisitType(PyObject *op, visitproc visit, void *arg);

Include/cpython/pystate.h

Lines changed: 2 additions & 2 deletions
@@ -28,10 +28,10 @@ typedef int (*Py_tracefunc)(PyObject *, PyFrameObject *, int, PyObject *);
 #define PyTrace_OPCODE 7

 /* Remote debugger support */
-#define Py_MAX_SCRIPT_PATH_SIZE 512
+#define _Py_MAX_SCRIPT_PATH_SIZE 512
 typedef struct {
     int32_t debugger_pending_call;
-    char debugger_script_path[Py_MAX_SCRIPT_PATH_SIZE];
+    char debugger_script_path[_Py_MAX_SCRIPT_PATH_SIZE];
 } _PyRemoteDebuggerSupport;

 typedef struct _err_stackitem {

Include/internal/pycore_abstract.h

Lines changed: 4 additions & 0 deletions
@@ -51,6 +51,10 @@ extern int _PyObject_RealIsSubclass(PyObject *derived, PyObject *cls);
 // Export for '_bisect' shared extension.
 PyAPI_FUNC(int) _Py_convert_optional_to_ssize_t(PyObject *, void *);

+// Convert Python int to Py_ssize_t. Do nothing if the argument is None.
+// Raises ValueError if argument is negative.
+PyAPI_FUNC(int) _Py_convert_optional_to_non_negative_ssize_t(PyObject *, void *);
+
 // Same as PyNumber_Index() but can return an instance of a subclass of int.
 // Export for 'math' shared extension.
 PyAPI_FUNC(PyObject*) _PyNumber_Index(PyObject *o);

Include/internal/pycore_debug_offsets.h

Lines changed: 1 addition & 1 deletion
@@ -368,7 +368,7 @@ typedef struct _Py_DebugOffsets {
         .remote_debugging_enabled = offsetof(PyInterpreterState, config.remote_debug), \
         .debugger_pending_call = offsetof(_PyRemoteDebuggerSupport, debugger_pending_call), \
         .debugger_script_path = offsetof(_PyRemoteDebuggerSupport, debugger_script_path), \
-        .debugger_script_path_size = Py_MAX_SCRIPT_PATH_SIZE, \
+        .debugger_script_path_size = _Py_MAX_SCRIPT_PATH_SIZE, \
     }, \
 }

