- 
          
- 
                Notifications
    You must be signed in to change notification settings 
- Fork 33.2k
gh-46236: PyUnicode docs improvements #129966
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Move Py_UNICODE to a new "Deprecated API" section. Formally soft-deprecate PyUnicode_READY, and move it Document and soft-deprecate PyUnicode_IS_READY, and move it
| @vstinner @serhiy-storchaka Do these changes look OK to you? | 
| .. c:function:: unsigned int PyUnicode_IS_ASCII(PyObject *unicode) | ||
| Return true if the string only contains ASCII characters. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| Return true if the string only contains ASCII characters. | |
| Return non-zero if the string only contains ASCII characters. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"Return true" is common for such functions (see for example PyUnicode_Check()).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have a couple of suggestion, but in general LGTM. 👍
| .. c:function:: unsigned int PyUnicode_IS_ASCII(PyObject *unicode) | ||
| Return true if the string only contains ASCII characters. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"Return true" is common for such functions (see for example PyUnicode_Check()).
| .. c:function:: unsigned int PyUnicode_CHECK_INTERNED(PyObject *str) | ||
| Return a non-zero value if *str* is interned, zero if not. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Most documentation uses "Return true" (60 occurrences), some use "Return non-zero" (13 occurrences) and one uses "Return a non-zero".
In this case using "Return a non-zero" looks justified, as it may encode additional information.
Co-authored-by: Serhiy Storchaka <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
| Thank you for the reviews! | 
Move deprecated PyUnicode API docs to new section Move Py_UNICODE to a new "Deprecated API" section. Formally soft-deprecate PyUnicode_READY, and move it Document and soft-deprecate PyUnicode_IS_READY, and move it Document PyUnicode_IS_ASCII, PyUnicode_CHECK_INTERNED PyUnicode_New docs: Clarify requirements for "fresh" strings PyUnicodeWriter_DecodeUTF8Stateful: Link "error-handlers" Co-authored-by: Serhiy Storchaka <[email protected]>
While planning to deprecate the
PyASCIIObjectstructs, I found some docs improvements. IMO, these should be applied (and backported) regardless of what happens toPyASCIIObject.Move deprecated PyUnicode API docs to new section
I intend to add more here. IMO, it's good practice to separate deprecated API out like this.
Document PyUnicode_IS_ASCII, PyUnicode_CHECK_INTERNED
PyUnicode_New: Clarify requirements for "fresh" strings
Also, refer to PyUnicode_New wrom all the "writers" for which you need to follow the requirements
PyUnicodeWriter_DecodeUTF8Stateful: Link "error-handlers"
📚 Documentation preview 📚: https://cpython-previews--129966.org.readthedocs.build/