GH-103929: handle long input lines with PyOS_InputHook #103931

tacaswell · 2023-04-27T15:42:58Z

If:

stdin is in line buffer mode
the readline module is not loaded
not on windows
a GUI toolkit has installd PyOS_InputHook

Then, if the user enters lines longer than 98 characters into input:

user calls input(...)
user types/pastes a line longer than 98 characters + 1 newline
PyOS_InputHook returns
the first 99 characters are returned
the last character is not a new line so we go back to my_fgets
the PyOS_InputHook blocks because despite there being value in the buffer, stdin does not flag itself as ready to be read again
user hits enter again and to finish reading the input
the extra new line comes out the next time the user calls input

This fixes this bug by passing the currently read number of characters to my_fgets to skip the input hook when reading out a long buffer.

closes #103929

Qustions:

I do not know how to test this other than manually
It seems to work when stdin is unbuffered, but I do not fully understand why
Is the edit to the docs of PyOS_InputHook correct?

Issue: non-readline PyOS_StdioReadline when used with PyOS_InputHook fails is buggy with long input #103929

mdboom · 2023-05-02T15:19:48Z

I do not know how to test this other than manually

The usual way to test a C API call directly is add a method to the testcapi module. It seems like you could test this through a call to PyOS_StdioReadline, but it's not exactly trivial to set up the FILE objects and buffer them with data etc. I don't see any other tests in testcapi doing anything like that.

mdboom · 2023-05-19T17:11:13Z

@brandtbucher: Maybe you have enough context to review this since you've been in readline-adjacent things lately?

brandtbucher · 2023-05-19T18:52:06Z

I'll take a look ~~today~~ next week. Thanks for the PR!

tacaswell · 2023-06-30T22:52:01Z

rebased to handle conflicts with the sub-processor work.

tacaswell · 2023-10-17T18:48:12Z

Rebased again.

jdtsmith · 2024-04-13T23:57:48Z

@brandtbucher did you ever get a chance to review this one?

mdboom · 2024-06-11T11:56:50Z

@pablogsal, @ambv: If I understand correctly, users are more likely to hit this bug in 3.14 since the new REPL doesn't use readline. Should we try to get this into the next 3.14 beta?

pablogsal · 2024-06-11T12:17:03Z

@pablogsal, @ambv: If I understand correctly, users are more likely to hit this bug in 3.14 since the new REPL doesn't use readline. Should we try to get this into the next 3.14 beta?

This fix doesn't apply to the new REPL, so I don't think how this PR could help here

mdboom · 2024-06-11T12:53:28Z

@pablogsal, @ambv: If I understand correctly, users are more likely to hit this bug in 3.14 since the new REPL doesn't use readline. Should we try to get this into the next 3.14 beta?

This fix doesn't apply to the new REPL, so I don't think how this PR could help here

Got it. My bad.

jdtsmith · 2024-08-13T23:57:32Z

@pablogsal Does that mean the new REPL should/will not have this 100 char limit while PyOS_InputHook is installed?

tacaswell · 2024-08-14T13:34:19Z

The bug that is fixed is in the readline-replacement implementation that Python carries, not in the input hooks, so if there are bugs in the new repl with long lines, they will be different bugs ;)

jdtsmith · 2024-08-14T15:09:55Z

I'll take that as a solid maybe ;). Will report if I find similar long-line bugs with active input hooks. Update: re-reading, it seems you mean the bug remains, but that the new REPL has worked around it in another way. Those of us using input-based REPL's the still need this fix.

mdehoon · 2025-10-17T16:10:39Z

This bug shows up not only with input() but also with interactive Python when typing commands at the Python prompt.

For example, running the commands in this script one-by-one will freeze Python after long_string until the user hits enter one more time:

>>> import tkinter
>>> t = tkinter.Tk()
>>> short_string = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz"
>>> long_string = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz"

>>> >>> ^D

(as @tacaswell mentioned, the bug only appears on Python without readline).

If: - stdin is in line buffer mode - the readline module is not loaded - not on windows - a GUI toolkit has installd PyOS_InputHook Then, if the user enters lines longer than 98 characters into `input`: - user calls `input(...)` - user types/pastes a line longer than 98 characters + 1 newline - PyOS_InputHook returns - the first 99 characters are returned - the last character is not a new line so we go back to my_fgets - the PyOS_InputHook blocks because despite there being value in the buffer, stdin does not flag itself as ready to be read again - user hits enter again and to finish reading the input - the extra new line comes out the next time the user calls input This fixes this bug by passing the currently read number of characters to `my_fgets` to skip the input hook when reading out a long buffer. closes python#103929

jdtsmith · 2025-10-17T16:24:07Z

It would be nice to get this reviewed and merged. For now the only workaround is import readline, which may be undesirable.

mdehoon · 2025-10-17T16:29:13Z

It seems to work when stdin is unbuffered, but I do not fully understand why

When stdin is unbuffered, then fgets reads directly from stdin and leaves any characters in stdin that do not fit in the fgets buffer. PyOS_InputHook will then see those remaining characters, and return.

When stdin is buffered, then fgets reads the data from stdin into an internal buffer. This internal buffer is likely bigger than even the long input line, so it will read the full line into its internal buffer, leaving nothing in stdin. PyOS_InputHook then doesn't find anything in stdin, and will block waiting for new input.

tacaswell · 2025-10-17T16:39:15Z

rebased to re-trigger CI.

mdehoon · 2025-10-17T16:46:57Z

Is the edit to the docs of PyOS_InputHook correct?

(meaning this sentence: "This function [PyOS_InputHook] should block until stdin is readable.").

I have not seen it described in the documentation, but this is how PyOS_InputHook is used in practice (e.g. in the EventHook function in _tkinter.c, and in other GUI toolkits such as PyGTK). I think it is the only reasonable way to use PyOS_InputHook, and it would be good to have its usage described somewhere in the documentation.

But, if so, then the current PR is not ideal. Once we start calling fgets, then we have already decided that there is input available on stdin, and there is no point in calling PyOS_InputHook again. Then I would think that PyOS_InputHook needs to be called only once, just before we get into the do { ... } while (p[n-1] != '\n'); loop, and it does not need to be called from my_fgets.

tacaswell · 2025-10-17T17:01:39Z

I think we need the inputhook inside of that do...while loop as if the input hook returns early (like some flavors on windows do).

Maybe it should be rephrased to:

This function [PyOS_InputHook] should block until stdin is readable but may return early

This lets you run a loop like "run UI for 5ms, check stid, run UI for 5ms ...". This bit of code is here due to https://github.com/python/cpython/pull/7978/files which I put in with guidance from @zooba .

mdehoon · 2025-10-17T21:44:31Z

This lets you run a loop like "run UI for 5ms, check stid, run UI for 5ms ...".

This loop happens inside of the PyOS_InputHook function. See the EventHook function in Modules\_tkinter.c, which boils down to

    while (!stdin_ready) {
        int result = Tcl_DoOneEvent(TCL_DONT_WAIT);
        if (result == 0)
            Sleep(Tkinter_busywaitinterval);
    }

There is no need to have such a second loop on top of it outside of PyOS_InputHook.

See also pull request #140147 which gets rid of the Windows-specific code in EventHook.

tacaswell · 2025-10-20T18:05:57Z

But does it happen inside the function for all function currently in the wild? I have a recent-ish survey here: #119843 (comment)

mdehoon · 2025-10-21T00:57:33Z

But does it happen inside the function for all function currently in the wild? I have a recent-ish survey here: #119843 (comment)

In the main packages (matplotlib, PyQt5, PyQt6, PySide, and PyGTK), PyOS_InputHook points to a function that returns once data is available on stdin.

PyGObject (which superseded PyGTK) does not use PyOS_InputHook.

For wxPython: there was a discussion about it 16 years ago (see https://discuss.wxpython.org/t/interactive-usage-of-wxpython/32128/22, where I was making the same comments). But I did not find any (recent or old) version of wxPython that actually uses PyOS_InputHook, so I guess it was never implemented.

Now there may be some module in the wild that assumes that PyOS_InputHook is called repeatedly. I didn't find such a module, but if it exists, its event loop will freeze with Python without readline support.

tacaswell requested review from lysnikolaou and pablogsal as code owners April 27, 2023 15:42

bedevere-bot mentioned this pull request Apr 27, 2023

non-readline PyOS_StdioReadline when used with PyOS_InputHook fails is buggy with long input #103929

Open

bedevere-bot added the awaiting review label Apr 27, 2023

tacaswell mentioned this pull request Apr 27, 2023

[Bug]: matplotlib figures inhibit input longer than 100 chars in IPython matplotlib/matplotlib#25756

Closed

tacaswell force-pushed the fix/pyos_inputhook_longlines branch from a63c908 to 1e6fc5b Compare April 27, 2023 22:41

mdboom requested a review from brandtbucher May 19, 2023 17:10

brandtbucher self-assigned this May 19, 2023

tacaswell force-pushed the fix/pyos_inputhook_longlines branch from 1e6fc5b to e3759ab Compare June 30, 2023 22:51

tacaswell force-pushed the fix/pyos_inputhook_longlines branch from e3759ab to e69cd68 Compare October 17, 2023 18:48

tacaswell force-pushed the fix/pyos_inputhook_longlines branch from e69cd68 to 281588a Compare October 17, 2023 18:49

tacaswell force-pushed the fix/pyos_inputhook_longlines branch from 281588a to 5f76a95 Compare December 4, 2023 20:04

tacaswell force-pushed the fix/pyos_inputhook_longlines branch from 5f76a95 to c433296 Compare June 10, 2024 21:04

tacaswell force-pushed the fix/pyos_inputhook_longlines branch from c433296 to 1d13392 Compare October 17, 2025 16:20

DOC: add additional qualifier to expected behavior of PyOS_InputHook

ffad7f7

Uh oh!

Uh oh!

GH-103929: handle long input lines with PyOS_InputHook #103931

Are you sure you want to change the base?

GH-103929: handle long input lines with PyOS_InputHook #103931

Uh oh!

Conversation

tacaswell commented Apr 27, 2023 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdboom commented May 2, 2023

Uh oh!

mdboom commented May 19, 2023

Uh oh!

brandtbucher commented May 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tacaswell commented Jun 30, 2023

Uh oh!

tacaswell commented Oct 17, 2023

Uh oh!

jdtsmith commented Apr 13, 2024

Uh oh!

mdboom commented Jun 11, 2024

Uh oh!

pablogsal commented Jun 11, 2024

Uh oh!

mdboom commented Jun 11, 2024

Uh oh!

jdtsmith commented Aug 13, 2024

Uh oh!

tacaswell commented Aug 14, 2024

Uh oh!

jdtsmith commented Aug 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdehoon commented Oct 17, 2025

Uh oh!

jdtsmith commented Oct 17, 2025

Uh oh!

mdehoon commented Oct 17, 2025

Uh oh!

tacaswell commented Oct 17, 2025

Uh oh!

mdehoon commented Oct 17, 2025

Uh oh!

tacaswell commented Oct 17, 2025

Uh oh!

mdehoon commented Oct 17, 2025

Uh oh!

tacaswell commented Oct 20, 2025

Uh oh!

mdehoon commented Oct 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

tacaswell commented Apr 27, 2023 •

edited by bedevere-bot

Loading

brandtbucher commented May 19, 2023 •

edited

Loading

jdtsmith commented Aug 14, 2024 •

edited

Loading