- 
          
- 
                Notifications
    You must be signed in to change notification settings 
- Fork 33.2k
gh-90102: Remove isatty call during regular open #124922
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gh-90102: Remove isatty call during regular open #124922
Conversation
Co-authored-by: Victor Stinner <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
| Merged, thank you. At the first read, I didn't understand the relationship between S_ISCHR() and isatty(). But your comment makes sense and explains it correctly. | 
| """ | ||
| if (self._stat_atopen is not None | ||
| and not stat.S_ISCHR(self._stat_atopen.st_mode)): | ||
| return True | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this should be:
| return True | |
| return False | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep, Py_RETURN_FALSE; I got right in the C but not the pyio. Making a new PR to update....
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Co-authored-by: Victor Stinner <[email protected]>
TTYs are always character devices. If the interpreter knows a file is not a character device when it would call
isatty, skip the isatty call. Insideopen()in the same python library call there is a fresh stat result that contains that information. Use the stat result to skip a system call.This shortcut was suggested by @eryksun in 2021.
isattyis not necessarily constant over time, but inside the python library call which opened the fd and has yet to return the fd, it seems reasonable to rely on the fd not changing. The fd may be visible to caller code which passes in anopeneror intercepts specific calls.For gh-120754, with this change reading text with
open('README.md').read()is down to 6 system calls:When reading bytes with
buffering=0(disabling BufferedIO which remoevs thelseek), reading a regular file is down to 5 system calls (strace python -c "from pathlib import Path; Path('README.rst').read_bytes()")Benchmark
Run on a 2024 MacBook Air running Squoia 15.0
Benchmark code
Before:
After:
For files which are recently read / in OS filecache (or on fast devices), there is a ~15% performance overhead with
BufferedIO(see: #122111 where I updatedPath.read_bytes()and measured the change). Unfortunately multiple attempts I've made to do small reworks to make thelseekunnecessary inBufferedIOhave resulted in no performance change and a lot of complexity.I have been developing a more broad refactoring that could reduce the
BufferedIOoverhead as well as several other I/O overheads while meeting current API expectations (ex. each layer of the stack re-figures outreadableandwriteable, every call must re-validate with many branches the fd state and arguments,_pyioneeds to copy at least once in user-space becauseos.readcan't be passed a buffer / always allocates a new one, etc.).I'm planning to put together a talk on "Journey to the center of
open().read()" and submit it to present at a San Francisco bay area python meetup since as I've been working on this I've found it's a very intricate set of operations which didn't match my mental image in some interesting ways. Hope is to then do a second talk with sample working implementation which shows how could rework internals while keeping the existing Python I/O API to reduce overheads, increase readability, solve some longstanding bugs, and possibly enable usage ofio_uringfor more performance improvement. Overarching goal would be to get down to one largely python native I/O implementation with improved performance from the optimizations as well as opening new performance improvement avenues.@vstinner This replaces #121593 on top of #123412