GH-139416: Fix portability of sendfile(2) support detection for Lustre filesystems #139417

bnikolic · 2025-09-29T12:50:33Z

On rare occasions, shutil.copyfile fails when the source file is on the AWS Lustre implementation, because the sendfile(2) syscall returns ENODATA. This patch works around the problem by disabling the sendfile(2) fast path, in the same way as when sendfile(2) is not available.
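The fallback described above can be sketched roughly as follows. This is a minimal illustration, not the actual Lib/shutil.py code: `GiveUpOnFastCopy`, `fast_copy`, `slow_copy` and `copy_file` are hypothetical names, the `sendfile` parameter exists only so a failing call can be simulated, and the set of errno values treated as "sendfile unsupported" is an assumption for the example.

```python
import errno
import os


class GiveUpOnFastCopy(Exception):
    """Hypothetical signal: the fast path cannot be used for these files."""


# Assumption for illustration: errno values treated as "sendfile unsupported".
UNSUPPORTED = {errno.ENOTSOCK, errno.ENODATA}


def fast_copy(infd, outfd, sendfile, blocksize=1 << 16):
    """Copy with a sendfile-like callable; give up if it is unsupported."""
    offset = 0
    while True:
        try:
            sent = sendfile(outfd, infd, offset, blocksize)
        except OSError as err:
            # Only give up on the first call, before any data was copied.
            if offset == 0 and err.errno in UNSUPPORTED:
                raise GiveUpOnFastCopy(err) from err
            raise
        if sent == 0:
            return  # end of file
        offset += sent


def slow_copy(infd, outfd, blocksize=1 << 16):
    """Plain read/write loop, used when the fast path gives up."""
    while chunk := os.read(infd, blocksize):
        view = memoryview(chunk)
        while view:
            # os.write() may write fewer bytes than requested.
            view = view[os.write(outfd, view):]


def copy_file(src, dst, sendfile=None):
    """Copy src to dst and return the name of the method that was used."""
    sendfile = sendfile or getattr(os, "sendfile", None)
    infd = os.open(src, os.O_RDONLY)
    try:
        outfd = os.open(dst, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
        try:
            if sendfile is not None:
                try:
                    fast_copy(infd, outfd, sendfile)
                    return "sendfile"
                except GiveUpOnFastCopy:
                    pass  # fall through to the portable loop
            slow_copy(infd, outfd)
            return "read/write"
        finally:
            os.close(outfd)
    finally:
        os.close(infd)
```

Passing a callable that raises `OSError(errno.ENODATA, ...)` as `sendfile` exercises the fallback without needing a Lustre mount; any other errno still propagates to the caller.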

Issue: Fix portability of sendfile(2) support detection for Lustre filesystems #139416

bedevere-app · 2025-09-29T12:50:39Z

Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool.

If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.

picnixz

Please add a NEWS entry.

Lib/shutil.py

picnixz

I would be fine with this, but is it possible that ENODATA could be raised for other reasons, where we shouldn't fall back to the legacy (non-sendfile) copy?

Lib/shutil.py

Improve whitespace Co-authored-by: Bénédikt Tran <[email protected]>

python-cla-bot · 2025-09-29T15:04:54Z

All commit authors signed the Contributor License Agreement.

bnikolic · 2025-09-29T15:26:02Z

Ah, there is one thing that comes to mind, thank you: what if ENODATA comes after some number of blocks have already been written? I need to check whether that is possible and/or how to handle it. Will convert this to a draft for the time being.

picnixz · 2025-09-29T15:30:15Z

> Ah, there is one thing that comes to mind, thank you: what if ENODATA comes after some number of blocks have already been written? I need to check whether that is possible and/or how to handle it. Will convert this to a draft for the time being.

Thank you for investigating this yes.

Lib/shutil.py

picnixz · 2025-09-30T10:30:09Z

Lib/shutil.py

+                dstpos = os.lseek(outfd, 0, os.SEEK_CUR)
+                if dstpos > 0:
+                    # Some data has already been written but we use
+                    # sendfile in a mode that does not update the
+                    # input fd position when reading. Hence seek the
+                    # input fd to the correct position before falling
+                    # back on POSIX read/write method. Since sendfile
+                    # requires mmapable infd, it should also be seekable
+                    os.lseek(infd, dstpos, os.SEEK_SET)


Suggested change

-                dstpos = os.lseek(outfd, 0, os.SEEK_CUR)
-                if dstpos > 0:
-                    # Some data has already been written but we use
-                    # sendfile in a mode that does not update the
-                    # input fd position when reading. Hence seek the
-                    # input fd to the correct position before falling
-                    # back on POSIX read/write method. Since sendfile
-                    # requires mmapable infd, it should also be seekable
-                    os.lseek(infd, dstpos, os.SEEK_SET)
+                # 'infd' and 'outfd' are assumed to be seekable,
+                # as they are checked to be "regular" files.
+                if offset > 0:
+                    # Some data has already been written but we use
+                    # sendfile in a mode that does not update the
+                    # input fd position when reading. Hence seek the
+                    # input fd to the correct position before falling
+                    # back on POSIX read/write method.
+                    os.lseek(infd, offset, os.SEEK_SET)

AFAIK, offset would match dstpos right? (or maybe offset + 1, I didn't think about it much here)

I will check this, but I think it would not work, because offset is updated only at the bottom of the loop, with the return value of sendfile() (which is the number of bytes actually sent). If we are catching the exception, there is no return value (I think it would be available in C). This is why the next condition does not use offset but rather the return value of lseek, even if it is just to check that no data has been written.

Thanks for the comments. I think I would leave the dstpos = os.lseek(outfd, 0, os.SEEK_CUR) in, in case sendfile has failed after writing some data. This would be consistent with the usage below (the "Give up on first call and if no data was copied" comment and the line after it), which I think was done for the same reason.
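The point about offset versus lseek can be shown with a small simulation. This is a sketch under stated assumptions: `flaky_sendfile` is a hypothetical stand-in that writes a few bytes and then raises ENODATA, and `copy_resuming_after_failure` and `demo` are illustrative names, not part of the patch. Whether the real syscall can fail this way on Lustre is not established in this thread; the example only shows why the position is taken from os.lseek on the output descriptor.

```python
import errno
import os
import tempfile


def flaky_sendfile(outfd, infd, offset, count):
    """Hypothetical stand-in for os.sendfile(): copies 4 bytes, then fails.

    Like sendfile() with an explicit offset, it leaves infd's position alone.
    """
    os.write(outfd, os.pread(infd, 4, offset))
    raise OSError(errno.ENODATA, os.strerror(errno.ENODATA))


def copy_resuming_after_failure(infd, outfd, sendfile, blocksize=1 << 16):
    """Try sendfile; on ENODATA continue with read/write from the right place.

    Returns (offset, dstpos) as seen when the error was caught, or None if
    the sendfile path finished without an error.
    """
    offset = 0
    try:
        while True:
            sent = sendfile(outfd, infd, offset, blocksize)
            if sent == 0:
                return None
            offset += sent  # only reached when the call returns normally
    except OSError as err:
        if err.errno != errno.ENODATA:
            raise
        # The exception carries no byte count, so ask the output fd instead.
        dstpos = os.lseek(outfd, 0, os.SEEK_CUR)
        if dstpos > 0:
            os.lseek(infd, dstpos, os.SEEK_SET)
        while chunk := os.read(infd, blocksize):
            while chunk:
                # os.write() may write fewer bytes than requested.
                chunk = chunk[os.write(outfd, chunk):]
        return offset, dstpos


def demo():
    """Run the simulation on temporary files; return (offset, dstpos, copy)."""
    data = b"0123456789"
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "src")
        dst = os.path.join(tmp, "dst")
        with open(src, "wb") as f:
            f.write(data)
        infd = os.open(src, os.O_RDONLY)
        outfd = os.open(dst, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
        try:
            offset, dstpos = copy_resuming_after_failure(
                infd, outfd, flaky_sendfile)
        finally:
            os.close(infd)
            os.close(outfd)
        with open(dst, "rb") as f:
            return offset, dstpos, f.read()
```

In this simulation `demo()` returns an offset of 0 but a dstpos of 4: the loop variable never saw the bytes written by the failing call, while the output descriptor's position did, and the copy still comes out complete.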

Lib/test/test_shutil.py

picnixz · 2025-09-30T10:33:15Z

Lib/test/test_shutil.py

+                                     side_effect=syscall) as m2:
+                with self.get_files() as (src, dst):
+                    with self.assertRaises(_GiveupOnFastCopy) as cm:
+                        self.zerocopy_fun(src, dst)


with (
    unittest.mock.patch('os.fstat', return_value=mock),
    unittest.mock.patch(self.PATCHPOINT, create=True, side_effect=syscall),
    self.get_files() as (src, dst)
):
    self.assertRaises(_GiveupOnFastCopy, self.zerocopy_fun, src, dst)

You'll need to dedent the rest of the code as well. Also, don't bind context managers if they are not used.
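The suggested test shape, with unbound context managers grouped in a single parenthesized `with`, might look like the following self-contained sketch. It does not use CPython's test helpers: `GiveUpOnFastCopy` and `fast_copy` are hypothetical stand-ins for `_GiveupOnFastCopy` and the function under test, `run_tests` is only a convenience wrapper, and parenthesized context managers assume Python 3.10 or later.

```python
import errno
import os
import tempfile
import unittest
from unittest import mock


class GiveUpOnFastCopy(Exception):
    """Hypothetical stand-in for shutil's private _GiveupOnFastCopy."""


def fast_copy(fsrc, fdst):
    """Hypothetical function under test: one sendfile call, ENODATA gives up."""
    try:
        os.sendfile(fdst.fileno(), fsrc.fileno(), 0, 1 << 16)
    except OSError as err:
        if err.errno == errno.ENODATA:
            raise GiveUpOnFastCopy(err) from err
        raise


class TestGiveUp(unittest.TestCase):
    def test_enodata_gives_up(self):
        def syscall(*args, **kwargs):
            raise OSError(errno.ENODATA, "simulated ENODATA")

        # The patch object is not bound with "as" because it is never used.
        with (
            mock.patch("os.sendfile", create=True, side_effect=syscall),
            tempfile.TemporaryFile() as src,
            tempfile.TemporaryFile() as dst,
        ):
            self.assertRaises(GiveUpOnFastCopy, fast_copy, src, dst)
            # Nothing reached the destination before giving up.
            self.assertEqual(dst.tell(), 0)


def run_tests():
    """Run the test case and return the unittest result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestGiveUp)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

Calling `assertRaises` with the callable and its arguments, rather than as a context manager, avoids one more level of nesting and an unused `cm` binding.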

Lib/test/test_shutil.py

Co-authored-by: Bénédikt Tran <[email protected]>

bnikolic · 2025-10-01T08:50:48Z

As this got a bit more complicated than I expected I'm going to see if I can back-port this into one of our production-like runs just to check nothing unexpected happens.

bnikolic · 2025-10-27T15:33:50Z

I have tested this in our AWS production env, and confirmed it fixed the observed issue.

pythonGH-139416: Fix copyfile failure due to sendfile + Lustre

ee336eb

bnikolic requested a review from giampaolo as a code owner September 29, 2025 12:50

bedevere-app bot added the awaiting review label Sep 29, 2025

bedevere-app bot mentioned this pull request Sep 29, 2025

Fix portability of sendfile(2) support detection for Lustre filesystems #139416

Open

picnixz changed the title ~~GH-139416: Fix copyfile failure due to sendfile + Lustre~~ GH-139416: Fix portability of sendfile(2) support detection for Lustre filesystems Sep 29, 2025

picnixz reviewed Sep 29, 2025

Lib/shutil.py

Bojan Nikolic added 2 commits September 29, 2025 13:24

Combine error check into existing ENOSTSOCK check

646868a

Add News entry

520ee09

picnixz reviewed Sep 29, 2025

Lib/shutil.py

Update Lib/shutil.py

a09ff4d

Improve whitespace Co-authored-by: Bénédikt Tran <[email protected]>

bnikolic marked this pull request as draft September 29, 2025 15:26

bedevere-app bot removed the awaiting review label Sep 29, 2025

Bojan Nikolic added 2 commits September 30, 2025 08:30

Handle if ENODATA error and some data already written

5917a6b

Add test for ENODATA handling

452c877

picnixz reviewed Sep 30, 2025

Lib/shutil.py

Bojan Nikolic added 5 commits September 30, 2025 08:58

fixup whitespace

faa9229

Further whitespace fixup

482fb09

Split handling of ENOTSOCK and ENODATA

883f158

clean comment and space

1316d6a

Add comment on seekable property of infd

3426f7b

picnixz reviewed Sep 30, 2025

bnikolic and others added 5 commits September 30, 2025 14:18

Update Lib/test/test_shutil.py

1b88269

Co-authored-by: Bénédikt Tran <[email protected]>

Update Lib/test/test_shutil.py

1513ea3

Co-authored-by: Bénédikt Tran <[email protected]>

Update Lib/test/test_shutil.py

f59253b

Co-authored-by: Bénédikt Tran <[email protected]>

Pull comment on seekability ahead of dstpos lseek

44c0184

Update test case style as per review

ef900b0

Bojan Nikolic added 2 commits October 1, 2025 08:52

Whitespace fixup

4987596

Another whitespace

e624cb5

bnikolic marked this pull request as ready for review October 27, 2025 15:33

bedevere-app bot added the awaiting review label Oct 27, 2025

bnikolic mentioned this pull request Oct 29, 2025

Package installation fails with os error 61 astral-sh/uv#15304

Open
