
Conversation

@kyungeonchoi kyungeonchoi commented Dec 30, 2025

Story

  • When downloading a large number of files (~3000) from the MinIO/S3 backend, a small number of files (~10) are downloaded with an incorrect size, mostly near the end of the run. The size mismatch is small (less than 1% of the file size) and can be either smaller or larger than the expected size.
  • This happens only with downloads; signed URLs work fine.
  • Tried tuning the options of the MinioAdapter class (e.g. the TransferConfig for aioboto3 and the concurrency setting (_file_transfer_sem) in the download_file function), but nothing fixed the issue.
  • On the other hand, the standalone download using the CLI command servicex transforms download <request-id> works fine.

Updates

  • Download files using a .part suffix and rename them to the final filename only after validating the file size.
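For reference, a minimal sketch of the validate-then-rename step described above (illustrative only, assuming the fix's behavior; `finalize_download` is a hypothetical helper name, not the actual servicex code):

```python
import os
from pathlib import Path


def finalize_download(part_path: Path, expected_size: int) -> Path:
    """Validate the size of the '.part' temp file, then promote it to
    its final filename. A mismatched partial file is discarded so a
    later retry starts clean."""
    local_size = part_path.stat().st_size
    if local_size != expected_size:
        part_path.unlink(missing_ok=True)  # discard the bad partial file
        raise RuntimeError(
            f"size mismatch: local {local_size}, remote {expected_size}"
        )
    final_path = part_path.with_suffix("")  # strip the trailing ".part"
    # os.replace is atomic when source and destination share a filesystem,
    # so readers never observe a half-written final file.
    os.replace(part_path, final_path)
    return final_path
```

The point of the `.part` suffix is that consumers watching the output directory never pick up a file until its size has been verified.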

@kyungeonchoi kyungeonchoi added the bug Something isn't working label Dec 30, 2025

codecov bot commented Dec 30, 2025

Codecov Report

❌ Patch coverage is 60.00000% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 93.92%. Comparing base (3e9ab61) to head (c796005).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| servicex/minio_adapter.py | 66.66% | 2 Missing ⚠️ |
| servicex/query_core.py | 50.00% | 2 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #697      +/-   ##
==========================================
- Coverage   98.27%   93.92%   -4.35%     
==========================================
  Files          29       27       -2     
  Lines        2085     2074      -11     
==========================================
- Hits         2049     1948     -101     
- Misses         36      126      +90     
| Flag | Coverage Δ |
|---|---|
| unittests | 93.92% <60.00%> (-4.35%) ⬇️ |

Flags with carried forward coverage won't be shown.

# Now just wait until all of our tasks complete
await asyncio.gather(*download_tasks)
MAX_INFLIGHT = 100
if len(download_tasks) >= MAX_INFLIGHT:
Collaborator
don't we already control this via semaphore?

Contributor Author
The number of concurrent downloads is controlled by a semaphore in minio_adapter.py, but these new lines control the number of concurrent asyncio tasks. It may be okay to await thousands of tasks while only a small number are actively downloading. In any case, this doesn't fix the problem.
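To illustrate the distinction being discussed, here is a sketch of the batched-gather pattern the diff hints at (function name and structure are illustrative, not the actual servicex code): even with a per-download semaphore, every task object would otherwise exist at once, whereas batching bounds the number of live tasks.

```python
import asyncio

MAX_INFLIGHT = 100  # cap on simultaneously scheduled asyncio tasks


async def gather_in_batches(coros, batch_size=MAX_INFLIGHT):
    """Await coroutines in fixed-size batches so that at most
    `batch_size` tasks exist at any one time, rather than creating
    thousands of task objects up front."""
    results = []
    batch = []
    for coro in coros:
        batch.append(coro)
        if len(batch) >= batch_size:
            results.extend(await asyncio.gather(*batch))
            batch = []
    if batch:  # flush the final, possibly short, batch
        results.extend(await asyncio.gather(*batch))
    return results
```

A semaphore limits how many tasks run their critical section concurrently; this pattern additionally limits how many tasks are scheduled at all.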

localsize = path.stat().st_size

# Ensure filesystem flush visibility
await asyncio.sleep(0.05)
Collaborator
this is a magic number and might not work in general. Would it be enough to download to the .part file, rename, then stat the new file?
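The suggestion above could be sketched like this (`promote_and_stat` is a hypothetical helper, not servicex code): rename first, then stat the final file, so no fixed sleep is needed at all.

```python
import os
from pathlib import Path


def promote_and_stat(part_path: Path) -> int:
    """Rename 'name.ext.part' to 'name.ext', then return the size of
    the final file. os.replace is atomic on the same filesystem, so
    the stat afterwards sees the fully written file without relying
    on an arbitrary sleep for flush visibility."""
    final_path = part_path.with_suffix("")  # drop the trailing ".part"
    os.replace(part_path, final_path)
    return final_path.stat().st_size
```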

@kyungeonchoi (Contributor Author)

Here is an example of a size difference that leads to a download failure:

Download of root:::c114.af.uchicago.edu:1094::https:::dcgftp.usatlas.bnl.gov:443:pnfs:usatlas.bnl.gov:BNLT0D1:rucio:data17_13TeV:61:5b:DAOD_BPHY28.44180100._000018.pool.root.1 failed: local size - 290668105, remote size - 290671373
