Skip to content

Conversation

@JannikSt
Copy link
Member

@JannikSt JannikSt commented Jan 31, 2026

Summary

  • Move HF_HUB_ETAG_TIMEOUT=500 from eval-only to base envs so all entrypoints get it
  • Add HF_HUB_DOWNLOAD_TIMEOUT=300 for large model downloads

Both have 10s defaults in huggingface_hub which causes timeout issues on slow networks or when downloading large models.


Note

Low Risk
Low risk: Docker-only environment variable changes that affect download behavior but do not modify application logic or security.

Overview
Updates Dockerfile.cuda to bake in HuggingFace Hub timeout environment variables for all container entrypoints, increasing HF_HUB_ETAG_TIMEOUT to 500 and adding HF_HUB_DOWNLOAD_TIMEOUT=300 to better support slow networks and large model downloads.

Written by Cursor Bugbot for commit 6f2d83a. This will update automatically on new commits. Configure here.

Defaults are 10s which causes issues on slow networks or large model downloads.
@JannikSt JannikSt force-pushed the feature/hf-hub-resilience branch from 169a1ff to 6f2d83a Compare January 31, 2026 23:54
Copy link
Member

@Jackmin801 Jackmin801 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ye makes sense! lgtm

@Jackmin801 Jackmin801 merged commit e5566f9 into main Feb 1, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants