Skip to content

Conversation

@prabhath004
Copy link
Contributor

Description

Add optional max_file_size parameter to the AmazonS3 source to filter files by size in both list() and get_value() APIs. Files exceeding the specified limit are treated as non-existent and skipped during processing.

Changes

  • Added max_file_size: Option<i64> field to AmazonS3 Spec and Executor
  • Implemented file size filtering in list() method using S3 metadata
  • Implemented file size filtering in get_value() method using head_object()
  • Updated Python spec to expose the new parameter
  • Added documentation for the new parameter

Closes #1252

Add optional max_file_size parameter to filter files by size in both
list() and get_value() APIs. Files exceeding the limit are treated as
non-existent. Closes cocoindex-io#1252
Copy link
Member

@georgeh0 georgeh0 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the change!

@georgeh0 georgeh0 merged commit c1eaee9 into cocoindex-io:main Nov 5, 2025
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[FEATURE] Support max_file_size for AmazonS3 source

2 participants