Skip to content

feat: add auto-update jobs for continuous dataset updates #130#131

Merged
Fedir-Yatsenko merged 9 commits intodevelopmentfrom
feat/93-continuous-datasets-onboarding-and-updates
Mar 3, 2026
Merged

feat: add auto-update jobs for continuous dataset updates #130#131
Fedir-Yatsenko merged 9 commits intodevelopmentfrom
feat/93-continuous-datasets-onboarding-and-updates

Conversation

@Fedir-Yatsenko
Copy link
Collaborator

@Fedir-Yatsenko Fedir-Yatsenko commented Feb 10, 2026

Applicable issues

Resolve #130

Description of changes

Implement auto-update functionality that checks for dataset changes and triggers reindexing when needed. This enables automated detection of upstream dataset version changes without manual intervention.

  • Add AutoUpdateJob model and migration for tracking update jobs.
  • Add API endpoints to trigger, list, and poll auto-update jobs.
  • Implement config resolution to resolve dynamic URNs (e.g., 'latest').
  • Compare structure and data hashes to detect changes.
  • Trigger reindexing workflow when changes are detected.

See examples of the responses in the comments below.

Checklist

By submitting this pull request, I confirm that my contribution is made under the terms of the MIT license.

Fedir-Yatsenko and others added 4 commits February 9, 2026 14:36
Implement auto-update functionality that checks for dataset changes and
triggers reindexing when needed. This enables automated detection of
upstream dataset version changes without manual intervention.

- Add AutoUpdateJob model and migration for tracking update jobs
- Add API endpoints to trigger, list, and poll auto-update jobs
- Implement config resolution to resolve dynamic URNs (e.g., 'latest')
- Compare structure and data hashes to detect changes
- Trigger reindexing workflow when changes are detected

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@Fedir-Yatsenko Fedir-Yatsenko self-assigned this Feb 10, 2026
@Fedir-Yatsenko Fedir-Yatsenko added enhancement New feature or request python Pull requests that update python code labels Feb 10, 2026
@Fedir-Yatsenko Fedir-Yatsenko linked an issue Feb 10, 2026 that may be closed by this pull request
@Fedir-Yatsenko
Copy link
Collaborator Author

Fedir-Yatsenko commented Feb 10, 2026

Examples of job results: image image

…s-onboarding-and-updates

# Conflicts:
#	statgpt/admin/services/dataset.py
#	statgpt/common/config/versions.py
#	statgpt/common/models/__init__.py
#	statgpt/common/models/models.py
#	statgpt/common/schemas/__init__.py
#	statgpt/common/schemas/enums.py
@Fedir-Yatsenko Fedir-Yatsenko marked this pull request as ready for review March 2, 2026 13:53
@Fedir-Yatsenko Fedir-Yatsenko requested a review from ypldan as a code owner March 2, 2026 13:53
@Fedir-Yatsenko Fedir-Yatsenko merged commit 0f30831 into development Mar 3, 2026
9 checks passed
@Fedir-Yatsenko Fedir-Yatsenko deleted the feat/93-continuous-datasets-onboarding-and-updates branch March 3, 2026 08:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request python Pull requests that update python code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Continuous datasets onboarding and updates Add auto-update jobs for continuous dataset updates

2 participants