Skip to content

Conversation

@Satvik-Singh192
Copy link
Contributor

Description

This pull request refactors transform(df) function in app/etl/transform.py into smaller, composable helper functions (_remove_duplicates, _handle_nulls, _standardize_dates, _validate_types).

The main transform function now acts as an orchestrator, calling these helpers in sequence.

This change directly addresses the request in issue #37 to improve maintainability, code readability, and test granularity. A new test file, tests/test_transform_helpers.py, has been added to provide specific unit tests for each new helper function.

Semver Changes

  • [] Patch (bug fix, no new features)
  • Minor (new features, no breaking changes)
  • Major (breaking changes)

Issues

Closes #37

Checklist

@Dheerajyadav1 Dheerajyadav1 merged commit aa97624 into OPCODE-Open-Spring-Fest:main Nov 8, 2025
5 of 6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Feature]: Split Transform into composable functions with unit tests per step

2 participants