Skip to content

Conversation

@artem1205
Copy link
Contributor

@artem1205 artem1205 commented Mar 6, 2025

What

Resolve https://github.com/airbytehq/airbyte-internal-issues/issues/8861
add TypeTransformer based on pydanticV2 to speedup

How

Test in Docker;

  • source-hardcoded-records
  • 2_000_000 records
  • stream: Customers
  • command to run:
    /usr/bin/time -h docker run --rm -v $(pwd)/secrets:/secrets -v $(pwd)/integration_tests:/integration_tests airbyte/source-hardcoded-records:dev read --config /secrets/config.json --catalog /integration_tests/configured_catalog.json > /tmp/test.txt
TIME real user sys
TypeTransformer 12m40.88s 25.26s 1m28.35s
PydanticTypeTransformer 4m38.88s (2.73x) 11.36s 36.36s

Tip

🚀 🚀 🚀 Expected performance increase is 2.73x 🚀 🚀 🚀 For source that use defaulttransformation

@artem1205 artem1205 self-assigned this Mar 6, 2025
@github-actions github-actions bot added the enhancement New feature or request label Mar 6, 2025
Signed-off-by: Artem Inzhyyants <[email protected]>
Signed-off-by: Artem Inzhyyants <[email protected]>
Signed-off-by: Artem Inzhyyants <[email protected]>
@artem1205
Copy link
Contributor Author

@artem1205 artem1205 closed this Mar 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants