Skip to content

Conversation

@ian-at-airbyte
Copy link
Contributor

@ian-at-airbyte ian-at-airbyte commented Nov 19, 2025

What

This PR expands the Snowflake V4 migration guide. It includes more detailed instructions about how to handle legacy raw tables, particularly the use case where both raw and final tables are needed. It also provides more active clarification about how to clean up unnecessary raw tables after upgrade.

This is based on a customer request after they indicated they were not comfortable proceeding without more specific instructions.

How

Review guide

User Impact

Can this PR be safely reverted and rolled back?

  • YES 💚
  • NO ❌

@github-actions
Copy link
Contributor

👋 Greetings, Airbyte Team Member!

Here are some helpful tips and reminders for your convenience.

Helpful Resources

PR Slash Commands

Airbyte Maintainers (that's you!) can execute the following slash commands on your PR:

  • /format-fix - Fixes most formatting issues.
  • /bump-version - Bumps connector versions.
    • You can specify a custom changelog by passing changelog. Example: /bump-version changelog="My cool update"
    • Leaving the changelog arg blank will auto-populate the changelog from the PR title.
  • /run-cat-tests - Runs legacy CAT tests (Connector Acceptance Tests)
  • /build-connector-images - Builds and publishes a pre-release docker image for the modified connector(s).
  • JVM connectors:
    • /update-connector-cdk-version connector=<CONNECTOR_NAME> - Updates the specified connector to the latest CDK version.
      Example: /update-connector-cdk-version connector=destination-bigquery
    • /bump-bulk-cdk-version bump=patch changelog='foo' - Bump the Bulk CDK's version. bump can be major/minor/patch.
  • Python connectors:
    • /poe connector source-example lock - Run the Poe lock task on the source-example connector, committing the results back to the branch.
    • /poe source example lock - Alias for /poe connector source-example lock.
    • /poe source example use-cdk-branch my/branch - Pin the source-example CDK reference to the branch name specified.
    • /poe source example use-cdk-latest - Update the source-example CDK dependency to the latest available version.

📝 Edit this welcome message.


If you do not interact with the raw tables, you can safely upgrade. There is no breakage for this usecase.
:::note
Version 4 of the Snowfalke destination uses the `airbyte_internal` database for temporary scratch space (for example, streams running in dedup mode, truncate refreshes, and overwrite syncs). Dropping the entire `airbyte_internal database` can interrupt active syncs and cause data loss. Only drop the specific raw tables you no longer need.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚫 [vale] reported by reviewdog 🐶
[Vale.Spelling] Did you really mean 'Snowfalke'?


If you do not interact with the raw tables, you can safely upgrade. There is no breakage for this usecase.
:::note
Version 4 of the Snowfalke destination uses the `airbyte_internal` database for temporary scratch space (for example, streams running in dedup mode, truncate refreshes, and overwrite syncs). Dropping the entire `airbyte_internal database` can interrupt active syncs and cause data loss. Only drop the specific raw tables you no longer need.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚫 [vale] reported by reviewdog 🐶
[Vale.Spelling] Did you really mean 'dedup'?

@github-actions
Copy link
Contributor

github-actions bot commented Nov 19, 2025

Deploy preview for airbyte-docs ready!

✅ Preview
https://airbyte-docs-nm706zn7m-airbyte-growth.vercel.app

Built with commit 6cf894e.
This pull request is being automatically deployed with vercel-action

…69729)

Co-authored-by: Devin AI <158243242+devin-ai-integration[bot]@users.noreply.github.com>
Co-authored-by: [email protected] <[email protected]>
The number of underscores between `raw` and `stream` may vary depending on the longest underscore sequence in your namespace and stream names.

:::note
Version 4 of the Snowflake destination uses the `airbyte_internal` schema for temporary scratch space. For example, Airbyte needs this for streams running in dedup mode, truncate refreshes, and overwrite syncs. Dropping the entire `airbyte_internal` schema can interrupt active syncs and cause data loss. Only drop the specific raw tables you no longer need.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚫 [vale] reported by reviewdog 🐶
[Vale.Spelling] Did you really mean 'dedup'?

@ian-at-airbyte ian-at-airbyte marked this pull request as ready for review November 20, 2025 00:26
@ian-at-airbyte ian-at-airbyte requested a review from a team as a code owner November 20, 2025 00:26
@ian-at-airbyte ian-at-airbyte changed the title Docs snowflake 4 detailed migration guide [DO NOT MERGE] Docs snowflake 4 detailed migration guide Nov 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants