
Conversation


@sfc-gh-mayliu sfc-gh-mayliu commented Dec 10, 2025

  1. Which Jira issue is this PR addressing? Make sure that there is an accompanying issue to your PR.

    Fixes SNOW-2872192

  2. Fill out the following pre-review checklist:

    • I am adding a new automated test(s) to verify correctness of my new code
      • If this test skips Local Testing mode, I'm requesting review from @snowflakedb/local-testing
    • I am adding new logging messages
    • I am adding a new telemetry message
    • I am adding new credentials
    • I am adding a new dependency
    • If this is a new feature/behavior, I'm adding the Local Testing parity changes.
    • I acknowledge that I have ensured my changes to be thread-safe. Follow the link for more information: Thread-safe Developer Guidelines
    • If adding any arguments to public Snowpark APIs or creating new public Snowpark APIs, I acknowledge that I have ensured my changes include AST support. Follow the link for more information: AST Support Guidelines
  3. Please describe how your code solves the related issue.

    Please write a short description of how your code change solves the related issue.

Added an overwrite_condition parameter to DataFrameWriter.save_as_table() that enables atomic, targeted delete-insert operations when used with mode="append". The delete and insert are wrapped in a transaction to ensure atomicity and to protect the table from being left in a bad state.

This performs a similar operation to PySpark's DataFrameWriterV2.overwrite(condition), where rows matching the condition are deleted from the target table before inserting all rows from the DataFrame.
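As a rough, hedged sketch of that pattern (not the actual implementation; the function, table name, and condition below are invented for illustration), the targeted delete-insert is conceptually equivalent to a DELETE followed by an append, both inside one transaction:

```python
# Illustration only: conceptual shape of the targeted delete-insert described above.
# The real logic lives inside DataFrameWriter.save_as_table(); names here are hypothetical.
from snowflake.snowpark import Session


def targeted_delete_insert(session: Session, df, table_name: str, condition_sql: str) -> None:
    session.sql("BEGIN").collect()
    try:
        # Delete only the rows matching the caller-supplied condition.
        session.sql(f"DELETE FROM {table_name} WHERE {condition_sql}").collect()
        # Insert every row from the incoming DataFrame.
        df.write.mode("append").save_as_table(table_name)
        session.sql("COMMIT").collect()
    except Exception:
        # Any failure (e.g. a schema mismatch on insert) rolls the whole operation back,
        # so the table is never left partially updated.
        session.sql("ROLLBACK").collect()
        raise
```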

For more details on this PR, refer to the JIRA issue, which contains the customer's code snippet.

Monorepo for AST: https://github.com/snowflake-eng/snowflake/pull/368680

@sfc-gh-mayliu sfc-gh-mayliu requested a review from a team December 11, 2025 02:47

@sfc-gh-joshi sfc-gh-joshi left a comment


I have two general questions about this PR:

  1. In Slack you mentioned this behavior is similar to Spark's DataFrameWriterV2.overwrite method. Why not implement that instead of adding a flag to saveAsTable? Doing so would keep the code simpler and avoid the potential semantic mismatch between the mode flag and overwrite_condition that @sfc-gh-aling mentioned.
  2. What happens if the schema of the new DataFrame differs from that of the original table? My guess from looking at the code is that the INSERT query would fail, which I assume would fail the whole transaction; but with mode="overwrite" we would expect the original table to be dropped and the operation to succeed, per our documentation:

"overwrite": Overwrite the existing table by dropping old table.
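For concreteness, the scenario behind question 2 might look roughly like this (a hedged sketch; the table, columns, and the condition-bearing call are invented, and the parameter shown is the one discussed in this PR):

```python
from snowflake.snowpark.functions import col

# `session` is an existing snowflake.snowpark.Session.
# Suppose the existing table "target" has columns (ID INT, NAME STRING),
# while the new DataFrame carries an extra SCORE column.
new_df = session.create_dataframe(
    [(1, "a", 10.0)], schema=["ID", "NAME", "SCORE"]
)

# Plain overwrite: the old table is dropped and recreated, so the schema
# difference is expected to succeed per the documentation quoted above.
new_df.write.mode("overwrite").save_as_table("target")

# With the condition parameter (hypothetical call): the question is whether the
# INSERT into the existing table fails and takes the whole transaction with it.
new_df.write.mode("overwrite").save_as_table(
    "target", overwrite_condition=col("NAME") == "a"
)
```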

@sfc-gh-mayliu
Collaborator Author

I have two general questions about this PR:

  1. In Slack you mentioned this behavior is similar to Spark's DataFrameWriterV2.overwrite method. Why not implement that instead of adding a flag to saveAsTable? Doing so would keep the code simpler and avoid the potential semantic mismatch between the mode flag and overwrite_condition that @sfc-gh-aling mentioned.
  2. What happens if the schema of the new DataFrame differs from that of the original table? My guess from looking at the code is that the INSERT query would fail, which I assume would fail the whole transaction; but with mode="overwrite" we would expect the original table to be dropped and the operation to succeed, per our documentation:

"overwrite": Overwrite the existing table by dropping old table.

Great questions @sfc-gh-joshi.

  1. The existing dataframe_writer.py APIs (save_as_table, copy_into_location, csv, json, etc.) each map to specific Snowflake SQL patterns, so we are cautious about adding new APIs. Since Snowflake SQL does not natively support a targeted delete-insert, this is only a client-level change to provide similar functionality to Spark users; adding a completely new API (e.g. overwrite()) would deviate from the existing pattern and expand the API surface unnecessarily.

Now that we've restricted overwrite_condition to only work with mode="overwrite", the semantics are clearer:

  • mode="overwrite" without overwrite_condition: Full table replacement (DROP + CREATE)
  • mode="overwrite" with overwrite_condition: Selective overwrite (DELETE matching rows + INSERT)

  Both are overwrite operations, just with different scopes.

  2. You are right that if the schema differs, the INSERT will fail and cause the entire transaction to roll back. But this is intentional behavior -- selective overwrite requires schema compatibility. If users explicitly provide the optional overwrite_condition, it is their responsibility to ensure that the new DataFrame's schema and the table's schema are compatible with each other and with overwrite_condition. This is similar to PySpark's DataFrameWriterV2.overwrite(condition), which also requires schema compatibility when selectively overwriting partitions.

If overwrite_condition is not provided, the original "overwrite the existing table by dropping old table" semantics are still preserved by default.
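To make the two scopes concrete, here is a hedged usage sketch of the merged behavior (the table and column names are invented, and whether the condition is passed as a Column expression or a SQL string is not shown in this thread, so the Column form below is an assumption):

```python
from snowflake.snowpark.functions import col

# `df` is an existing snowflake.snowpark.DataFrame.

# Full table replacement: existing behavior, drops and recreates the target table.
df.write.mode("overwrite").save_as_table("sales")

# Selective overwrite: deletes only the rows matching the condition, then inserts
# every row from df, with both steps wrapped in a single transaction.
df.write.mode("overwrite").save_as_table(
    "sales",
    overwrite_condition=col("REGION") == "EMEA",
)
```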

@sfc-gh-mayliu sfc-gh-mayliu merged commit 266334b into main Dec 11, 2025
29 checks passed
@sfc-gh-mayliu sfc-gh-mayliu deleted the SNOW-2872192-saveAsTable-targeted-delete-insert branch December 11, 2025 20:52
@github-actions github-actions bot locked and limited conversation to collaborators Dec 11, 2025