Replies: 2 comments
-
Not sure if this could be related, but I also get lots of these errors:
WITH RUST_BACKTRACE=1:
WITH RUST_BACKTRACE=full:
Also getting these DataFusion warnings:
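Side note on capturing those backtraces: RUST_BACKTRACE only needs to be set in the environment before the failing deltalake call runs, for example (a sketch, not from the original report):

# Sketch: enable Rust backtraces for the native deltalake code from Python.
import os

os.environ["RUST_BACKTRACE"] = "full"  # or "1" for a shorter trace

import deltalake  # set the variable before the deltalake calls that panic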
-
I created a fully reproducible example and found a way to make the merge skip files. There is a comment pointing out which line solved it, for anyone to test with or without it. Look for: # THIS LINE MAKES THE SKIPPER WORK

# !pip install 'polars[pyarrow]' deltalake
import polars as pl
import deltalake

partitions = ["tennant"]
primary_key = "id"


def read_delta_table():
    try:
        return deltalake.DeltaTable("./delta")
    except:
        return "./delta"


def write(df):
    predicate = " AND ".join(
        [f"s.{primary_key} = t.{primary_key}"]  # base predicate + for each partition
        + [f"s.{partition} = t.{partition}" for partition in partitions]
    )
    # read current tennant
    tennant = df[partitions[0]][0]
    # THIS LINE MAKES THE SKIPPER WORK
    predicate = predicate + f" AND t.{partitions[0]} = '{tennant}' "
    # no effect
    # predicate = predicate + f" AND s.{partitions[0]} = '{tennant}' "
    #
    try:
        dt = deltalake.DeltaTable("./data/delta")
        metrics = (
            df.write_delta(
                dt,
                mode="merge",
                delta_write_options={
                    # "writer_properties": deltalake.WriterProperties(compression=compression.upper()),
                    "partition_by": partitions,
                    "schema_mode": "overwrite",
                },
                delta_merge_options={
                    "predicate": predicate,
                    "source_alias": "s",
                    "target_alias": "t",
                },
            )
            .when_matched_update_all()
            .when_not_matched_insert_all()
            .execute()
        )
    except deltalake.exceptions.TableNotFoundError:
        dt = "./data/delta"
        metrics = df.write_delta(
            dt,
            mode="overwrite",
            delta_write_options={
                # "writer_properties": deltalake.WriterProperties(compression=compression.upper()),
                "partition_by": partitions,
                "schema_mode": "overwrite",
            },
        )
    if metrics:
        print(f"source_size: {len(df)}")
        print(f"Partitions: {partitions}")
        print(f"Predicate: {predicate}")
        print(f"Files scanned: {metrics.get('num_target_files_scanned')}")
        print(f"Files skipped: {metrics.get('num_target_files_skipped_during_scan')}")
        print(f"Execution time: {metrics.get('execution_time_ms')} ms")
    else:
        print("metrics not generated")


def add_partition(df: pl.DataFrame, tennant: str) -> pl.DataFrame:
    print(f"Writing tennant {tennant}")
    return df.with_columns(pl.lit(tennant).alias(partitions[0]))


df = pl.DataFrame(
    {
        "id": [1, 2, 3, 4, 5, 6, 7],
        "dt": pl.Series(
            [
                "2019-09-01",
                "2019-09-02",
                "2019-09-03",
                "2019-09-04",
                "2019-09-05",
                "2019-09-06",
                "2019-09-07",
            ],
        ).str.to_date(),
        "groups": [[], ["a"], ["b"], ["a", "b"], ["c"], ["d", "e"], []],
    }
)
write(add_partition(df, "tennant1"))
write(add_partition(df, "tennant2"))

# write_deltalake("./data/delta", df)
# Load data from the delta table
dt = deltalake.DeltaTable("./data/delta")
df = pl.read_delta(dt)
print(df)

df = pl.DataFrame(
    {
        "id": [6, 7, 8, 9, 10, 11, 12],
        "dt": pl.Series(
            [
                "2029-09-01",
                "2029-09-02",
                "2029-09-03",
                "2029-09-04",
                "2029-09-05",
                "2029-09-06",
                "2029-09-07",
            ],
        ).str.to_date(),
        "groups": [[], ["a"], ["b"], ["a", "b"], ["c"], ["d", "e"], []],
    }
)
write(add_partition(df, "tennant1"))
write(add_partition(df, "tennant2"))

df = pl.read_delta(dt).sort("id")
pl.Config.set_tbl_rows(-1)
print(df)
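As a quick way to test with and without that predicate line, the data files behind each partition can be listed straight from the table, so you can see what a pruned merge would be able to skip (a sketch; it assumes file_uris(partition_filters=...) is available in the installed deltalake version):

# Sketch: inspect how many data files each tennant partition holds.
dt = deltalake.DeltaTable("./data/delta")
print(len(dt.file_uris()))  # all files in the table
print(dt.file_uris(partition_filters=[("tennant", "=", "tennant1")]))  # files for one tennant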
-
Hi.
I am facing very weird behavior when running upserts: my entire base is being loaded into memory, even with two partitions in place.
I have a "multi tenant" database that I extract one tenant at a time.
In one dataframe there is only data related to one tennant, so I expected the upsert to only load data for that tennant. I looked into optimizing-merge-performance, but something is off for me.
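For context, that guidance boils down to constraining the target side of the merge predicate so partition values and file statistics can prune files before the join; a minimal sketch (column names and the tennant value here are illustrative, taken from the example above, not from my real table):

# Illustrative only; "tennant" is the partition column from the example above.
tennant = "tennant1"

# Join-only predicate: every target file has to be scanned to find matches.
predicate = "s.id = t.id AND s.tennant = t.tennant"

# Pinning the *target* partition to a literal lets the merge skip other partitions.
predicate = f"s.id = t.id AND t.tennant = '{tennant}'"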
The structure is equivalent to:
My source code and output metrics look like this:
"polars~=1.32",
"deltalake>=1.1.4",