When querying tables with millions of files, datafusion-ducklake fetches all file metadata from the catalog database before any filtering occurs. This causes performance degradation for large catalogs.
This issue was reported in the upstream DuckLake project: duckdb/ducklake#640