This provides a seamless integration between Entity Framework Core and DuckDB, allowing you to use DuckDB as your database while maintaining the familiar EF Core API.
- Simple API: Just use
UseDuckDb()similar toUseSqlite()orUseSqlServer() - Automatic SQL Translation: Converts EF Core generated SQL to DuckDB-compatible SQL
- Full EF Core Support: Works with LINQ queries, navigation properties, and all EF Core features
- Performance: Leverages DuckDB's columnar storage for fast analytical queries on Parquet files with significant performance improvements over SQLite
- Developer-Friendly API: The extension replaces fragile, manual setups with a clean, fluent API. This reduces code complexity, improves maintainability, and accelerates development.
The performance improvements are significant when using DuckDB over SQLite for analytical queries:
| Database | OData Query Performance |
|---|---|
| SQLite | 7.61s - 8.83s |
| DuckDB | 800ms - 1s |
Performance Gain: ~8-10x faster 🚀
DuckDB's columnar storage and vectorized execution engine provide substantial performance benefits for analytical workloads, making it an excellent choice for data analysis and reporting scenarios.
Ensure you have the necessary NuGet packages:
<PackageReference Include="Microsoft.EntityFrameworkCore" Version="8.0.8" />
<PackageReference Include="Microsoft.EntityFrameworkCore.Sqlite" Version="8.0.8" />
<PackageReference Include="DuckDB.NET.Data.Full" Version="1.3.2" />using EnergyExemplar.EntityFrameworkCore.DuckDb;
using Microsoft.EntityFrameworkCore;
var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=C:/data/mydb.duckdb";
opts.MemoryLimitGB = 4; // Optional: set DuckDB memory limit to 4GB
opts.FileSearchPath = "C:/data/"; // Optional: set file_search_path for relative views
})
.Options;
using var context = new MyDbContext(options);
var data = await context.MyEntities.Where(e => e.Name.StartsWith("Gen")).ToListAsync();// In your Program.cs or Startup.cs
services.AddDbContext<MyDbContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = builder.Configuration["DuckDb:ConnectionString"];
opts.MemoryLimitGB = 8;
opts.FileSearchPath = builder.Configuration["DuckDb:FileSearchPath"];
})
);var duckDbOptions = new DuckDbConnectionOptions
{
ConnectionString = "DataSource=C:/data/mydb.duckdb",
MemoryLimitGB = 16,
FileSearchPath = "C:/data/"
};
var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDb(duckDbOptions)
.Options;var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDbOnParquet("C:/data/mydata.parquet")
.Options;Tip: Change tracking is disabled by default for maximum performance. To enable it, pass
noTracking: falsetoUseDuckDbOnParquet()or setNoTracking = falseonDuckDbConnectionOptions.
Manual DbConnection management, custom interceptors, and scattered configuration.
var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=C:/data/mydb.duckdb";
})
.Options;- ConnectionString (required): Path to the DuckDB database file (e.g.,
DataSource=mydb.duckdb) - FileSearchPath (optional): Base directory for resolving relative paths in DuckDB views
- MemoryLimitGB (optional): Memory limit for DuckDB operations (in GB)
- NoTracking (optional): If true (the default), all queries will be executed with
AsNoTracking(). Set tofalseto enable change tracking.
using Microsoft.EntityFrameworkCore;
using Microsoft.Extensions.DependencyInjection;
using Microsoft.Extensions.Hosting;
using EnergyExemplar.EntityFrameworkCore.DuckDb;
var host = Host.CreateDefaultBuilder(args)
.ConfigureServices((context, services) =>
{
services.AddDbContext<MyDbContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = context.Configuration["DuckDb:ConnectionString"];
opts.MemoryLimitGB = 8;
opts.FileSearchPath = context.Configuration["DuckDb:FileSearchPath"];
})
);
services.AddScoped<IDataService, DataService>();
})
.Build();
await host.RunAsync();
public class DataService : IDataService
{
private readonly MyDbContext _context;
public DataService(MyDbContext context) => _context = context;
public async Task<List<MyEntity>> GetDataAsync()
=> await _context.MyEntities.Where(e => e.IsActive).ToListAsync();
}services.AddDbContext<InputContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=input.duckdb";
})
);
services.AddDbContext<OutputContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=output.duckdb";
})
);services.AddDbContext<MyParquetContext>(options =>
options.UseDuckDbOnParquet("C:/data/mydata.parquet")
);- Memory Issues: Set
MemoryLimitGBin the options lambda or object. - Path Issues: Ensure
FileSearchPathis set correctly for relative paths in views. - Write Operations: The integration is read-only; any write operation will throw
NotSupportedException. - Performance: DuckDB excels at analytical queries but may be slower for OLTP workloads.
- Cleaner Code: No need to manually configure interceptors and options
- Type Safety: Configuration options are strongly typed
- Consistency: Ensures all required components are properly configured
- Maintainability: Changes to DuckDB integration are centralized
- Testability: Easy to mock or substitute in tests
- The implementation uses SQLite as the base provider but intercepts and translates all SQL to DuckDB
- Query tracking is disabled by default for better performance (can be overridden via
NoTracking = false) - All DuckDB dependencies are automatically registered when using the extensions
The library includes optional EF Core conventions that can simplify your model configuration by automatically mapping entity and property names directly to table and column names.
- EntityNameAsTableNameConvention: Automatically sets table names to match entity class names
- PropertyNameAsColumnNameConvention: Automatically sets column names to match property names
- CustomConventionSetBuilder: Applies both conventions together
using EnergyExemplar.EntityFrameworkCore.DuckDb.Conventions;
using Microsoft.EntityFrameworkCore;
using Microsoft.EntityFrameworkCore.Metadata.Conventions.Infrastructure;
// Option 1: Apply conventions when configuring DbContext
var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=mydb.duckdb";
})
.ReplaceService<IProviderConventionSetBuilder, CustomConventionSetBuilder>()
.Options;
// Option 2: Apply in dependency injection
services.AddDbContext<MyDbContext>(options =>
{
options.UseDuckDb(opts =>
{
opts.ConnectionString = configuration["DuckDb:ConnectionString"];
})
.ReplaceService<IProviderConventionSetBuilder, CustomConventionSetBuilder>();
});- Reduced Boilerplate: No need to manually configure table and column names
- Consistency: Ensures naming conventions are applied uniformly across your model
- DuckDB Compatibility: Particularly useful when working with DuckDB views or Parquet files where column names must match exactly
- Simplified Migrations: When entity/property names match database schema, there's less configuration needed
Without conventions:
public class CustomerOrder
{
public int OrderId { get; set; }
public string CustomerName { get; set; }
}
// Requires manual configuration
modelBuilder.Entity<CustomerOrder>()
.ToTable("CustomerOrder")
.Property(e => e.CustomerName)
.HasColumnName("CustomerName");With conventions (automatically applied):
public class CustomerOrder
{
public int OrderId { get; set; }
public string CustomerName { get; set; }
}
// No additional configuration needed!These conventions are particularly helpful when:
- Your entity and property names already match your database schema
- You're working with existing Parquet files or DuckDB views
- You want to reduce EF Core configuration code
- You prefer convention over configuration approach
Note: These conventions are optional. If you need different table/column names than your entity/property names, simply don't apply the conventions and use standard EF Core configuration.
This project is licensed under the MIT License - see the LICENSE file for details.
This project depends on the following third-party libraries:
- Microsoft.EntityFrameworkCore - Licensed under the MIT License. See Entity Framework Core repository for details.
- Microsoft.EntityFrameworkCore.Sqlite - Licensed under the MIT License. See Entity Framework Core repository for details.
- DuckDB.NET.Data.Full - Licensed under the MIT License. See DuckDB.NET repository for details.