Partition by row id #32886
Hi, I retrieve parquet files from a SQL database. Typically I partition the data by month using a time-based partitions definition, something like:

```python
@asset(partitions_def=MonthlyPartitionsDefinition(..))
def example(context, conn):
    return conn.sql(f"SELECT x FROM table WHERE date = {context.partition_key}")
```

However, in some cases the `date` column is not indexed, and retrieving the data takes a very long time. We want to partition the data by the `id` column instead, meaning we retrieve the data in batches of, for instance, 1000 rows. I would like something like:

```python
@asset(partitions_def=???)
def example(context, conn):
    return conn.sql(f"SELECT x FROM table WHERE id BETWEEN {partition_key} AND {partition_key + batch_size}")
```

Should I use static or dynamic partitioning? The issue is that I don't know the total number of partitions in advance: the row count, and therefore the partition count, will change over time. I would need to run `SELECT MAX(id)` to determine it. What do you suggest?
Replies: 1 comment
Sorry, I completely misunderstood you.
Yes, use dynamic partitions, and populate them from another asset or job. Refresh them with a sensor or something similar.
Associate partition keys with id ranges: e.g. a key covers the rows from `int(key)` up to `int(key) + batch_size`.
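A minimal sketch of the key-to-range mapping described above, in plain Python. Here each partition key is the first id of its batch, and a sensor-style function computes which new keys to register given `SELECT MAX(id)`. The names `key_to_bounds` and `new_partition_keys` are illustrative helpers, not Dagster APIs; in Dagster you would feed the returned keys into a `DynamicPartitionsDefinition` via a sensor.

```python
BATCH_SIZE = 1000

def key_to_bounds(key: str, batch_size: int = BATCH_SIZE) -> tuple[int, int]:
    """A partition key is the first id of its batch; return (lo, hi) inclusive,
    suitable for `WHERE id BETWEEN lo AND hi`."""
    lo = int(key)
    return lo, lo + batch_size - 1

def new_partition_keys(existing: set[str], max_id: int,
                       batch_size: int = BATCH_SIZE) -> list[str]:
    """Keys for every batch whose start id is <= max_id and is not yet
    registered. A sensor would run this against SELECT MAX(id) and add the
    result as dynamic partitions."""
    keys = []
    start = 0
    while start <= max_id:
        key = str(start)
        if key not in existing:
            keys.append(key)
        start += batch_size
    return keys

# Example: table currently has MAX(id) == 2500, batch "0" already registered.
print(key_to_bounds("1000"))            # (1000, 1999)
print(new_partition_keys({"0"}, 2500))  # ['1000', '2000']
```

The key stays a string (partition keys are strings in Dagster), and the inclusive upper bound avoids the off-by-one overlap that `BETWEEN key AND key + batch_size` in the original snippet would cause at batch boundaries.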