[ENH] Schedule tasks from the sysdb. #5562

rescrv · 2025-10-06T17:08:55Z

Description of changes

This PR adds a scheduler that talks to the sysdb to pull tasks.

Test plan

CI

Migration plan

N/A

Observability plan

N/A

Documentation Changes

N/A

github-actions · 2025-10-06T17:09:34Z

propel-code-bot · 2025-10-06T17:10:18Z

Add Heap Scheduler Integration between sysdb and Heap Tender (Task Scheduling Orchestration)

This PR introduces substantial orchestration between the Rust heap scheduling/compaction layer (s3heap-service) and the Go sysdb coordination layer to enable scheduled task execution based on sysdb state. It includes protocol additions (gRPC/protobuf), new sysdb APIs for task scheduling/advancement, and a full Rust integration that pulls, schedules, and acknowledges tasks from sysdb, wiring real scheduler state into HeapTender. Additionally, several tests, database DAO implementations, and minor refactors are included to fully close the end-to-end loop for distributed heap scheduling and durable task progression.

Key Changes

• Introduced AdvanceTask RPC in sysdb: Plumbed through Go protobuf, coordinator, DAO, and gRPC server.
• Added PeekScheduleByCollectionId RPC for batch scheduling lookups based on collection IDs.
• Expanded Go sysdb DAO (taskDb) and interface to support GetByID, AdvanceTask, and PeekScheduleByCollectionId, with new test coverage in task_test.go.
• Integrated sysdb-based scheduling and nonce advancement into the Rust HeapTender orchestration logic-now triggers sysdb peeks, converts schedule entries, and calls HeapWriter.
• Expanded Rust types to support task schedule entries (ScheduleEntry) and conversion from proto to type-safe representations.
• Wire up sysdb config and client into the Rust heap tender service configuration and startup flow; enforce correct sysdb mode (grpc, not sqlite).
• Schema/proto changes: Added/updated messages for AdvanceTask, PeekScheduleByCollectionId, and schedule entry representations.
• Refactored Rust S3heap schedule/triggerable types to be copyable, hashable, and comparable for efficient use in collections and deduplication logic.
• Numerous test updates and additions covering DAO correctness, schedule peeking/filtering, and end-to-end tendering.
• Cargo.toml/lock and workspace dependency updates for new sysdb/chrono dependencies and propagation.

Affected Areas

• go/pkg/sysdb/coordinator/task.go (protos/coordination; business logic)
• go/pkg/sysdb/metastore/db/dao/task.go and related DAO/test files
• idl/chromadb/proto/coordinator.proto (protobuf API surface)
• rust/s3heap-service/ (src/lib.rs - HeapTender logic; config; startup)
• rust/sysdb/ (sysdb-side proto bindings, error handling, and peeks)
• rust/types/ and rust/s3heap/ (task schedule and heap types, S3heap scheduler interface)
• rust/s3heap/tests/ and rust/s3heap-service/tests/ (integration and unit tests)
• Cargo.toml and Cargo.lock files (dependency tree)

This summary was automatically generated by @propel-code-bot

propel-code-bot · 2025-10-06T17:20:20Z

go/pkg/sysdb/coordinator/task.go

+	// Can do both calls with a JOIN
+	task, err := s.catalog.metaDomain.TaskDb(ctx).GetByName(req.InputCollectionId, req.TaskName)
+	if err != nil {
+		return nil, err
+	}
+
+	// If task not found, return empty response
+	if task == nil {
+		return nil, common.ErrTaskNotFound
+	}
+
+	// Look up operator name from operators table
+	operator, err := s.catalog.metaDomain.OperatorDb(ctx).GetByID(task.OperatorID)
+	if err != nil {
+		log.Error("GetTaskByName: failed to get operator", zap.Error(err))
+		return nil, err
+	}
+	if operator == nil {
+		log.Error("GetTaskByName: operator not found", zap.String("operator_id", task.OperatorID.String()))
+		return nil, common.ErrOperatorNotFound
+	}


[PerformanceOptimization]

You've correctly identified a potential optimization with the comment // Can do both calls with a JOIN. To avoid making two separate database calls, you could implement a dedicated DAO method that performs a JOIN between the tasks and operators tables. This would improve efficiency and prevent potential N+1 query issues if this function is called in a loop in the future.

For example, you could add a method like GetByNameWithOperator to your ITaskDb interface:

// In dbmodel/task.go type TaskAndOperator struct { Task OperatorName string `gorm:"column:operator_name"` } // In ITaskDb interface GetByNameWithOperator(inputCollectionID string, taskName string) (*TaskAndOperator, error) // In dao/task.go func (s *taskDb) GetByNameWithOperator(inputCollectionID string, taskName string) (*dbmodel.TaskAndOperator, error) { var result dbmodel.TaskAndOperator err := s.db.Table("tasks"). Select("tasks.*, operators.operator_name"). Joins("JOIN operators ON tasks.operator_id = operators.operator_id"). Where("tasks.input_collection_id = ?", inputCollectionID). Where("tasks.task_name = ?", taskName). Where("tasks.is_deleted = ?", false). First(&result).Error if err != nil { if errors.Is(err, gorm.ErrRecordNotFound) { return nil, nil } return nil, err } return &result, nil }

Then GetTaskByName in the coordinator can be simplified to a single database call.

Context for Agents

[**PerformanceOptimization**] You've correctly identified a potential optimization with the comment `// Can do both calls with a JOIN`. To avoid making two separate database calls, you could implement a dedicated DAO method that performs a JOIN between the `tasks` and `operators` tables. This would improve efficiency and prevent potential N+1 query issues if this function is called in a loop in the future. For example, you could add a method like `GetByNameWithOperator` to your `ITaskDb` interface: ```go // In dbmodel/task.go type TaskAndOperator struct { Task OperatorName string `gorm:"column:operator_name"` } // In ITaskDb interface GetByNameWithOperator(inputCollectionID string, taskName string) (*TaskAndOperator, error) // In dao/task.go func (s *taskDb) GetByNameWithOperator(inputCollectionID string, taskName string) (*dbmodel.TaskAndOperator, error) { var result dbmodel.TaskAndOperator err := s.db.Table("tasks"). Select("tasks.*, operators.operator_name"). Joins("JOIN operators ON tasks.operator_id = operators.operator_id"). Where("tasks.input_collection_id = ?", inputCollectionID). Where("tasks.task_name = ?", taskName). Where("tasks.is_deleted = ?", false). First(&result).Error if err != nil { if errors.Is(err, gorm.ErrRecordNotFound) { return nil, nil } return nil, err } return &result, nil } ``` Then `GetTaskByName` in the coordinator can be simplified to a single database call. File: go/pkg/sysdb/coordinator/task.go Line: 146

Not for this PR.

go/pkg/sysdb/coordinator/task.go

propel-code-bot · 2025-10-06T22:37:17Z

go/pkg/sysdb/metastore/db/migrations/20251001073000.sql

+CREATE TABLE "public"."tasks" (
+  "task_id" uuid NOT NULL,
+  "task_name" text NOT NULL,
+  "tenant_id" text NOT NULL,
+  "database_id" text NOT NULL,
+  "input_collection_id" text NOT NULL, -- Keeping these as text instead of UUID until collections.id becomes a UUID
+  "output_collection_name" text NOT NULL,
+  "operator_id" uuid NOT NULL,
+  "operator_params" jsonb NOT NULL,
+  "completion_offset" bigint NOT NULL DEFAULT 0,
+  "last_run" timestamp NULL DEFAULT NULL,
+  "next_run" timestamp NULL DEFAULT NULL,
+  "min_records_for_task" bigint NOT NULL DEFAULT 100,
+  "current_attempts" integer NOT NULL DEFAULT 0,
+  "is_alive" boolean NOT NULL DEFAULT true,
+  "is_deleted" boolean NOT NULL DEFAULT false,
+  "created_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,
+  "updated_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,
+  "task_template_parent" uuid NULL,
+  "next_nonce" UUID NOT NULL, -- UUIDv7
+  "oldest_written_nonce" UUID DEFAULT NULL, -- UUIDv7
+  PRIMARY KEY ("task_id"),
+  CONSTRAINT "unique_task_per_collection" UNIQUE ("input_collection_id", "task_name")
+);


[BestPractice]

The tasks table defines an operator_id column but lacks a foreign key constraint to the operators table. This could lead to data integrity issues if an operator is deleted, leaving tasks with orphaned operator_id references. This would cause GetTaskByName to fail when it tries to look up the operator.

To ensure referential integrity, please consider adding a foreign key constraint.

Suggested Change

Suggested change

CREATE TABLE "public"."tasks" (

"task_id" uuid NOT NULL,

"task_name" text NOT NULL,

"tenant_id" text NOT NULL,

"database_id" text NOT NULL,

"input_collection_id" text NOT NULL, -- Keeping these as text instead of UUID until collections.id becomes a UUID

"output_collection_name" text NOT NULL,

"operator_id" uuid NOT NULL,

"operator_params" jsonb NOT NULL,

"completion_offset" bigint NOT NULL DEFAULT 0,

"last_run" timestamp NULL DEFAULT NULL,

"next_run" timestamp NULL DEFAULT NULL,

"min_records_for_task" bigint NOT NULL DEFAULT 100,

"current_attempts" integer NOT NULL DEFAULT 0,

"is_alive" boolean NOT NULL DEFAULT true,

"is_deleted" boolean NOT NULL DEFAULT false,

"created_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,

"updated_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,

"task_template_parent" uuid NULL,

"next_nonce" UUID NOT NULL, -- UUIDv7

"oldest_written_nonce" UUID DEFAULT NULL, -- UUIDv7

PRIMARY KEY ("task_id"),

CONSTRAINT "unique_task_per_collection" UNIQUE ("input_collection_id", "task_name")

);

CREATE TABLE "public"."tasks" (

"task_id" uuid NOT NULL,

"task_name" text NOT NULL,

"tenant_id" text NOT NULL,

"database_id" text NOT NULL,

"input_collection_id" text NOT NULL, -- Keeping these as text instead of UUID until collections.id becomes a UUID

"output_collection_name" text NOT NULL,

"operator_id" uuid NOT NULL,

"operator_params" jsonb NOT NULL,

"completion_offset" bigint NOT NULL DEFAULT 0,

"last_run" timestamp NULL DEFAULT NULL,

"next_run" timestamp NULL DEFAULT NULL,

"min_records_for_task" bigint NOT NULL DEFAULT 100,

"current_attempts" integer NOT NULL DEFAULT 0,

"is_alive" boolean NOT NULL DEFAULT true,

"is_deleted" boolean NOT NULL DEFAULT false,

"created_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,

"updated_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,

"task_template_parent" uuid NULL,

"next_nonce" UUID NOT NULL, -- UUIDv7

"oldest_written_nonce" UUID DEFAULT NULL, -- UUIDv7

PRIMARY KEY ("task_id"),

CONSTRAINT "unique_task_per_collection" UNIQUE ("input_collection_id", "task_name"),

CONSTRAINT "fk_tasks_operator_id" FOREIGN KEY ("operator_id") REFERENCES "public"."operators" ("operator_id")

);

⚡ Committable suggestion

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Context for Agents

[**BestPractice**] The `tasks` table defines an `operator_id` column but lacks a foreign key constraint to the `operators` table. This could lead to data integrity issues if an operator is deleted, leaving tasks with orphaned `operator_id` references. This would cause `GetTaskByName` to fail when it tries to look up the operator. To ensure referential integrity, please consider adding a foreign key constraint. <details> <summary>Suggested Change</summary> ```suggestion CREATE TABLE "public"."tasks" ( "task_id" uuid NOT NULL, "task_name" text NOT NULL, "tenant_id" text NOT NULL, "database_id" text NOT NULL, "input_collection_id" text NOT NULL, -- Keeping these as text instead of UUID until collections.id becomes a UUID "output_collection_name" text NOT NULL, "operator_id" uuid NOT NULL, "operator_params" jsonb NOT NULL, "completion_offset" bigint NOT NULL DEFAULT 0, "last_run" timestamp NULL DEFAULT NULL, "next_run" timestamp NULL DEFAULT NULL, "min_records_for_task" bigint NOT NULL DEFAULT 100, "current_attempts" integer NOT NULL DEFAULT 0, "is_alive" boolean NOT NULL DEFAULT true, "is_deleted" boolean NOT NULL DEFAULT false, "created_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP, "updated_at" timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP, "task_template_parent" uuid NULL, "next_nonce" UUID NOT NULL, -- UUIDv7 "oldest_written_nonce" UUID DEFAULT NULL, -- UUIDv7 PRIMARY KEY ("task_id"), CONSTRAINT "unique_task_per_collection" UNIQUE ("input_collection_id", "task_name"), CONSTRAINT "fk_tasks_operator_id" FOREIGN KEY ("operator_id") REFERENCES "public"."operators" ("operator_id") ); ``` ⚡ **Committable suggestion** Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. </details> File: go/pkg/sysdb/metastore/db/migrations/20251001073000.sql Line: 43

go/pkg/sysdb/coordinator/task.go

propel-code-bot · 2025-10-07T17:52:05Z

rust/types/operator_codegen.rs

+fn camel_to_snake_case(s: &str) -> String {
+    let mut result = String::new();
+    for (i, ch) in s.chars().enumerate() {
+        if ch.is_uppercase() {
+            if i > 0 {
+                result.push('_');
+            }
+            result.push(ch.to_ascii_lowercase());
+        } else {
+            result.push(ch);
+        }
+    }
+    result
+}


[BestPractice]

The current implementation of camel_to_snake_case incorrectly handles acronyms in Go constant names. For example, a name like HTTPGetter would be converted to h_t_t_p_getter instead of the correct http_getter. While this works for the current RecordCounter constant, it will cause issues with future constants that use acronyms.

A more robust implementation should handle consecutive uppercase letters correctly.

Suggested Change

Suggested change

fn camel_to_snake_case(s: &str) -> String {

let mut result = String::new();

for (i, ch) in s.chars().enumerate() {

if ch.is_uppercase() {

if i > 0 {

result.push('_');

}

result.push(ch.to_ascii_lowercase());

} else {

result.push(ch);

}

}

result

}

fn camel_to_snake_case(s: &str) -> String {

let mut result = String::new();

let mut last_was_upper = false;

for (i, ch) in s.chars().enumerate() {

if ch.is_uppercase() {

if i > 0 && !last_was_upper {

result.push('_');

}

result.push(ch.to_ascii_lowercase());

last_was_upper = true;

} else {

result.push(ch);

last_was_upper = false;

}

}

result

}

⚡ Committable suggestion

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Context for Agents

[**BestPractice**] The current implementation of `camel_to_snake_case` incorrectly handles acronyms in Go constant names. For example, a name like `HTTPGetter` would be converted to `h_t_t_p_getter` instead of the correct `http_getter`. While this works for the current `RecordCounter` constant, it will cause issues with future constants that use acronyms. A more robust implementation should handle consecutive uppercase letters correctly. <details> <summary>Suggested Change</summary> ```suggestion fn camel_to_snake_case(s: &str) -> String { let mut result = String::new(); let mut last_was_upper = false; for (i, ch) in s.chars().enumerate() { if ch.is_uppercase() { if i > 0 && !last_was_upper { result.push('_'); } result.push(ch.to_ascii_lowercase()); last_was_upper = true; } else { result.push(ch); last_was_upper = false; } } result } ``` ⚡ **Committable suggestion** Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. </details> File: rust/types/operator_codegen.rs Line: 148

rust/s3heap-service/src/lib.rs

propel-code-bot · 2025-10-07T21:46:50Z

rust/s3heap-service/src/lib.rs

+            chroma_sysdb::SysDbConfig::Grpc(_) => {}
+            chroma_sysdb::SysDbConfig::Sqlite(_) => {
+                panic!("Expected grpc sysdb config, got sqlite sysdb config")
+            }


[BestPractice]

Using panic! for configuration errors can cause the service to crash abruptly on startup, which can be difficult to debug in production environments. Since this function already returns a Result, it would be more robust to return an Err variant for invalid configurations. This allows for graceful error handling and clearer error messages upon service startup failure.

Context for Agents

[**BestPractice**] Using `panic!` for configuration errors can cause the service to crash abruptly on startup, which can be difficult to debug in production environments. Since this function already returns a `Result`, it would be more robust to return an `Err` variant for invalid configurations. This allows for graceful error handling and clearer error messages upon service startup failure. File: rust/s3heap-service/src/lib.rs Line: 282

go/pkg/sysdb/coordinator/task.go

tanujnay112 · 2025-10-09T19:52:36Z

go/pkg/sysdb/coordinator/task.go

 }

+// Mark a task run as complete and set the nonce for the next task run.
+func (s *Coordinator) DoneTask(ctx context.Context, req *coordinatorpb.DoneTaskRequest) (*coordinatorpb.DoneTaskResponse, error) {


Can we put a verb in this method's name to make it clear that this modifies state?

propel-code-bot · 2025-10-09T22:32:16Z

go/pkg/sysdb/metastore/db/dao/task_test.go

+func (suite *TaskDbTestSuite) TestTaskDb_GetByID() {
+	taskID := uuid.New()
+	operatorID := dbmodel.OperatorRecordCounter
+	nextNonce, _ := uuid.NewV7()
+
+	task := &dbmodel.Task{
+		ID:                   taskID,
+		Name:                 "test-get-by-id-task",
+		OperatorID:           operatorID,
+		InputCollectionID:    "input_col_id",
+		OutputCollectionName: "output_col_name",
+		OperatorParams:       "{}",
+		TenantID:             "tenant1",
+		DatabaseID:           "db1",
+		MinRecordsForTask:    100,
+		NextNonce:            nextNonce,
+	}
+
+	err := suite.Db.Insert(task)
+	suite.Require().NoError(err)
+
+	retrieved, err := suite.Db.GetByID(taskID)
+	suite.Require().NoError(err)
+	suite.Require().NotNil(retrieved)
+	suite.Require().Equal(task.ID, retrieved.ID)
+	suite.Require().Equal(task.Name, retrieved.Name)
+	suite.Require().Equal(task.OperatorID, retrieved.OperatorID)
+
+	suite.db.Unscoped().Delete(&dbmodel.Task{}, "task_id = ?", task.ID)


[BestPractice]

Resource leak: The test creates and inserts a task but uses Unscoped().Delete() for cleanup, which bypasses GORM hooks and constraints. If the test fails before reaching cleanup, the task will remain in the database permanently. Use proper cleanup with defer or t.Cleanup():

func (suite *TaskDbTestSuite) TestTaskDb_GetByID() { // ... setup code ... // Clean up immediately after insert, regardless of test outcome suite.T().Cleanup(func() { suite.db.Unscoped().Delete(&dbmodel.Task{}, "task_id = ?", task.ID) }) err := suite.Db.Insert(task) suite.Require().NoError(err) // ... rest of test ... }

Context for Agents

[**BestPractice**] Resource leak: The test creates and inserts a task but uses `Unscoped().Delete()` for cleanup, which bypasses GORM hooks and constraints. If the test fails before reaching cleanup, the task will remain in the database permanently. Use proper cleanup with `defer` or `t.Cleanup()`: ```go func (suite *TaskDbTestSuite) TestTaskDb_GetByID() { // ... setup code ... // Clean up immediately after insert, regardless of test outcome suite.T().Cleanup(func() { suite.db.Unscoped().Delete(&dbmodel.Task{}, "task_id = ?", task.ID) }) err := suite.Db.Insert(task) suite.Require().NoError(err) // ... rest of test ... } ``` File: go/pkg/sysdb/metastore/db/dao/task_test.go Line: 343

go/pkg/sysdb/metastore/db/dao/task.go

rust/s3heap-service/src/lib.rs

tanujnay112 · 2025-10-10T08:54:54Z

go/pkg/sysdb/metastore/db/dao/task.go

+	return &task, nil
+}
+
+func (s *taskDb) FinishTask(taskID uuid.UUID, taskRunNonce uuid.UUID) error {


Should we consider generating the next nonce here instead of letting the caller supply it?

Is that not what the code is doing?

tanujnay112 · 2025-10-10T08:55:21Z

go/pkg/sysdb/metastore/db/dao/task_test.go

+
+	task := &dbmodel.Task{
+		ID:                   taskID,
+		Name:                 "test-done-task",


Naming inconsistency with FinishTask

tanujnay112 · 2025-10-10T08:55:29Z

go/pkg/sysdb/metastore/db/dao/task_test.go

+
+	task := &dbmodel.Task{
+		ID:                   taskID,
+		Name:                 "test-done-task-wrong-nonce",


propel-code-bot · 2025-10-14T19:32:15Z

rust/s3heap-service/src/lib.rs

    reader: LogReader,
    cursor: CursorStore,
-    _writer: HeapWriter,
+    writer: HeapWriter,
 }



[BestPractice]

Missing error handling: If self.sysdb.peek_schedule_by_collection_id() fails, the method returns early with ? but doesn't log the failure or provide context about which collections caused the issue. This makes debugging production failures difficult. Add error logging:

let scheduled = self .sysdb .clone() .peek_schedule_by_collection_id(&collection_ids) .await .map_err(|e| { log::error!("Failed to peek schedule for collections {:?}: {}", collection_ids, e); e })?;

Context for Agents

[**BestPractice**] Missing error handling: If `self.sysdb.peek_schedule_by_collection_id()` fails, the method returns early with `?` but doesn't log the failure or provide context about which collections caused the issue. This makes debugging production failures difficult. Add error logging: ```rust let scheduled = self .sysdb .clone() .peek_schedule_by_collection_id(&collection_ids) .await .map_err(|e| { log::error!("Failed to peek schedule for collections {:?}: {}", collection_ids, e); e })?; ``` File: rust/s3heap-service/src/lib.rs Line: 54

go/pkg/sysdb/metastore/db/dao/task.go

tanujnay112 · 2025-10-15T00:46:04Z

go/pkg/sysdb/metastore/db/dao/task.go

+	return &task, nil
+}
+
+func (s *taskDb) AdvanceTask(taskID uuid.UUID, taskRunNonce uuid.UUID) error {


FYI I'm going to remove/modify this later. The path that updates next_nonce also needs to transactionally update completion_offset.

propel-code-bot bot reviewed Oct 6, 2025

View reviewed changes

rescrv requested a review from tanujnay112 October 6, 2025 20:45

rescrv changed the base branch from main to rescrv/refactor-scheduler October 6, 2025 22:30

propel-code-bot bot reviewed Oct 6, 2025

View reviewed changes

go/pkg/sysdb/coordinator/task.go Outdated Show resolved Hide resolved

propel-code-bot bot reviewed Oct 6, 2025

View reviewed changes

rescrv force-pushed the rescrv/refactor-scheduler branch from e8da43e to f96c1d8 Compare October 7, 2025 15:56

rescrv force-pushed the rescrv/heap-scheduler branch from 5e54c6d to 52da7a6 Compare October 7, 2025 16:19

propel-code-bot bot reviewed Oct 7, 2025

View reviewed changes

go/pkg/sysdb/coordinator/task.go Outdated Show resolved Hide resolved

rescrv force-pushed the rescrv/heap-scheduler branch from 52da7a6 to 43edec4 Compare October 7, 2025 17:42

propel-code-bot bot reviewed Oct 7, 2025

View reviewed changes

Base automatically changed from rescrv/refactor-scheduler to main October 7, 2025 21:30

rescrv force-pushed the rescrv/heap-scheduler branch from 43edec4 to 4007de1 Compare October 7, 2025 21:41

propel-code-bot bot reviewed Oct 7, 2025

View reviewed changes

rust/s3heap-service/src/lib.rs Outdated Show resolved Hide resolved

propel-code-bot bot reviewed Oct 7, 2025

View reviewed changes

blacksmith-sh bot deleted a comment from rescrv Oct 7, 2025

rescrv force-pushed the rescrv/heap-scheduler branch from 4007de1 to 08647ab Compare October 8, 2025 15:37

propel-code-bot bot reviewed Oct 8, 2025

View reviewed changes

go/pkg/sysdb/coordinator/task.go Outdated Show resolved Hide resolved

tanujnay112 reviewed Oct 9, 2025

View reviewed changes

propel-code-bot bot reviewed Oct 9, 2025

View reviewed changes

go/pkg/sysdb/metastore/db/dao/task.go Show resolved Hide resolved

propel-code-bot bot reviewed Oct 9, 2025

View reviewed changes

rust/s3heap-service/src/lib.rs Show resolved Hide resolved

tanujnay112 reviewed Oct 10, 2025

View reviewed changes

propel-code-bot bot reviewed Oct 14, 2025

View reviewed changes

blacksmith-sh bot deleted a comment from rescrv Oct 14, 2025

propel-code-bot bot reviewed Oct 14, 2025

View reviewed changes

go/pkg/sysdb/metastore/db/dao/task.go Outdated Show resolved Hide resolved

rescrv force-pushed the rescrv/heap-scheduler branch from 2741ea7 to 75ff5ab Compare October 14, 2025 21:59

rescrv requested a review from tanujnay112 October 14, 2025 22:22

blacksmith-sh bot deleted a comment from rescrv Oct 14, 2025

rescrv added 8 commits October 14, 2025 16:09

[ENH] Wire up the heap scheduler from sysdb to heap tender.

6e5ace1

an rpc endpoint to mark a task "done".

463cb20

propel feedback

88050d0

pick a better verb than the universal "do"

c48418f

review: rename DoneTask to AdvanceTask and make some stuff copy.

aca6398

fix timestamp arithmetic

265ea01

try greatest operator

9300f1e

fix

1de60cc

rescrv force-pushed the rescrv/heap-scheduler branch from 75ff5ab to 1de60cc Compare October 14, 2025 23:09

tanujnay112 reviewed Oct 15, 2025

View reviewed changes

tanujnay112 approved these changes Oct 15, 2025

View reviewed changes

blacksmith-sh bot deleted a comment from rescrv Oct 15, 2025

rescrv merged commit cb190f1 into main Oct 15, 2025
113 of 117 checks passed

[ENH] Schedule tasks from the sysdb. #5562

[ENH] Schedule tasks from the sysdb. #5562

Conversation

rescrv commented Oct 6, 2025

Description of changes

Test plan

Migration plan

Observability plan

Documentation Changes

Uh oh!

github-actions bot commented Oct 6, 2025

Reviewer Checklist

Testing, Bugs, Errors, Logs, Documentation

System Compatibility

Quality

Uh oh!

propel-code-bot bot commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

propel-code-bot bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

propel-code-bot bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

propel-code-bot bot Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

propel-code-bot bot Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

propel-code-bot bot Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

propel-code-bot bot Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

propel-code-bot bot commented Oct 6, 2025 •

edited

Loading