feat: Refactoring the after trigger of private task #18443

KKould · 2025-07-30T03:29:48Z

I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/

Summary

PrivateTask originally used SQL to query the task_after and task_run tables to determine whether the task was the last completed task among other tasks to drive the execution of the next task.

Using SQL is relatively complex and can lead to low performance and even difficult to maintain. This PR moves this part of the after check to the meta module, simplifying the logic and making it easier to maintain.

TaskSucceededStateIdent => current task/ next task -> empty (If the key exists, it means that the before task and after task are succeeded.)
TaskDependentIdent => /DependentType (Before or After)/task_name 1

When task A is completed, check if any task C — which comes after a task B that comes before task A — is also completed.
If such a task C is completed, set both task A and task C’s is_successed to false; otherwise, set task A’s is_successed to true.

Tests

Unit Test
Logic Test
Benchmark Test
No Test - test script: test-private-task.sh

Type of change

Bug Fix (non-breaking change which fixes an issue)
New Feature (non-breaking change which adds functionality)
Breaking Change (fix or feature that could cause existing functionality not to work as expected)
Documentation Update
Refactoring
Performance Improvement
Other (please describe):

This change is

…& `clean_task_state_and_dependents`

drmingdrmer

Reviewed 10 of 15 files at r1, 6 of 8 files at r4, all commit messages.
Reviewable status: 11 of 15 files reviewed, 6 unresolved discussions (waiting on @zhang2014)

src/meta/app/src/principal/task.rs line 201 at r4 (raw file):

#[derive(Debug, Clone, PartialEq)]
pub struct TaskDependentValue(pub HashSet<String>);

If it is not a big data set and there is no performance concern, use ordered container BTreeSet instead of HashSet. HashSet yields arbitrary key orders that might mess up some tests.

src/query/management/src/task/task_mgr.rs line 577 at r4 (raw file):

        update_ops: &mut Vec<TxnOp>,
        task_names: &[String],
        dependent_ident: &TIdent<TaskDependentResource, TaskDependentKey>,

there is already an alias: TaskDependentIdent

src/query/management/src/task/task_mgr.rs line 578 at r4 (raw file):

        task_names: &[String],
        dependent_ident: &TIdent<TaskDependentResource, TaskDependentKey>,
        is_add: bool,

is_add is a little bit confusing. I kind of think of removing this method may simplify the logic and structure.

There are some shortcut utils you can use to build txn, such as TxnCondition::eq_seq:

https://github.com/datafuselabs/databend/blob/9c2271ec6daced8f145f6705df7f1ecfe3b8e2e8/src/meta/types/src/proto_ext/txn_condition_ext.rs#L30

src/query/management/src/task/task_mgr.rs line 601 at r4 (raw file):

                    dependent_ident.to_string_key(),
                    old_dependent.to_pb()?.encode_to_vec(),
                ));

eq_value does not guarantee the data is still the same. For example the value could have changed from a -> b -> a. In this case, asserting value == a does not guarantee anything and there is potential consistency issues.

Always check against seq if possible. And in fact the condition to assert the seq can be extracted out of these two match branches, which is more explicit: does not update unless it is still the same.

Code quote:

                check_ops.push(TxnCondition::eq_value(
                    dependent_ident.to_string_key(),
                    old_dependent.to_pb()?.encode_to_vec(),
                ));

src/meta/protos/proto/task.proto line 98 at r4 (raw file):

  }
  string source = 1;
  DependentType ty = 3;

Keep the fields in the order that provides alphabetic order for the struct

Suggestion:

  DependentType ty = 1;
  string source = 2;

src/meta/proto-conv/src/task_from_to_protobuf_impl.rs line 200 at r1 (raw file):

}

impl FromToProto for mt::TaskDependent {

If TaskDependent won't be used as value, it does not need to implement FromToProto

drmingdrmer

Reviewed 3 of 4 files at r5, 1 of 1 files at r6, 2 of 3 files at r9, all commit messages.
Reviewable status: 11 of 15 files reviewed, 5 unresolved discussions (waiting on @zhang2014)

src/query/management/src/task/task_mgr.rs line 384 at r10 (raw file):

    ) -> Result<Result<(), TaskError>, TaskApiError> {
        let mut check_ops = Vec::new();
        let mut update_ops = Vec::new();

Create a txn: let txn=TxnRequest::new() then use &mut txn as the parameter in clean_related_dependent.

Code quote:

        let mut check_ops = Vec::new();
        let mut update_ops = Vec::new();

src/query/management/src/task/task_mgr.rs line 389 at r10 (raw file):

            .await?;
        self.clean_related_dependent(task_name, &mut check_ops, &mut update_ops, false)
            .await?;

it seems like clean_related_dependent it self should remove with after/before edges in one call.

There is no need to do it in two step.

Code quote:

        self.clean_related_dependent(task_name, &mut check_ops, &mut update_ops, true)
            .await?;
        self.clean_related_dependent(task_name, &mut check_ops, &mut update_ops, false)
            .await?;

src/query/management/src/task/task_mgr.rs line 442 at r10 (raw file):

                TaskDependentIdent::new_generic(&self.tenant, after_dependent)
            }))
            .await?

This line is too long. Break it down into several statements to improve readability.

Code quote:

        for (target_after_ident, target_after_dependent) in self
            .kv_api
            .get_pb_vec(task_before_dependent.0.iter().map(|before| {
                let after_dependent = TaskDependentKey::new(DependentType::After, before.clone());
                TaskDependentIdent::new_generic(&self.tenant, after_dependent)
            }))
            .await?

src/query/management/src/task/task_mgr.rs line 449 at r10 (raw file):

            let mut conditions = Vec::new();
            let mut if_ops = Vec::new();

create a txn let txn = TxnRequest::new() instead of creating its fields

Code quote:

            let mut conditions = Vec::new();
            let mut if_ops = Vec::new();

src/query/management/src/task/task_mgr.rs line 564 at r10 (raw file):

                    TaskDependentIdent::new_generic(&self.tenant, target_key.clone());

                let seq_dep = self.kv_api.get_pb(&target_ident).await?;

use get_pb_vec to batch the get.

KKould self-assigned this Jul 30, 2025

github-actions bot added the pr-feature this PR introduces a new feature to the codebase label Jul 30, 2025

KKould force-pushed the refactor/private_task_after branch from 22c4674 to e1282c7 Compare July 30, 2025 03:40

KKould marked this pull request as ready for review July 30, 2025 05:15

KKould requested a review from drmingdrmer as a code owner July 30, 2025 05:15

KKould requested review from sundy-li and zhang2014 July 30, 2025 05:15

sundy-li approved these changes Aug 4, 2025

View reviewed changes

KKould force-pushed the refactor/private_task_after branch from 9b2ec1a to 6d8a7fd Compare August 4, 2025 09:24

KKould and others added 11 commits August 5, 2025 15:11

feat: impl WarehouseOptions for Private Task

7524eb5

test: add test-private-task-warehouse.sh

4d3c1d3

chore: rebase

9a5a48d

chore: rebase

a519250

refactor: transfer the after check sql to meta

67b7427

chore: rebase

096d280

chore: codefmt

ff767ae

fix: test_decode_v141_task_state & test_decode_v141_task_dependent

b909a89

chore: update_after is divided into add_afters & remove_afters …

6920877

…& `clean_task_state_and_dependents`

chore: codefmt

9808a4b

refactor: store Task's dependents in the single key

d1b1d96

KKould force-pushed the refactor/private_task_after branch from 6d8a7fd to d1b1d96 Compare August 5, 2025 07:57

KKould added 2 commits August 5, 2025 16:03

chore: codefmt

309b5f4

chore: codefmt

5a9de35

drmingdrmer requested changes Aug 5, 2025

View reviewed changes

KKould added 2 commits August 6, 2025 11:00

chore: codefmt

7ef3131

chore: codefmt

ea3f1bf

KKould requested a review from drmingdrmer August 6, 2025 04:06

KKould added 3 commits August 6, 2025 15:37

chore: codefmt

b1ea18e

chore: codefmt

8b34211

chore: codefmt

c6c7373

KKould added 2 commits August 6, 2025 17:58

chore: codefmt

804fcd6

chore: codefmt

60da863

drmingdrmer reviewed Aug 7, 2025

View reviewed changes

KKould added 7 commits August 7, 2025 12:08

chore: codefmt

ee04c4a

chore: codefmt

6132b93

chore: codefmt

0662907

chore: codefmt

1b6d063

fix: task state will be affected by get_next_ready_tasks

f10608f

chore: codefmt

6a37789

chore: codefmt

be933c3

KKould force-pushed the refactor/private_task_after branch from 968bc46 to be933c3 Compare August 8, 2025 05:54

chore: codefmt

14d3cad

zhang2014 approved these changes Aug 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Refactoring the after trigger of private task #18443

feat: Refactoring the after trigger of private task #18443

Uh oh!

KKould commented Jul 30, 2025 •

edited

Loading

Uh oh!

drmingdrmer left a comment

Uh oh!

drmingdrmer left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: Refactoring the after trigger of private task #18443

Are you sure you want to change the base?

feat: Refactoring the after trigger of private task #18443

Uh oh!

Conversation

KKould commented Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Tests

Type of change

Uh oh!

drmingdrmer left a comment

Choose a reason for hiding this comment

Uh oh!

drmingdrmer left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

KKould commented Jul 30, 2025 •

edited

Loading