Skip to content

Conversation

@ericallam
Copy link
Member

@ericallam ericallam commented Sep 19, 2024

TaskRuns now keep track of their parent task run, their root task run, the depth in the hierarchy, whether they will be resuming their parent on completion (e.g. using triggerAndWait), and what batch they are apart of (we already had this but now it's denormalized). This allows us to add additional output to the runs.retrieve() endpoint:

{
  "id": "run_t5ahqdu9jzybvsofut73a",
  "status": "COMPLETED",
  "taskIdentifier": "task-hierarchy/grand-child-task",
  "version": "20240919.58",
  "createdAt": "2024-09-19T20:54:06.019Z",
  "updatedAt": "2024-09-19T20:54:08.225Z",
  "startedAt": "2024-09-19T20:54:06.551Z",
  "finishedAt": "2024-09-19T20:54:08.225Z",
  "payload": {
    "useWaits": true,
    "useBatch": false
  },
  "isTest": true,
  "ttl": "10m",
  "tags": [],
  "costInCents": 0,
  "baseCostInCents": 0,
  "durationMs": 0,
  "depth": 2,
  "isQueued": false,
  "isExecuting": false,
  "isCompleted": true,
  "isFailed": false,
  "isSuccess": true,
  "isCancelled": false,
  "attempts": [
    {
      "id": "attempt_jdarpfuigz2qev4i8xnzr",
      "status": "COMPLETED",
      "createdAt": "2024-09-19T20:54:06.671Z",
      "updatedAt": "2024-09-19T20:54:08.225Z",
      "startedAt": "2024-09-19T20:54:06.671Z",
      "completedAt": "2024-09-19T20:54:08.234Z"
    }
  ],
  "relatedRuns": {
    "root": {
      "id": "run_zr42bkgjmvx8jy13iowbi",
      "taskIdentifier": "task-hierarchy/root-task",
      "version": "20240919.58",
      "status": "COMPLETED",
      "createdAt": "2024-09-19T20:53:39.763Z",
      "startedAt": "2024-09-19T20:54:04.286Z",
      "updatedAt": "2024-09-19T20:54:08.940Z",
      "finishedAt": "2024-09-19T20:54:08.941Z",
      "ttl": "10m",
      "costInCents": 0,
      "baseCostInCents": 0,
      "durationMs": 0,
      "isTest": true,
      "depth": 0,
      "tags": [],
      "isQueued": false,
      "isExecuting": false,
      "isCompleted": true,
      "isFailed": false,
      "isSuccess": true,
      "isCancelled": false
    },
    "parent": {
      "id": "run_yoompzk4vract4zvhxucp",
      "taskIdentifier": "task-hierarchy/child-task",
      "version": "20240919.58",
      "status": "COMPLETED",
      "createdAt": "2024-09-19T20:54:04.825Z",
      "startedAt": "2024-09-19T20:54:05.422Z",
      "updatedAt": "2024-09-19T20:54:08.225Z",
      "finishedAt": "2024-09-19T20:54:08.242Z",
      "isRoot": false,
      "ttl": "10m",
      "costInCents": 0,
      "baseCostInCents": 0,
      "durationMs": 0,
      "isTest": true,
      "depth": 1,
      "tags": [],
      "isQueued": false,
      "isExecuting": false,
      "isCompleted": true,
      "isFailed": false,
      "isSuccess": true,
      "isCancelled": false
    },
    "children": [
      {
        "id": "run_9n1r1h2w2i53f31k1jgit",
        "taskIdentifier": "task-hierarchy/great-grand-child-task",
        "version": "20240919.58",
        "triggerFunction": "triggerAndWait",
        "status": "COMPLETED",
        "createdAt": "2024-09-19T20:54:07.149Z",
        "startedAt": "2024-09-19T20:54:07.706Z",
        "updatedAt": "2024-09-19T20:54:08.287Z",
        "finishedAt": "2024-09-19T20:54:08.321Z",
        "ttl": "10m",
        "tags": [],
        "costInCents": 0,
        "baseCostInCents": 0,
        "durationMs": 0,
        "depth": 3,
        "isTest": true,
        "isQueued": false,
        "isExecuting": false,
        "isCompleted": true,
        "isFailed": false,
        "isSuccess": true,
        "isCancelled": false
      }
    ]
  }
}

In the future we will use this data to:

  • only display root runs in the run list
  • display the task run hierarchy without having to use TaskEvent data
  • draw pretty diagrams

Summary by CodeRabbit

Release Notes

  • New Features

    • Enhanced API response to include detailed relationships between task runs, improving context and usability.
    • Introduced a structured task hierarchy with new properties such as depth, parentBatch, and relatedRuns for better task management.
    • Added optional fields in the request schema for more granular tracking of task dependencies.
  • Documentation

    • Updated OpenAPI specifications for clearer representation of run data and relationships.

@changeset-bot
Copy link

changeset-bot bot commented Sep 19, 2024

⚠️ No Changeset found

Latest commit: 013bd47

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

@coderabbitai
Copy link
Contributor

coderabbitai bot commented Sep 19, 2024

Walkthrough

The changes encompass enhancements across multiple components and services within the application, focusing on improving the handling and representation of task runs. Key modifications include the introduction of new methods and properties for data retrieval, restructuring response schemas to capture relationships among task runs, and the addition of new fields in the OpenAPI specification to support hierarchical task management. These updates enhance the overall functionality and clarity of task execution tracking.

Changes

File Path Change Summary
apps/webapp/app/presenters/v3/ApiRetrieveRunPresenter.server.ts Modified retrieveRun method to use findFirst and expanded response structure with related task run details. Added new helper functions for response formatting.
docs/v3-openapi.yaml Replaced RetrieveRunResponse with CommonRunObject and expanded response schemas for task runs.
packages/core/src/v3/schemas/api.ts Updated TriggerTaskRequestBody schema to include parentAttempt and parentBatch. Introduced TriggerFunction enum and enhanced related run structures.

Possibly related PRs

Poem

🐇 In the meadow where tasks run free,
A new ID hops with glee!
With roots and branches, they intertwine,
Task flows dance, all align.
With every leap, a clearer view,
Of runs and paths, both old and new! 🌼


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share
Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    -- I pushed a fix in commit <commit_id>, please review it.
    -- Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    -- @coderabbitai generate unit testing code for this file.
    -- @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    -- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    -- @coderabbitai read src/utils.ts and generate unit testing code.
    -- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    -- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 18

Outside diff range and nitpick comments (7)
references/v3-catalog/src/trigger/taskHierarchy.ts (2)

76-81: Ensure consistent logging across all tasks

While greatGrandChildTask logs "great-grand-child-task", consider verifying that all tasks log their execution consistently. This helps in maintaining uniformity and ease of debugging.

Optionally, you might want to standardize the log messages or include additional context if needed.


1-2: Organize imports for clarity

Currently, the imports combine both named imports and namespace imports. For better readability, consider grouping similar imports together.

- import { runs, task } from "@trigger.dev/sdk/v3";
- import { setTimeout } from "node:timers/promises";
+ import { task } from "@trigger.dev/sdk/v3";
+ import { runs } from "@trigger.dev/sdk/v3/runs";
+ import { setTimeout } from "node:timers/promises";

This separates your imports by functionality or source, enhancing code clarity.

apps/webapp/app/presenters/v3/ApiRetrieveRunPresenter.server.ts (3)

215-215: Handle potential null or undefined for taskRun.depth

When adding depth: taskRun.depth, consider that taskRun.depth may be null or undefined. To prevent potential issues, you might want to provide a default value.

Apply this change to provide a default value:

- depth: taskRun.depth,
+ depth: taskRun.depth ?? 0,

405-410: Evaluate the necessity of apiBooleanHelpersFromTaskRunStatus method

The method apiBooleanHelpersFromTaskRunStatus converts TaskRunStatus to RunStatus and then to boolean helpers. Consider if this indirection is necessary, or if it would be more efficient to directly generate the boolean helpers from TaskRunStatus.


Line range hint 25-410: Consider error handling for missing taskRun

Currently, if taskRun is not found, the method returns undefined, and a debug log is recorded. Ensure that the calling function handles this undefined case gracefully, or consider throwing an error to make the failure more explicit.

Optionally, you could throw an error:

  if (!taskRun) {
    logger.debug("Task run not found", { friendlyId, envId: env.id });

-   return undefined;
+   throw new Error(`TaskRun with friendlyId ${friendlyId} not found.`);
  }
packages/database/prisma/schema.prisma (2)

1723-1723: Typo in comment: Duplicate 'that'

The comment on line 1723 contains a duplicated word "that":
"This represents the original task that that was triggered outside of a Trigger.dev task."
Please remove the extra "that" for clarity.


2107-2107: Rename field 'TaskRun' to 'taskRuns' for consistency

In the BatchTaskRun model, the field is currently named TaskRun TaskRun[]. To follow naming conventions and improve code readability, please rename it to taskRuns TaskRun[].

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between 3dd303b and cc5b52d.

Files ignored due to path filters (1)
  • pnpm-lock.yaml is excluded by !**/pnpm-lock.yaml
Files selected for processing (16)
  • apps/webapp/app/components/runs/v3/RunInspector.tsx (1 hunks)
  • apps/webapp/app/db.server.ts (5 hunks)
  • apps/webapp/app/presenters/v3/ApiRetrieveRunPresenter.server.ts (5 hunks)
  • apps/webapp/app/presenters/v3/SpanPresenter.server.ts (1 hunks)
  • apps/webapp/app/routes/resources.orgs.$organizationSlug.projects.v3.$projectParam.runs.$runParam.spans.$spanParam/route.tsx (1 hunks)
  • apps/webapp/app/v3/services/batchTriggerTask.server.ts (1 hunks)
  • apps/webapp/app/v3/services/triggerTask.server.ts (6 hunks)
  • apps/webapp/package.json (1 hunks)
  • docs/v3-openapi.yaml (2 hunks)
  • packages/core/src/v3/schemas/api.ts (2 hunks)
  • packages/database/prisma/migrations/20240918144036_add_task_run_parent_columns/migration.sql (1 hunks)
  • packages/database/prisma/migrations/20240918150510_add_root_task_run_and_descendants/migration.sql (1 hunks)
  • packages/database/prisma/migrations/20240919203806_add_depth_to_task_run/migration.sql (1 hunks)
  • packages/database/prisma/schema.prisma (4 hunks)
  • packages/trigger-sdk/src/v3/shared.ts (2 hunks)
  • references/v3-catalog/src/trigger/taskHierarchy.ts (1 hunks)
Files skipped from review due to trivial changes (1)
  • apps/webapp/package.json
Additional comments not posted (24)
packages/database/prisma/migrations/20240919203806_add_depth_to_task_run/migration.sql (1)

1-2: LGTM! The SQL statement correctly adds the depth column to the TaskRun table.

The addition of the depth column aligns with the PR objective of introducing a hierarchy structure for task runs. Setting the column as NOT NULL with a default value of 0 ensures data integrity for existing task runs.

A few additional suggestions:

  • Ensure that any existing queries or application code that interacts with the TaskRun table are updated to handle the new depth column.
  • Consider adding database-level constraints or triggers to validate the integrity of the depth hierarchy, such as ensuring that a task run's depth is always greater than its parent's depth.
  • Update the relevant documentation or data dictionary to reflect the new depth column and its purpose.
packages/database/prisma/migrations/20240918150510_add_root_task_run_and_descendants/migration.sql (2)

1-2: LGTM!

The SQL statement correctly adds a new column named rootTaskRunId of type TEXT to the TaskRun table. This allows establishing a hierarchical relationship among task runs.


4-5: Looks good!

The SQL statement correctly adds a foreign key constraint named TaskRun_rootTaskRunId_fkey to the TaskRun table. The constraint references the id column of the same table, establishing a self-referential relationship. The ON DELETE SET NULL and ON UPDATE CASCADE clauses handle the cases of deleted or updated referenced task runs appropriately, ensuring data integrity.

apps/webapp/app/v3/services/batchTriggerTask.server.ts (1)

116-116: LGTM! The change enhances the clarity and control flow of batch processing.

The new conditional assignment for the parentBatch property within the options object improves the clarity and control flow of batch processing by explicitly distinguishing between scenarios where a batch is triggered independently versus when it is dependent on another attempt.

  • When dependentAttempt is not present, parentBatch is set to batch.friendlyId, indicating that the batch is triggered independently.
  • When dependentAttempt is present, parentBatch is set to undefined, indicating that the batch is dependent on another attempt.

This change aligns with the PR objective of incorporating a hierarchy structure for task runs and provides more comprehensive data about task runs and their hierarchical relationships.

apps/webapp/app/db.server.ts (2)

1-1: Importing withOptimize from @prisma/extension-optimize is a good addition.

The withOptimize function is used to optimize the Prisma client by reducing the bundle size and improving the startup time. It does this by removing unused Prisma Client code and generating a minimal Prisma Client based on the used features.


Line range hint 98-119: The modifications to the Prisma client logging configuration look good.

The changes introduce a more flexible and granular logging setup:

  • Additional logging levels, specifically for query events, are conditionally included based on the VERBOSE_PRISMA_LOGS environment variable. This allows for more detailed logging during development or debugging when needed.
  • Removing the commented-out sections related to logging streamlines the code and focuses on the new logging strategy.

These modifications enhance the functionality of the Prisma client while improving the clarity and control over logging behavior.

Also applies to: 149-170

apps/webapp/app/presenters/v3/SpanPresenter.server.ts (1)

232-232: LGTM!

The addition of the id property to the return object is a valid enhancement. It provides a unique identifier for the run, which can be useful for tracking, referencing, or performing operations related to specific run instances. The value is correctly set to run.id, maintaining consistency with the existing code.

apps/webapp/app/components/runs/v3/RunInspector.tsx (1)

357-360: LGTM!

The new Property.Item correctly displays the "Run ID" and its value, enhancing the context provided by the RunInspector component. The change is consistent with the existing code structure and conventions.

apps/webapp/app/routes/resources.orgs.$organizationSlug.projects.v3.$projectParam.runs.$runParam.spans.$spanParam/route.tsx (1)

596-603: LGTM!

The additions of the "Run ID" and "Internal ID" properties to the Property.Table component within the RunBody component look good. They provide more visibility into the run identifiers by displaying the values of run.friendlyId and run.id respectively.

packages/trigger-sdk/src/v3/shared.ts (2)

794-794: LGTM!

The change looks good. It allows passing the parent attempt ID when triggering a task, enabling tracking of the task run hierarchy. The taskContext.ctx?.attempt.id safely accesses the attempt ID from the current task context.


865-865: Looks good!

The change is consistent with the one made in trigger_internal. It allows passing the parent attempt ID when batch triggering tasks, enabling tracking of the task run hierarchy within a batch. The taskContext.ctx?.attempt.id safely accesses the attempt ID from the current task context.

packages/database/prisma/migrations/20240918144036_add_task_run_parent_columns/migration.sql (2)

2-5: Ensure data type consistency between foreign keys and referenced keys

The new columns parentTaskRunId, parentTaskRunAttemptId, and batchId are of type TEXT. Please confirm that the id columns in the TaskRun, TaskRunAttempt, and BatchTaskRun tables are also of type TEXT. Mismatched data types between foreign keys and primary keys can lead to constraint violations and runtime errors.

To verify the data types of the referenced id columns, you can run:

#!/bin/bash
# Description: Verify that the data types of foreign keys match the primary keys.

# Retrieve schema definitions for the referenced tables.
rg --type sql 'CREATE TABLE "?(TaskRun|TaskRunAttempt|BatchTaskRun)"?' -A 20

This will display the table definitions, allowing you to check the data types of the id columns.


11-11: Verify that ON DELETE SET NULL is appropriate for foreign key constraints

The foreign key constraints for parentTaskRunId, parentTaskRunAttemptId, and batchId are set to ON DELETE SET NULL. This means that when a referenced record is deleted, the corresponding foreign key in TaskRun will be set to NULL. Please verify that this behavior aligns with the application's data integrity requirements. Depending on the business logic, you might consider using ON DELETE CASCADE to delete dependent TaskRun records automatically, or ON DELETE RESTRICT to prevent deletion of referenced records.

To examine how deletions of referenced records are handled in the codebase, you can run the following script:

This will help identify any delete operations on the referenced tables and allow you to assess whether ON DELETE SET NULL is suitable.

Also applies to: 14-14, 17-17

Verification successful

ON DELETE SET NULL is appropriate for these foreign key constraints

The current schema design using ON DELETE SET NULL for parentTaskRunId, parentTaskRunAttemptId, and batchId in the TaskRun model is appropriate. This approach allows TaskRun records to persist even when related parent runs, attempts, or batches are deleted, maintaining data integrity while allowing flexibility in relationship management.

Key points:

  • TaskRun relationships with parent runs and batches use ON DELETE SET NULL, preserving the TaskRun record if parents are deleted.
  • Other relationships in TaskRun, TaskRunAttempt, and BatchTaskRun models use ON DELETE CASCADE where appropriate, ensuring proper cleanup of directly dependent records.
  • The absence of custom deletion logic suggests reliance on database-level cascading and nullification, which is a clean and efficient approach.

This design balances data integrity with the need for flexible relationship management in a complex task execution system.

Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Search for DELETE operations on referenced tables and assess implications.

# Find where deletions of TaskRun are performed.
rg --type sql 'DELETE\s+FROM\s+"?TaskRun"?' -A 3

# Find where deletions of TaskRunAttempt are performed.
rg --type sql 'DELETE\s+FROM\s+"?TaskRunAttempt"?' -A 3

# Find where deletions of BatchTaskRun are performed.
rg --type sql 'DELETE\s+FROM\s+"?BatchTaskRun"?' -A 3

Length of output: 17019


Script:

#!/bin/bash
# Search for Prisma schema definitions and application code related to deletions

# Search for Prisma schema definitions
echo "Prisma schema definitions:"
rg --type ts 'model\s+(TaskRun|TaskRunAttempt|BatchTaskRun)' -A 10

# Search for deletion-related code in TypeScript files
echo "\nDeletion-related code:"
rg --type ts '(delete|remove)(TaskRun|TaskRunAttempt|BatchTaskRun)' -A 5

# Search for Prisma client delete operations
echo "\nPrisma client delete operations:"
rg --type ts 'prisma.*delete.*where.*TaskRun|TaskRunAttempt|BatchTaskRun' -A 5

Length of output: 45619


Script:

#!/bin/bash
# Search for specific foreign key constraints and their handling

# Look for schema definitions of TaskRun, TaskRunAttempt, and BatchTaskRun
echo "Schema definitions:"
rg --type ts 'model\s+(TaskRun|TaskRunAttempt|BatchTaskRun)' -A 20

# Search for code handling parentTaskRunId, parentTaskRunAttemptId, or batchId
echo "\nHandling of foreign key fields:"
rg --type ts '(parentTaskRunId|parentTaskRunAttemptId|batchId)' -A 5

# Look for implementation of CancelTaskRunService
echo "\nCancelTaskRunService implementation:"
rg --type ts 'class CancelTaskRunService' -A 20

# Search for any code that might be nullifying these foreign keys
echo "\nPossible nullification of foreign keys:"
rg --type ts '(parentTaskRunId|parentTaskRunAttemptId|batchId)\s*=\s*null' -A 5

Length of output: 15809


Script:

#!/bin/bash
# Search for schema definitions and deletion handling

# Look for Prisma schema definitions
echo "Prisma schema definitions:"
rg --type ts '(model\s+(TaskRun|TaskRunAttempt|BatchTaskRun)|@relation)' -A 10 -g 'schema.prisma'

# Search for any DELETE operations in SQL migrations
echo "\nDELETE operations in migrations:"
rg --type sql 'DELETE\s+FROM' -A 5

# Look for any code that might be handling deletions of related records
echo "\nDeletion handling code:"
rg --type ts '(delete|remove)(TaskRun|TaskRunAttempt|BatchTaskRun)' -A 10

Length of output: 97723

apps/webapp/app/presenters/v3/ApiRetrieveRunPresenter.server.ts (3)

25-28: Confirm the use of findFirst instead of findUnique

The query now uses findFirst to retrieve the taskRun. Ensure that this change is intentional and that handling multiple matching records by returning the first one is acceptable for your use case.


39-66: Verify the inclusion of related task runs in the query

The addition of parentTaskRun, rootTaskRun, and childRuns in the include clause enhances the data retrieved. Confirm that all necessary fields are selected and that this does not introduce any performance issues due to the increased data retrieval.


93-126: Ensure all required fields are selected in childRuns

The fields selected for childRuns seem comprehensive. Double-check that all necessary fields for your application's requirements are included, and remove any unnecessary fields to optimize the query performance.

packages/core/src/v3/schemas/api.ts (3)

71-73: Addition of parentAttempt and parentBatch fields approved

The addition of optional parentAttempt and parentBatch fields to TriggerTaskRequestBody enhances the ability to track task dependencies and hierarchy. The implementation is consistent with existing patterns and properly typed.


499-499: Addition of depth field to CommonRunFields

Including the depth field in CommonRunFields allows for tracking the hierarchical level of each run. This enhancement aligns well with the PR objectives to represent task run hierarchies.


524-528: Addition of relatedRuns field enhances run hierarchy representation

The new relatedRuns field in RetrieveRunResponse, which includes root, parent, and children, effectively structures the relationships among runs. This addition aligns with the PR objectives and improves the clarity of the response data.

apps/webapp/app/v3/services/triggerTask.server.ts (2)

111-112: LGTM

Including rootTaskRunId and depth in the select clause ensures that the complete hierarchy information of the task run is retrieved.


167-168: LGTM

Adding rootTaskRunId and depth to the select clause for dependentBatchRun aligns with the retrieval of necessary hierarchy details.

packages/database/prisma/schema.prisma (2)

2105-2106: Addition of 'createdAt' and 'updatedAt' fields

Including createdAt and updatedAt timestamps in the BatchTaskRun model is beneficial for tracking record creation and modification times.


1914-1914: Child runs relation in 'TaskRunAttempt' established correctly

The addition of childRuns in the TaskRunAttempt model correctly establishes the reverse relation to parentTaskRunAttempt in TaskRun, enabling navigation between parent attempts and their child runs.

docs/v3-openapi.yaml (1)

Line range hint 1824-1912: Refactoring with CommonRunObject enhances schema consistency

The introduction of CommonRunObject consolidates shared properties of run objects, improving reusability and maintainability across the API specification.

Comment on lines 2 to 5
ALTER TABLE "TaskRun" ADD COLUMN "batchId" TEXT,
ADD COLUMN "parentTaskRunAttemptId" TEXT,
ADD COLUMN "parentTaskRunId" TEXT,
ADD COLUMN "resumeParentOnCompletion" BOOLEAN NOT NULL DEFAULT false;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider adding indexes on all foreign key columns for improved performance

An index has been created on parentTaskRunId, which will optimize queries filtering on this column. To further enhance query performance, especially for joins and lookups involving foreign keys, consider adding indexes on the other foreign key columns: parentTaskRunAttemptId and batchId.

You can add the following indexes:

CREATE INDEX "TaskRun_parentTaskRunAttemptId_idx" ON "TaskRun"("parentTaskRunAttemptId");

CREATE INDEX "TaskRun_batchId_idx" ON "TaskRun"("batchId");

Comment on lines +4 to +32
export const rootTask = task({
id: "task-hierarchy/root-task",
run: async (
{ useWaits = true, useBatch = false }: { useWaits: boolean; useBatch: boolean },
{ ctx }
) => {
console.log("root-task");

if (useWaits) {
if (useBatch) {
await childTask.batchTriggerAndWait([{ payload: { useWaits, useBatch } }]);
} else {
await childTask.triggerAndWait({ useWaits, useBatch });
}
} else {
if (useBatch) {
await childTask.batchTrigger([{ payload: { useWaits, useBatch } }]);
} else {
await childTask.trigger({ useWaits, useBatch });
}
}

if (!useWaits) {
await setTimeout(10_000); // Wait for 10 seconds, all the runs will be finished by then
}

await logRunHierarchy(ctx.run.id);
},
});
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Refactor repetitive task triggering logic to improve maintainability

The rootTask, childTask, and grandChildTask have similar code blocks for triggering child tasks based on the useWaits and useBatch flags. This repetition can make the code harder to maintain and increases the risk of inconsistencies or bugs.

Consider extracting the task triggering logic into a helper function. Here's how you might implement it:

async function triggerTask(
  taskFunction: any,
  useWaits: boolean,
  useBatch: boolean,
  payload: { useWaits: boolean; useBatch: boolean }
) {
  if (useWaits) {
    if (useBatch) {
      await taskFunction.batchTriggerAndWait([{ payload }]);
    } else {
      await taskFunction.triggerAndWait(payload);
    }
  } else {
    if (useBatch) {
      await taskFunction.batchTrigger([{ payload }]);
    } else {
      await taskFunction.trigger(payload);
    }
  }
}

Then, you can simplify the run functions:

export const rootTask = task({
  id: "task-hierarchy/root-task",
  run: async (
    { useWaits = true, useBatch = false }: { useWaits: boolean; useBatch: boolean },
    { ctx }
  ) => {
    console.log("root-task");

-   if (useWaits) {
-     if (useBatch) {
-       await childTask.batchTriggerAndWait([{ payload: { useWaits, useBatch } }]);
-     } else {
-       await childTask.triggerAndWait({ useWaits, useBatch });
-     }
-   } else {
-     if (useBatch) {
-       await childTask.batchTrigger([{ payload: { useWaits, useBatch } }]);
-     } else {
-       await childTask.trigger({ useWaits, useBatch });
-     }
-   }
+   await triggerTask(childTask, useWaits, useBatch, { useWaits, useBatch });

    if (!useWaits) {
      await setTimeout(10_000); // Wait for 10 seconds, all the runs will be finished by then
    }

    await logRunHierarchy(ctx.run.id);
  },
});

Apply similar changes to childTask and grandChildTask to reduce duplication across your tasks.

Comment on lines +27 to +28
await setTimeout(10_000); // Wait for 10 seconds, all the runs will be finished by then
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Replace fixed timeout with proper task completion handling

Using a fixed delay with setTimeout(10_000) assumes that all child tasks will complete within 10 seconds. This approach can lead to unreliable behavior if tasks take longer than expected or if they're faster, unnecessarily delaying execution.

Consider implementing a mechanism to confirm that all child tasks have completed before proceeding. Since you're already leveraging task dependencies, you might:

  • Use triggerAndWait for non-batch tasks even when useWaits is false, if appropriate.
  • Implement a status check loop to verify completion:
if (!useWaits) {
  let isComplete = false;
  while (!isComplete) {
    const runStatus = await runs.getStatusOfChildTasks(ctx.run.id);
    isComplete = runStatus.every((status) => status.isCompleted);
    if (!isComplete) {
      await setTimeout(1000); // Poll every second
    }
  }
}

This ensures that you proceed only after confirming all child tasks have finished.

Comment on lines +84 to +101
runId: string,
parentTaskIdentifier?: string,
triggerFunction?: string
) {
const runData = await runs.retrieve(runId);

const indent = " ".repeat(runData.depth * 2);
const triggerInfo = triggerFunction ? ` (triggered by ${triggerFunction})` : "";
const parentInfo = parentTaskIdentifier ? ` (parent task: ${parentTaskIdentifier})` : "";

console.log(
`${indent}Level ${runData.depth}: [${runData.taskIdentifier}] run ${runData.id}${triggerInfo}${parentInfo}`
);

for (const childRun of runData.relatedRuns.children ?? []) {
await logRunHierarchy(childRun.id, runData.taskIdentifier, childRun.triggerFunction);
}
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add null-check for relatedRuns to prevent potential errors

In the logRunHierarchy function, runData.relatedRuns may be undefined if there are no related runs. Accessing runData.relatedRuns.children without checking can lead to runtime errors.

Modify your code to safely handle the absence of relatedRuns:

- for (const childRun of runData.relatedRuns.children ?? []) {
+ const children = runData.relatedRuns?.children ?? [];
+ for (const childRun of children) {
    await logRunHierarchy(childRun.id, runData.taskIdentifier, childRun.triggerFunction);
  }

This ensures that the function doesn't throw an error when there are no child runs.

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
runId: string,
parentTaskIdentifier?: string,
triggerFunction?: string
) {
const runData = await runs.retrieve(runId);
const indent = " ".repeat(runData.depth * 2);
const triggerInfo = triggerFunction ? ` (triggered by ${triggerFunction})` : "";
const parentInfo = parentTaskIdentifier ? ` (parent task: ${parentTaskIdentifier})` : "";
console.log(
`${indent}Level ${runData.depth}: [${runData.taskIdentifier}] run ${runData.id}${triggerInfo}${parentInfo}`
);
for (const childRun of runData.relatedRuns.children ?? []) {
await logRunHierarchy(childRun.id, runData.taskIdentifier, childRun.triggerFunction);
}
}
runId: string,
parentTaskIdentifier?: string,
triggerFunction?: string
) {
const runData = await runs.retrieve(runId);
const indent = " ".repeat(runData.depth * 2);
const triggerInfo = triggerFunction ? ` (triggered by ${triggerFunction})` : "";
const parentInfo = parentTaskIdentifier ? ` (parent task: ${parentTaskIdentifier})` : "";
console.log(
`${indent}Level ${runData.depth}: [${runData.taskIdentifier}] run ${runData.id}${triggerInfo}${parentInfo}`
);
const children = runData.relatedRuns?.children ?? [];
for (const childRun of children) {
await logRunHierarchy(childRun.id, runData.taskIdentifier, childRun.triggerFunction);
}
}

finishedAt: taskRun.parentTaskRun.completedAt ?? undefined,
expiredAt: taskRun.parentTaskRun.expiredAt ?? undefined,
delayedUntil: taskRun.parentTaskRun.delayUntil ?? undefined,
isRoot: taskRun.parentTaskRun.id === taskRun.rootTaskRun?.id,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Safely compare IDs in isRoot property

The expression taskRun.parentTaskRun.id === taskRun.rootTaskRun?.id may cause issues if taskRun.rootTaskRun is undefined. Consider adding a null check to prevent potential runtime errors.

Apply this change for a safer comparison:

- isRoot: taskRun.parentTaskRun.id === taskRun.rootTaskRun?.id,
+ isRoot: taskRun.rootTaskRun
+   ? taskRun.parentTaskRun.id === taskRun.rootTaskRun.id
+   : false,
Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
isRoot: taskRun.parentTaskRun.id === taskRun.rootTaskRun?.id,
isRoot: taskRun.rootTaskRun
? taskRun.parentTaskRun.id === taskRun.rootTaskRun.id
: false,

Comment on lines 1724 to 1725
rootTaskRun TaskRun? @relation("TaskRootRun", fields: [rootTaskRunId], references: [id], onDelete: SetNull)
rootTaskRunId String?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider adding an index on 'rootTaskRunId'

To optimize queries involving the rootTaskRunId field, consider adding an index on this field, similar to the index on parentTaskRunId. This can improve query performance when retrieving root task runs.

Comment on lines 1738 to 1739
parentTaskRunAttempt TaskRunAttempt? @relation("TaskParentRunAttempt", fields: [parentTaskRunAttemptId], references: [id], onDelete: SetNull)
parentTaskRunAttemptId String?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider adding an index on 'parentTaskRunAttemptId'

Adding an index on the parentTaskRunAttemptId field can enhance the performance of queries that involve parent task run attempts. This is especially useful if you frequently query or join on this field.

Comment on lines 1742 to 1743
batch BatchTaskRun? @relation(fields: [batchId], references: [id], onDelete: SetNull)
batchId String?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider adding an index on 'batchId'

For improved query efficiency when accessing batch-related data, consider adding an index on the batchId field in the TaskRun model.

required:
- id
- generator
- attempts
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Redundant attempts in required properties

The attempts property is already declared as required in CommonRunObject. Including it again in RetrieveRunResponse is redundant and can be removed.

Apply this diff to remove the redundant attempts requirement:

          required:
-           - attempts
Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
- attempts
required:

Comment on lines 1913 to 1916
depth:
type: number
example: 0
description: The depth of the run in the task run hierarchy. The root run has a depth of 0.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Change depth property type to integer

The depth property represents hierarchical levels and should be defined as type integer rather than number for accurate data modeling.

Apply the following diff to correct the type:

            depth:
-             type: number
+             type: integer
              example: 0
              description: The depth of the run in the task run hierarchy. The root run has a depth of 0.
Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
depth:
type: number
example: 0
description: The depth of the run in the task run hierarchy. The root run has a depth of 0.
depth:
type: integer
example: 0
description: The depth of the run in the task run hierarchy. The root run has a depth of 0.

@pkg-pr-new
Copy link

pkg-pr-new bot commented Sep 20, 2024

pnpm add https://pkg.pr.new/triggerdotdev/trigger.dev/@trigger.dev/build@1332
pnpm add https://pkg.pr.new/triggerdotdev/trigger.dev/@trigger.dev/core@1332
pnpm add https://pkg.pr.new/triggerdotdev/trigger.dev@1332
pnpm add https://pkg.pr.new/triggerdotdev/trigger.dev/@trigger.dev/sdk@1332

commit: 013bd47

@ericallam ericallam force-pushed the v3/task-hierarchy-relationships branch from 3ba198b to 0b984a5 Compare September 20, 2024 09:10
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between 6dac1a2 and 013bd47.

Files selected for processing (1)
  • packages/core/src/v3/schemas/api.ts (3 hunks)
Additional comments not posted (5)
packages/core/src/v3/schemas/api.ts (5)

71-71: LGTM!

The addition of the optional parentAttempt field to the TriggerTaskRequestBody schema is a valid change. It allows specifying the parent attempt ID when triggering a task, enabling the establishment of parent-child relationships between task attempts.


73-73: LGTM!

The addition of the optional parentBatch field to the TriggerTaskRequestBody schema is a valid change. It allows specifying the parent batch ID when triggering a task, enabling the establishment of parent-child relationships between task batches.


475-482: LGTM!

The introduction of the TriggerFunction enum is a valuable addition. It provides a clear specification of the types of trigger functions that can be utilized, covering both single task triggers and batch triggers. The enum improves code readability and maintainability by providing a predefined set of valid trigger function types.


510-515: LGTM!

The RelatedRunDetails schema is a well-structured representation of related run details. It inherits the common fields from CommonRunFields using the spread operator, ensuring consistency with other run-related schemas. The addition of the depth field allows specifying the depth of the related run in the hierarchy, while the triggerFunction field utilizes the previously defined TriggerFunction enum to ensure valid trigger function types are used. The optional batchId field provides the flexibility to associate the related run with a specific batch.


524-528: LGTM!

The addition of the relatedRuns field to the RetrieveRunResponse schema is a valuable enhancement. It allows representing the relationships among runs by providing fields for the root run, parent run, and an array of child runs. The use of the RelatedRunDetails type ensures consistency with the previously defined schema for related run details. This structured approach improves the clarity and usability of the response data, facilitating better management of task execution flows.

export const RelatedRunDetails = z.object({
...CommonRunFields,
depth: z.number(),
triggerFunction: z.enum(["triggerAndWait", "trigger", "batchTriggerAndWait", "batchTrigger"]),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Use the TriggerFunction enum type for the triggerFunction field.

The triggerFunction field in the RelatedRunDetails schema should utilize the previously defined TriggerFunction enum type instead of directly specifying the enum values.

Apply this diff to fix the field type:

-triggerFunction: z.enum(["triggerAndWait", "trigger", "batchTriggerAndWait", "batchTrigger"]),
+triggerFunction: TriggerFunction,
Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
triggerFunction: z.enum(["triggerAndWait", "trigger", "batchTriggerAndWait", "batchTrigger"]),
triggerFunction: TriggerFunction,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant