The simplest Postgres workflow engine for TypeScript. Durable execution, event-driven orchestration, and automatic retries - powered entirely by PostgreSQL. No extra infrastructure. No vendor lock-in.
```bash
npm install pg-workflows
```

Most workflow engines ask you to adopt an entirely new platform - a new runtime, a new deployment target, a new bill. pg-workflows takes a different approach: if you already have PostgreSQL, you already have everything you need.
| Feature | pg-workflows | Temporal | Inngest | DBOS | pgflow |
|---|---|---|---|---|---|
| Runs on your existing Postgres | Yes | No | No | Partial | Supabase only |
| Zero extra infrastructure | Yes | No | No | No | No |
| Framework-agnostic | Yes | Yes | No | Yes | No |
| Event-driven pause/resume | Yes | Yes | Yes | No | No |
| Open source | MIT | MIT | ELv2 | MIT | Apache-2.0 |
| TypeScript-first | Yes | Via SDK | Yes | Via SDK | Yes |
- You already run PostgreSQL and want to add durable workflows without new services
- You need a lightweight, self-hosted workflow engine with zero operational overhead
- You want event-driven orchestration (pause, resume, wait for external signals)
- You're building AI agents or LLM pipelines that need durable execution, retries, and human-in-the-loop
- You're building with TypeScript/Node.js and want a native developer experience
If you need enterprise-grade features like distributed tracing or complex DAG scheduling, or if you plan to scale to millions of concurrent workflows, consider Temporal, Inngest, Trigger.dev, or DBOS.
- Durable Execution on Postgres - Workflow state is persisted in PostgreSQL. Workflows survive process crashes, restarts, and deployments.
- Step-by-Step Execution - Break complex processes into discrete, resumable steps. Each step runs exactly once, even across retries.
- Event-Driven Orchestration - Pause workflows and wait for external events with `step.waitFor()`. Resume automatically when signals arrive.
- Pause and Resume - Manually pause long-running workflows and resume them later via API.
- Built-in Retries - Automatic retries with exponential backoff at the workflow level.
- Configurable Timeouts - Set workflow-level and step-level timeouts to prevent runaway executions.
- Progress Tracking - Monitor workflow completion percentage, completed steps, and total steps in real-time.
- Input Validation - Define schemas with Zod for type-safe, validated workflow inputs.
- Built on pg-boss - Leverages the battle-tested pg-boss job queue for reliable task scheduling.
pg-workflows uses PostgreSQL as both the job queue and the state store. Under the hood:
- Define workflows as TypeScript functions with discrete steps
- Start a workflow run - the engine creates a database record and enqueues the first execution
- Execute steps one by one - each step's result is persisted before moving to the next
- Pause on `waitFor()` or `pause()` - the workflow sleeps with zero resource consumption
- Resume when an external event arrives or `resumeWorkflow()` is called
- Complete - the final result is stored and the workflow is marked as done
All state lives in PostgreSQL. No Redis. No message broker. No external scheduler. Just Postgres.
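The exactly-once step semantics can be sketched in plain TypeScript. This is a simplified in-memory model - a `Map` stands in for the Postgres state store, and `runStep` is a hypothetical helper, not the library's API:

```typescript
// Simplified model of durable step execution: results are stored
// keyed by step ID, so a replayed workflow skips completed steps.
const stepResults = new Map<string, unknown>(); // stands in for Postgres

async function runStep<T>(stepId: string, handler: () => Promise<T>): Promise<T> {
  if (stepResults.has(stepId)) {
    // Step already completed on a previous attempt - reuse the stored result.
    return stepResults.get(stepId) as T;
  }
  const result = await handler();
  stepResults.set(stepId, result); // persist before moving to the next step
  return result;
}

// Demo: the handler body runs only once, even if the "workflow" is replayed.
let executions = 0;
async function demoWorkflow() {
  return runStep('create-user', async () => {
    executions++;
    return { id: '123' };
  });
}

await demoWorkflow(); // first attempt: handler runs
await demoWorkflow(); // replay after a crash: cached result, handler skipped
console.log(executions); // 1
```

The real engine persists these results in the `workflow_runs` table, so a replay works even across process restarts.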
```bash
npm install pg-workflows pg-boss
# or
yarn add pg-workflows pg-boss
# or
bun add pg-workflows pg-boss
```

```typescript
import { WorkflowEngine, workflow } from 'pg-workflows';
import PgBoss from 'pg-boss';
import { z } from 'zod';

// Define a durable workflow
const sendWelcomeEmail = workflow(
  'send-welcome-email',
  async ({ step, input }) => {
    // Step 1: Create user record (runs exactly once)
    const user = await step.run('create-user', async () => {
      return { id: '123', email: input.email };
    });

    // Step 2: Send email (runs exactly once)
    await step.run('send-email', async () => {
      await sendEmail(user.email, 'Welcome!');
    });

    // Step 3: Wait for user confirmation (pauses the workflow)
    const confirmation = await step.waitFor('wait-confirmation', {
      eventName: 'user-confirmed',
      timeout: 24 * 60 * 60 * 1000, // 24 hours
    });

    return { success: true, user, confirmation };
  },
  {
    inputSchema: z.object({
      email: z.string().email(),
    }),
    timeout: 48 * 60 * 60 * 1000, // 48 hours
    retries: 3,
  }
);
```

Then create an engine, register the workflow, and start it:

```typescript
const boss = new PgBoss({
  connectionString: process.env.DATABASE_URL,
});

const engine = new WorkflowEngine({
  boss,
  workflows: [sendWelcomeEmail],
});

await engine.start();
```

Interact with runs from anywhere in your application:

```typescript
// Start a workflow run
const run = await engine.startWorkflow({
  workflowId: 'send-welcome-email',
  resourceId: 'user-123',
  input: { email: 'user@example.com' },
});

// Send an event to resume the waiting workflow
await engine.triggerEvent({
  runId: run.id,
  resourceId: 'user-123',
  eventName: 'user-confirmed',
  data: { confirmedAt: new Date() },
});

// Check progress
const progress = await engine.checkProgress({
  runId: run.id,
  resourceId: 'user-123',
});
console.log(`Progress: ${progress.completionPercentage}%`);
```

- AI Agents & LLM Pipelines - Build durable AI agents that survive crashes, retry on flaky LLM APIs, and pause for human-in-the-loop review. See examples below.
- User Onboarding Flows - Multi-step signup sequences with email verification, waiting for user actions, and conditional paths.
- Payment & Checkout Pipelines - Durable payment processing that survives failures, with automatic retries and event-driven confirmations.
- Background Job Orchestration - Replace fragile cron jobs with durable, observable workflows that track progress.
- Approval Workflows - Pause execution and wait for human approval events before proceeding.
- Data Processing Pipelines - ETL workflows with step-by-step execution, error handling, and progress monitoring.
AI agents and LLM pipelines are one of the best use cases for durable execution. LLM calls are slow, expensive, and unreliable - exactly the kind of work that should never be repeated unnecessarily. pg-workflows gives you:
- Cached step results - If your process crashes after a $0.50 GPT-4 call, the result is already persisted. On retry, it skips the LLM call and picks up where it left off.
- Automatic retries - LLM APIs return 429s and 500s. Built-in exponential backoff handles transient failures without custom retry logic.
- Human-in-the-loop - Pause an AI pipeline with `step.waitFor()` to wait for human review, approval, or feedback before continuing.
- Observable progress - Track which step your agent is on, how far along it is, and inspect intermediate results with `checkProgress()`.
- Long-running agents - Multi-step agents that run for minutes or hours don't need to hold a connection open. They persist state and resume.
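The automatic retries mentioned above follow the usual exponential-backoff pattern: the wait doubles after each failed attempt. A minimal sketch of the delay schedule (the base delay and cap here are illustrative, not pg-workflows' actual internals):

```typescript
// Compute retry delays that double on each attempt, up to a cap.
function backoffDelays(retries: number, baseMs = 1_000, capMs = 60_000): number[] {
  return Array.from({ length: retries }, (_, attempt) =>
    Math.min(baseMs * 2 ** attempt, capMs)
  );
}

// With retries: 3, a failing step would be retried after roughly:
console.log(backoffDelays(3)); // [1000, 2000, 4000]
```

Doubling with a cap is what makes transient 429/500 responses cheap to absorb: early retries are fast, later ones back off instead of hammering the API.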
For example, a research agent that plans, executes, and synthesizes durably:

```typescript
const researchAgent = workflow(
  'research-agent',
  async ({ step, input }) => {
    // Step 1: Plan the research (persisted - never re-runs on retry)
    const plan = await step.run('create-plan', async () => {
      return await llm.chat({
        model: 'gpt-4o',
        messages: [{ role: 'user', content: `Create a research plan for: ${input.topic}` }],
      });
    });

    // Step 2: Execute each research task durably
    const findings = [];
    for (const task of plan.tasks) {
      const result = await step.run(`research-${task.id}`, async () => {
        return await llm.chat({
          model: 'gpt-4o',
          messages: [{ role: 'user', content: `Research: ${task.description}` }],
        });
      });
      findings.push(result);
    }

    // Step 3: Synthesize results
    const report = await step.run('synthesize', async () => {
      return await llm.chat({
        model: 'gpt-4o',
        messages: [{ role: 'user', content: `Synthesize these findings: ${JSON.stringify(findings)}` }],
      });
    });

    return { plan, findings, report };
  },
  {
    retries: 3,
    timeout: 30 * 60 * 1000, // 30 minutes
  }
);
```

If the process crashes after completing 3 of 5 research tasks, the agent resumes from task 4 - no LLM calls are wasted.
A content pipeline that pauses for human review:

```typescript
const contentPipeline = workflow(
  'ai-content-pipeline',
  async ({ step, input }) => {
    // Step 1: Generate draft with AI
    const draft = await step.run('generate-draft', async () => {
      return await llm.chat({
        model: 'gpt-4o',
        messages: [{ role: 'user', content: `Write a blog post about: ${input.topic}` }],
      });
    });

    // Step 2: Pause for human review - costs nothing while waiting
    const review = await step.waitFor('human-review', {
      eventName: 'content-reviewed',
      timeout: 7 * 24 * 60 * 60 * 1000, // 7 days
    });

    // Step 3: Revise based on feedback
    if (review.approved) {
      return { status: 'published', content: draft };
    }

    const revision = await step.run('revise-draft', async () => {
      return await llm.chat({
        model: 'gpt-4o',
        messages: [
          { role: 'user', content: `Revise this draft based on feedback:\n\nDraft: ${draft}\n\nFeedback: ${review.feedback}` },
        ],
      });
    });

    return { status: 'revised', content: revision };
  },
  { retries: 3 }
);

// A reviewer approves or requests changes via your API
await engine.triggerEvent({
  runId: run.id,
  eventName: 'content-reviewed',
  data: { approved: false, feedback: 'Make the intro more engaging' },
});
```

A RAG agent whose retrieval and generation steps are each durable:

```typescript
const ragAgent = workflow(
  'rag-agent',
  async ({ step, input }) => {
    // Step 1: Generate embeddings (cached on retry)
    const embedding = await step.run('embed-query', async () => {
      return await openai.embeddings.create({
        model: 'text-embedding-3-small',
        input: input.query,
      });
    });

    // Step 2: Search vector store
    const documents = await step.run('search-docs', async () => {
      return await vectorStore.search(embedding, { topK: 10 });
    });

    // Step 3: Generate answer with context
    const answer = await step.run('generate-answer', async () => {
      return await llm.chat({
        model: 'gpt-4o',
        messages: [
          { role: 'system', content: `Answer using these documents:\n${documents.map(d => d.text).join('\n')}` },
          { role: 'user', content: input.query },
        ],
      });
    });

    // Step 4: Validate and fact-check
    const validation = await step.run('fact-check', async () => {
      return await llm.chat({
        model: 'gpt-4o',
        messages: [
          { role: 'user', content: `Fact-check this answer against the source documents. Answer: ${answer}` },
        ],
      });
    });

    return { answer, validation, sources: documents };
  },
  { retries: 3, timeout: 5 * 60 * 1000 } // 5 minutes
);
```

| Problem | Without pg-workflows | With pg-workflows |
|---|---|---|
| Process crashes mid-pipeline | All LLM calls re-run from scratch | Resumes from the last completed step |
| LLM API returns 429/500 | Manual retry logic everywhere | Automatic retries with exponential backoff |
| Human review needed | Custom polling/webhook infrastructure | `step.waitFor()` - zero resource consumption while waiting |
| Debugging failed agents | Lost intermediate state | Full timeline of every step's input/output in PostgreSQL |
| Cost control | Repeated expensive LLM calls on failure | Each LLM call runs exactly once, result cached |
| Long-running pipelines | Timeout or lost connections | Runs for hours/days, state persisted in Postgres |
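Conceptually, `step.waitFor()` turns an external signal into an awaited value. The sketch below models only the control flow with Node's in-memory `EventEmitter`; in the real engine the wait is persisted in Postgres, so the process can exit and resume later:

```typescript
import { EventEmitter } from 'node:events';

const events = new EventEmitter();

// Resolve when the named event fires, or reject after `timeoutMs`.
// This is an in-memory stand-in for the persisted wait, not the library's API.
function waitForEvent<T>(eventName: string, timeoutMs: number): Promise<T> {
  return new Promise((resolve, reject) => {
    const timer = setTimeout(
      () => reject(new Error(`timed out waiting for ${eventName}`)),
      timeoutMs
    );
    events.once(eventName, (data: T) => {
      clearTimeout(timer);
      resolve(data);
    });
  });
}

// A "workflow" pauses on the event; an external caller triggers it.
const pending = waitForEvent<{ approved: boolean }>('content-reviewed', 5_000);
events.emit('content-reviewed', { approved: true });
const review = await pending;
console.log(review.approved); // true
```

The key difference from this sketch is durability: pg-workflows stores the pending wait in the database, so no promise has to stay alive in memory while a human takes days to respond.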
A workflow is a durable function that breaks complex operations into discrete, resumable steps. Define workflows using the `workflow()` function:

```typescript
const myWorkflow = workflow(
  'workflow-id',
  async ({ step, input }) => {
    // Your workflow logic here
  },
  {
    inputSchema: z.object({ /* ... */ }),
    timeout: 60000, // milliseconds
    retries: 3,
  }
);
```

Steps are the building blocks of durable workflows. Each step is executed exactly once, even if the workflow is retried:
```typescript
await step.run('step-id', async () => {
  // This will only execute once - the result is persisted in Postgres
  return { result: 'data' };
});
```

Wait for external events to pause and resume workflows without consuming resources:
```typescript
const eventData = await step.waitFor('wait-step', {
  eventName: 'payment-completed',
  timeout: 5 * 60 * 1000, // 5 minutes
});
```

The optional `resourceId` associates a workflow run with an external entity in your application - a user, an order, a subscription, or any domain object the workflow operates on. It serves two purposes:
- Association - Links each workflow run to the business entity it belongs to, so you can query all runs for a given resource.
- Scoping - When provided, all read and write operations (get, update, pause, resume, cancel, trigger events) include `resource_id` in their database queries, ensuring you only access workflow runs that belong to that resource. This is useful for enforcing tenant isolation or ownership checks.
`resourceId` is optional on every API method. If you don't need to group or scope runs by an external entity, you can omit it entirely and use `runId` alone.
```typescript
// Start a workflow scoped to a specific user
const run = await engine.startWorkflow({
  workflowId: 'send-welcome-email',
  resourceId: 'user-123', // ties this run to user-123
  input: { email: 'user@example.com' },
});

// Later, list all workflow runs for that user
const { items } = await engine.getRuns({
  resourceId: 'user-123',
});
```

Manually pause a workflow and resume it later:
```typescript
// Pause inside a workflow
await step.pause('pause-step');

// Resume from outside the workflow
await engine.resumeWorkflow({
  runId: run.id,
  resourceId: 'resource-123',
});
```

Run steps conditionally based on earlier results:

```typescript
const conditionalWorkflow = workflow('conditional', async ({ step }) => {
  const data = await step.run('fetch-data', async () => {
    return { isPremium: true };
  });

  if (data.isPremium) {
    await step.run('premium-action', async () => {
      // Only runs for premium users
    });
  }
});
```

Process collections item by item - each iteration is its own durable step:

```typescript
const batchWorkflow = workflow('batch-process', async ({ step }) => {
  const items = await step.run('get-items', async () => {
    return [1, 2, 3, 4, 5];
  });

  for (const item of items) {
    await step.run(`process-${item}`, async () => {
      // Each item is processed durably
      return processItem(item);
    });
  }
});
```

Configure retries and timeouts for flaky operations:

```typescript
const resilientWorkflow = workflow('resilient', async ({ step }) => {
  await step.run('risky-operation', async () => {
    // Retries up to 3 times with exponential backoff
    return await riskyApiCall();
  });
}, {
  retries: 3,
  timeout: 60000,
});
```

Monitor a run's progress at any time:

```typescript
const progress = await engine.checkProgress({
  runId: run.id,
  resourceId: 'resource-123',
});

console.log({
  status: progress.status,
  completionPercentage: progress.completionPercentage,
  completedSteps: progress.completedSteps,
  totalSteps: progress.totalSteps,
});
```

Engine constructor options:

```typescript
const engine = new WorkflowEngine({
  boss: PgBoss, // Required: pg-boss instance
  workflows: WorkflowDefinition[], // Optional: register workflows on init
  logger: WorkflowLogger, // Optional: custom logger
});
```

| Method | Description |
|---|---|
| `start(asEngine?, options?)` | Start the engine and workers |
| `stop()` | Stop the engine gracefully |
| `registerWorkflow(definition)` | Register a workflow definition |
| `startWorkflow({ workflowId, resourceId?, input, options? })` | Start a new workflow run. `resourceId` optionally ties the run to an external entity (see Resource ID). |
| `pauseWorkflow({ runId, resourceId? })` | Pause a running workflow |
| `resumeWorkflow({ runId, resourceId?, options? })` | Resume a paused workflow |
| `cancelWorkflow({ runId, resourceId? })` | Cancel a workflow |
| `triggerEvent({ runId, resourceId?, eventName, data?, options? })` | Send an event to a workflow |
| `getRun({ runId, resourceId? })` | Get workflow run details |
| `checkProgress({ runId, resourceId? })` | Get workflow progress |
| `getRuns(filters)` | List workflow runs with pagination |
```typescript
workflow<I extends Parameters>(
  id: string,
  handler: (context: WorkflowContext) => Promise<unknown>,
  options?: {
    inputSchema?: I,
    timeout?: number,
    retries?: number,
  }
): WorkflowDefinition<I>
```

The context object passed to workflow handlers:
```typescript
{
  input: T, // Validated input data
  workflowId: string, // Workflow ID
  runId: string, // Unique run ID
  timeline: Record<string, unknown>, // Step execution history
  logger: WorkflowLogger, // Logger instance
  step: {
    run: <T>(stepId, handler) => Promise<T>,
    waitFor: <T>(stepId, { eventName, timeout?, schema? }) => Promise<T>,
    waitUntil: (stepId, { date }) => Promise<void>,
    pause: (stepId) => Promise<void>,
  }
}
```

Possible status values for a workflow run:

```typescript
enum WorkflowStatus {
  PENDING = 'pending',
  RUNNING = 'running',
  PAUSED = 'paused',
  COMPLETED = 'completed',
  FAILED = 'failed',
  CANCELLED = 'cancelled',
}
```

| Variable | Description | Default |
|---|---|---|
| `DATABASE_URL` | PostgreSQL connection string | required |
| `WORKFLOW_RUN_WORKERS` | Number of worker processes | 3 |
| `WORKFLOW_RUN_EXPIRE_IN_SECONDS` | Job expiration time in seconds | 300 |
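A sketch of how such variables are typically consumed at startup, with the defaults from the table applied when a variable is unset (the helper name is illustrative, not part of the library):

```typescript
// Read a numeric setting from the environment, falling back to a default.
function numberFromEnv(name: string, fallback: number): number {
  const raw = process.env[name];
  return raw === undefined ? fallback : Number(raw);
}

const workers = numberFromEnv('WORKFLOW_RUN_WORKERS', 3);
const expireInSeconds = numberFromEnv('WORKFLOW_RUN_EXPIRE_IN_SECONDS', 300);
console.log({ workers, expireInSeconds });
```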
The engine automatically runs migrations on startup to create the required tables:
- `workflow_runs` - Stores workflow execution state, step results, and the timeline. The optional `resource_id` column (indexed) associates each run with an external entity in your application. See Resource ID.
- `pgboss.*` - pg-boss job queue tables for reliable task scheduling
As championed by postgresforeverything.com, PostgreSQL is one of the most reliable, feature-rich, and cost-effective databases ever built. pg-workflows embraces this philosophy:
- One database to rule them all - Your application data and workflow state live in the same PostgreSQL instance. No distributed systems headaches.
- Battle-tested reliability - PostgreSQL's ACID transactions guarantee your workflow state is always consistent.
- Zero operational overhead - No Redis cluster to manage. No message broker to monitor. No external service to pay for.
- Full queryability - Inspect, debug, and analyze workflow runs with plain SQL.
If you're already running Postgres (and you probably should be), adding durable workflows is as simple as:
```bash
npm install pg-workflows
```

- Node.js >= 18.0.0
- PostgreSQL >= 10
- pg-boss >= 10.0.0
Special thanks to the teams behind Temporal, Inngest, Trigger.dev, and DBOS for pioneering durable execution patterns and inspiring this project.
MIT