teable-perf-lab

Performance regression lab for Teable v2.

The current MVP runs perf cases through the existing teable-ee e2e harness. This keeps setup lightweight: GitHub Actions checks out teable-ee, injects the perf case framework, starts the same e2e Postgres/Redis services, runs the existing seed, and executes the selected cases with @teable/backend-ee in one serial job. V1 and V2 are still measured separately, but they share the same runner checkout, dependency install, database, Redis, and e2e seed setup.

This repository is intended to become the control plane for Teable performance regression validation:

define reproducible performance cases as typed case configs
run API-level end-to-end workloads through the teable-ee e2e entrypoint
persist run history, metrics, artifacts, and trace snapshots
publish manual and scheduled regression reports

The executable entrypoint is perf-lab.e2e-spec.ts. Case definitions live under cases/**/*.case.ts, and each case must have a same-name cases/**/*.md description beside it. Shared runners and artifacts live in framework/. Cases are registered in registry.ts.

Available cases:

smoke/auth-user: authenticated GET /api/auth/user/me smoke timing.
formula/10k-calc: create 10k rows, add a formula field, and verify computed values are ready.
formula/10k-5-concurrent: create 10k rows once, concurrently add 5 formula fields on the same table, and verify computed values are ready.
lookup/conditional-10k: create two 10k-row tables with permuted unique keys, add a conditional lookup on the host table, and verify each sampled row returns a different source value.
record-paste/flat-10k-4fields-copy-paste: create an empty 4-field table, paste 10k deterministic rows through PATCH /selection/paste, and verify the inserted records.
record-paste/flat-10k-20fields-copy-paste: create an empty 20-field table, paste 10k deterministic rows through PATCH /selection/paste, and verify the inserted records.
record-paste/mixed-10k-20fields-complex-copy-paste: create an empty 20-field mixed-type table, paste 10k deterministic rows through PATCH /selection/paste, and verify the typed inserted records.

For operational details, see docs/operations/teable-ee-e2e.md. The broader design remains in docs/plan.md.

Case Registry

Developer-facing case metadata is mirrored into the Teable Perf Cases table:

Base: bselS3I2MeVI6RJhS4g
Table: tbl0pa9PtLeNPCRNCKe

The table stores the case id, title, owner, tags, runner, threshold, local reproduce command, GitHub Actions reproduce command, and a GitHub URL for the case description markdown. The sync source of truth is the repository:

cases/<group>/<case>.case.ts defines executable behavior and thresholds.
cases/<group>/<case>.md explains the data setup, operation, and metric.
registry.ts decides which cases are runnable.

Run pnpm check:cases to validate the registry and markdown descriptions without writing Teable. Run pnpm sync:cases with TEABLE_PERF_LAB_TOKEN to upsert the table locally. GitHub Actions also runs Sync perf cases on pushes to main that touch case definitions, descriptions, registry, or the sync script, so the Teable table stays aligned with the repo.

Seed And Execution Boundaries

The most important design rule in this repository is: every non-trivial case has two explicit stages: seed and execute.

There are three different layers that should not be mixed together:

environment bootstrap: checkout, dependency install, Postgres/Redis startup, migrations, e2e seed, and Nest app startup
case fixture setup: large deterministic source tables and records, such as a future 100k-row table that is expensive to import
measured operation: creating or changing the formula, lookup, rollup, or other computed field and waiting until its result is correct

The first two layers may be reused or restored when the cache key is still valid. The measured operation must stay fresh for every run.

The runner architecture should therefore look like this:

resolve case
compute seed hash from case seed config + seed code
restore seed artifact by hash
if missing: run seed stage, validate fixture, save seed artifact
run execute stage against restored fixture
collect metrics, traces, and verification evidence
cleanup execute-only changes

This lets GitHub Actions, local runs, and future scheduled jobs skip expensive data import whenever the seed hash has not changed. A 100k-row fixture should be created once per hash, then reused until the case's seed config or seed code changes.

Seed Hash Contract

Each runner must define a stable seed hash for the fixture it creates. The hash is the cache key for the seed artifact.

Hash inputs should include:

case id and runner kind
the seed-relevant part of the case config: row count, fields, generator, relationships, batch size, and fixture version
the case file content when it contributes seed behavior
runner seed implementation files and shared seed helpers
the database/schema signature needed to safely restore the fixture

Hash inputs should not include:

threshold values, sample count, owner, tags, or description markdown
execute-only code that creates the measured formula, lookup, rollup, paste, or request workload
trace/reporting code

If a runner cannot separate seed code from execute code yet, use the whole runner file as a conservative hash input. That may rebuild the fixture more often, but it is still correct. Later refactors should move seed builders and execute logic into separate functions or files so the hash can be precise.

Seed artifacts must include enough metadata to prove they are safe to reuse:

caseId
runner
seedHash
fixtureVersion
recordCount
tableIds / baseId or database snapshot location
field ids and field layout
sample record ids or deterministic sample lookup rules
createdAt
schemaSignature

On a cache hit, the runner must still run a fast seedReady validation before execute: check table existence, field layout, record count, and a few sample values. If validation fails, discard the artifact and rebuild the seed.

What The Current CI Reuses

The current GitHub job runs all selected cases serially in one job. It initializes the heavy environment once:

checkout teable-ee
install dependencies
generate prisma clients
start Postgres and Redis
migrate and seed the e2e database
install perf-lab cases into the teable-ee e2e test path
run all selected perf-lab cases in the serial spec

Inside perf-lab.e2e-spec.ts, each engine initializes the Nest e2e app once and then runs all selected cases in that same app context:

engine v1 -> initApp once -> run selected cases serially
engine v2 -> initApp once -> run selected cases serially

That means database services, Redis, the e2e seed user, authentication setup, and Nest app startup are amortized across cases. They are not part of an individual case's primary metric.

When the workflow used parallel matrix jobs, it also prepared an e2e seed snapshot once:

migrate and seed e2e database
pg_dump -Fc e2e_test_teable -> seed-snapshot/e2e_test_teable.dump
upload artifact
matrix job downloads artifact
pg_restore e2e_test_teable.dump

That optimization restored the e2e seed baseline across jobs in the same workflow run. It was removed when the suite moved back to a single serial job, because the serial job no longer repeats migration and e2e seed per case/job. If matrix parallelism is reintroduced, restore this pattern before adding more per-case setup work.

Seed Stage

The existing 10k formula and conditional lookup runners currently create temporary source tables, seed deterministic records, and permanently delete the tables in finally:

perf-formula-10k-<timestamp>
perf-conditional-lookup-source-10k-<timestamp>
perf-conditional-lookup-host-10k-<timestamp>

That is acceptable for the current 10k cases, but it is not the right shape for larger import-heavy cases. For a 100k-row case, the table creation and import phase should be cacheable: the first run for a fixture key pays the cost, and later runs restore or reuse the already-built source fixture.

The seed stage should contain only the deterministic source state needed before the measured operation starts:

base/table ids or a database snapshot for the source fixture
source fields and record count
deterministic generator version and case fixture version
enough sample record ids to validate the restored fixture quickly

For cached fixtures, persist the small sample-id map with the seed artifact metadata or recover it from deterministic rows after restore. The cache should save import time, not push the runner into loading the whole dataset into memory.

Execute Stage

The execute stage is the part the case is actually measuring. It runs after seedReady, even when the seed was restored from cache.

Do not cache the computed formula, lookup, rollup, pasted records, or previously measured result unless the case is explicitly designed to measure reads from a precomputed state. For normal regression cases, remove or create a fresh measured field/workload on top of the restored seed fixture, then measure that operation and readiness.

Execute cleanup should remove only per-run measured changes:

formula, lookup, rollup, or other computed fields created for this run
temporary execute-only tables
records created by an execute workload such as paste, if the seed fixture itself must stay reusable

The cleanup step should not delete reusable seed tables on a cacheable case.

Cache Key Shape

Use a seed key that changes when the fixture shape or seed implementation changes, for example:

<case-id>:<runner>:<fixture-version>:<record-count>:<field-layout>:<generator-version>:<seed-code-hash>:<schema-signature>

The schema-signature should be tied to the database shape that the fixture depends on, not blindly to every source commit. If a looser restore key is used to pick up an older compatible fixture, the case must validate the restored table before measuring anything.

Deterministic Data Construction

Case data must be deterministic. The runner should be able to calculate the expected result locally from the row number and config.

Formula cases use deterministic numeric rows:

Title = Formula row <rowNumber>
A = rowNumber
B = (rowNumber % 97) + 1
C = rowNumber % 13

The expected formula values are computed locally, for example:

({A} * {B}) + {C}

Conditional lookup cases use a deterministic permutation:

sourceRow = ((hostRowOffset * multiplier + offset) % recordCount) + 1

The permutation multiplier must be coprime with recordCount, so each host row maps to a unique source row. This lets the full scan prove that every row gets a different expected lookup value, instead of accidentally testing repeated keys.

Seed In Batches, Keep Only Sample IDs

Large data setup is written in batches, for example 10_000 records with batchSize = 1_000. Batch timings are recorded as setup phases:

seedBatch:1
seedBatch:2
...
seedRecordsMs
maxSeedBatchMs

During seeding, runners only keep the record ids needed for sample validation:

rowOffset -> recordId / rowNumber

Do not keep all 10k records in memory just for later assertions. Full verification should page through the real read path with getRecords, usually 1_000 records at a time.

Split Setup From The Measured Operation

When adding a heavy case, split the flow into these phases:

seedHash: compute the seed cache key from case seed config and seed code.
seedRestore / seedBuild: restore a cached source fixture when available; otherwise create the source tables and insert deterministic data in batches.
seedReady / sourceReady: verify record count, field layout, sample values, and any source-side indexes or links needed by the case.
createFormulaField:* / createLookupField: trigger the operation being measured on top of that source fixture.
fullFormulaScanReady / fullLookupScanReady: page through records until every computed value matches the locally expected value.
cleanup: remove per-run measured fields or derived temporary tables. Preserve reusable source fixtures unless the fixture key is invalid or the case is intentionally testing cold import/setup.

Primary metrics should focus on the operation and readiness verification, not on setup cost:

formulaFullReadyMs = formulasReadyMs + fullFormulaScanReadyMs
conditionalLookupReadyMs = createLookupFieldMs + fullLookupScanReadyMs

Setup metrics such as createTableMs, seedRecordsMs, and maxSeedBatchMs should still be recorded. They help explain noisy runs, but they should not be mixed into the primary computed-field metric unless the case explicitly says it is measuring setup.

For cached seeds, also record seedHash, seedCacheHit, seedRestoreMs, seedBuildMs, and seedReadyMs. A cache miss is still a valid run, but the report should make it obvious that the run paid seed construction cost.

Verification Contract

Use two levels of verification:

sample verification: quick polling on a few known rows to confirm the system has started returning correct values
full scan verification: paged read of every row to prove the computed field is fully ready

If a case fails, throw a diagnostic result that still includes completed phase durations, table ids, field ids, sample records, partial metrics, and error details. A failed run should still leave enough evidence for a developer to understand whether the failure happened during setup, trigger, readiness polling, full scan, or cleanup.

Wrap important phases with withPerfTraceStep() so trace artifacts line up with case phases. The dashboard should be able to guide a developer from a trend point to the exact step that consumed time.

Adding A Case

Use this flow when adding or changing a perf case. The goal is that another developer can understand the data, reproduce the operation, and trigger the same case from Teable without reading the runner internals first.

Pick an existing runner when possible.

Available runner kinds are defined in framework/types.ts:
- http-endpoint: repeated requests against one authenticated endpoint.
- formula-table: create a temporary table, insert deterministic numeric rows, create one or more formula fields, and verify computed values.
- conditional-lookup: create source and host tables, insert deterministic key/value rows, create a conditional lookup, and verify lookup values.
- record-paste: create an empty table, paste deterministic clipboard-style content through the selection paste API, and verify inserted records.
Add a new runner only when the operation cannot be expressed by these configs. A new runner needs type support in framework/types.ts, dispatch in framework/run-perf-case.ts, and a framework/runners/*.runner.ts implementation.

Create the case file.

Put the executable case in:

cases/<group>/<case-name>.case.ts

Use definePerfCase() and keep the id stable:

import { definePerfCase } from "../../framework/types";

export default definePerfCase({
  id: "<group>/<case-name>",
  title: "Human readable title",
  runner: "formula-table",
  timeoutMs: 300_000,
  config: {
    // runner-specific config
    threshold: {
      metric: "formulaFullReadyMs",
      maxMs: 60_000,
    },
  },
});

Case id rules:

Match the path: cases/formula/10k-calc.case.ts uses formula/10k-calc.
Do not rename an existing id unless you intentionally want a new Teable registry row and new historical grouping.
Prefer deterministic data generators and fixed row counts so V1/V2 and repeated runs are comparable.

Add the description markdown beside the case.

Every case must have the same-name markdown file:
```
cases/<group>/<case-name>.md
```
Start it with frontmatter:
```
---
owner: backend-v2
tags:
  - formula
  - computed
  - 10k
  - v1-v2
enabled: true
---
```
The body should include these sections:
- Goal: what regression this case is meant to catch.
- Seed Phase: tables, fields, row counts, generators, relationships, and which config/code is expected to affect the seed hash.
- Execute Phase: ordered steps that run after seed restore/build, including the operation being measured and execute-only cleanup.
- Primary Metric: the metric used for threshold comparison.
- Notes: useful debugging hints, phase names, or known tradeoffs.
If the case is intentionally cold-starting data import instead of reusing a seed artifact, say that explicitly in Seed Phase.
Register the case.

Add an import and include it in the cases array in registry.ts. Optionally add short aliases in caseAliases for manual triggering. The registry is the runnable source of truth; a .case.ts file that is not registered will fail pnpm check:cases.
Validate locally.

From this repository:
```
pnpm check
```
This verifies formatting, workflow YAML, TypeScript syntax, and case registry consistency. The case check confirms:
- every cases/**/*.case.ts file is registered in registry.ts
- every registered case exists on disk
- every case has a same-name markdown description
- required metadata such as id, title, runner, timeout, and threshold can be parsed for Teable sync

Run the case in CI.

Use GitHub Actions when you need the real e2e environment:

gh workflow run "Teable EE e2e perf" \
  --repo teableio/teable-perf-lab \
  --ref main \
  -f teable_ee_ref=<teable-ee-branch-or-sha> \
  -f case_filter=<group>/<case-name> \
  -f engine_filter=v1,v2

The same API call is what Teable buttons or automations should use:

POST https://api.github.com/repos/teableio/teable-perf-lab/actions/workflows/teable-ee-e2e-perf.yml/dispatches

Body:

{
  "ref": "main",
  "inputs": {
    "teable_ee_ref": "<teable-ee-branch-or-sha>",
    "case_filter": "<group>/<case-name>",
    "samples": "10",
    "primary_threshold_ms": "",
    "max_parallel": "0",
    "engine_filter": "v1,v2"
  }
}

samples only controls repeated request samples for http-endpoint cases. Heavy table cases define their own scale through case config such as recordCount, formula count, lookup structure, timeout, and threshold.

Sync the Teable case registry.

Pushes to main automatically run Sync perf cases and update the Perf Cases table. For a local sync, run:
```
TEABLE_PERF_LAB_TOKEN=<token> pnpm sync:cases
```
After sync, the Teable row should show the case id, title, runner, threshold, reproduce commands, and a Description URL pointing to the markdown on the main branch.
Keep result interpretation maintainable.

When setting thresholds, prefer a value that catches real regressions without being noisy on GitHub-hosted runners. For computed-field cases, include phase names in the markdown notes so a developer can quickly tell whether time was spent in setup, field creation, readiness polling, full scan verification, or cleanup.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.github/workflows		.github/workflows
cases		cases
docs		docs
framework		framework
scripts		scripts
tasks		tasks
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
package.json		package.json
perf-lab.e2e-spec.ts		perf-lab.e2e-spec.ts
pnpm-lock.yaml		pnpm-lock.yaml
registry.ts		registry.ts
vitest-perf-lab.config.ts		vitest-perf-lab.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

teable-perf-lab

Case Registry

Seed And Execution Boundaries

Seed Hash Contract

What The Current CI Reuses

Seed Stage

Execute Stage

Cache Key Shape

Deterministic Data Construction

Seed In Batches, Keep Only Sample IDs

Split Setup From The Measured Operation

Verification Contract

Adding A Case

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

teable-perf-lab

Case Registry

Seed And Execution Boundaries

Seed Hash Contract

What The Current CI Reuses

Seed Stage

Execute Stage

Cache Key Shape

Deterministic Data Construction

Seed In Batches, Keep Only Sample IDs

Split Setup From The Measured Operation

Verification Contract

Adding A Case

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages