camunda
diff --git a/‎docs/components/best-practices/architecture/sizing-benchmarks.md‎
Lines changed: 91 additions & 0 deletions b/‎docs/components/best-practices/architecture/sizing-benchmarks.md‎
Lines changed: 91 additions & 0 deletions
diff --git a/‎docs/components/best-practices/architecture/sizing-saas.md‎
Lines changed: 71 additions & 0 deletions b/‎docs/components/best-practices/architecture/sizing-saas.md‎
Lines changed: 71 additions & 0 deletions
@@ -0,0 +1,91 @@
+---
+id: sizing-benchmarks
+title: Run benchmarks
+tags:
+  - Performance
+  - Hardware
+  - Sizing
+  - Benchmarks
+description: "Run your own benchmarks to validate Camunda 8 sizing for your specific workload."
+---
+
+Run your own benchmarks to validate [Camunda 8 sizing](./sizing-your-environment.md) for your specific workload.
+
+## Reference benchmark scenario
+
+The sizing recommendations for [SaaS](sizing-saas.md) and [Self-Managed](sizing-self-managed.md) are based on a reference benchmark scenario. Your actual workload may differ significantly, so running your own benchmarks is the most reliable way to validate that your chosen configuration meets your needs.
+
+Camunda uses the following realistic benchmark scenario:
+
+- **Process model:** [bankCustomerComplaintDisputeHandling.bpmn](https://github.com/camunda/camunda/blob/main/load-tests/load-tester/src/main/resources/bpmn/realistic/bankCustomerComplaintDisputeHandling.bpmn) (a credit card fraud dispute handling process from the [Camunda Marketplace blueprint](https://marketplace.camunda.com/en-US/apps/449510/credit-card-fraud-dispute-handling)).
+- **Payload:** [realisticPayload.json](https://github.com/camunda/camunda/blob/main/load-tests/load-tester/src/main/resources/bpmn/realistic/realisticPayload.json) (~11 KB).
+- This setup produces approximately **101 tasks per second at 1 PI/s** due to internal sub-process instantiation (50 sub-process instances per root instance).
+
+:::note
+The official sizing numbers on this page are produced using the [load-tester](https://github.com/camunda/camunda/tree/main/load-tests/load-tester) tool from the Camunda monorepo.
+:::
+
+## Run your own benchmarks
+
+Use the [Camunda 8 Benchmark project (c8b)](https://github.com/camunda-community-hub/camunda-8-benchmark), a Spring Boot application, to run load tests against your cluster.
+
+### Key features
+
+- Starts process instances at a configurable rate and **automatically adjusts based on backpressure**.
+- Completes tasks that appear in the process instances.
+- **Bring your own BPMN process model and payload**, which can be provided as URLs, such as GitHub Gists.
+- **Automatic job type discovery** from BPMN files.
+- Configurable **task completion delay** to simulate real worker behavior.
+- Built-in **Prometheus metrics and Grafana dashboards** for observability.
+
+### Quick start
+
+Run the following command against your cluster:
+
+```bash
+mvn spring-boot:run
+```
+
+With Docker:
+
+```bash
+docker run camundacommunityhub/camunda-8-benchmark:main
+```
+
+Customize it with your own process and payload:
+
+```bash
+benchmark.bpmnResource=url:https://your-gist-url/your-process.bpmn
+benchmark.payloadPath=url:https://your-gist-url/your-payload.json
+benchmark.processInstanceStartRate=25
+benchmark.taskCompletionDelay=200
+```
+
+:::important
+To run meaningful benchmarks, use a **properly sized environment**. SaaS trial clusters and local developer machines have limited resources and will hit bottlenecks too early. Use either a correctly sized Camunda SaaS cluster (with help from your Camunda representative) or a properly provisioned Self-Managed Kubernetes environment.
+:::
+
+## When to benchmark
+
+Running your own benchmarks when:
+
+- Your process models or payload sizes **differ significantly** from the reference scenario.
+- **Latency or cycle time requirements** are critical to your use case.
+- You are running Optimize with **payloads larger than the reference ~11 KB** or retention periods **exceeding 6 months**. Larger payloads and longer retention amplify Elasticsearch disk consumption and Optimize import times.
+- You are **upgrading from a pre-8.8 version** and want to validate resource requirements.
+- You are using **RDBMS (PostgreSQL) as secondary storage** and want to validate throughput differences.
+
+## What to measure
+
+When running benchmarks, focus on these key metrics:
+
+- **Sustained throughput (tasks/second):** The rate your cluster can handle continuously without increasing backpressure.
+- **Backpressure rate:** Should remain below 10% for sustainable operation.
+- **Process instance latency (p99):** End-to-end time from instance creation to completion. Target depends on your SLO.
+- **Elasticsearch disk growth rate:** Helps you forecast disk capacity needs.
+- **Data availability latency:** The time between an event in the engine and its appearance in Operate/Tasklist.
+  - Note: to measure this, you have to compare the time from starting an instance and its availability in query APIs using the Orchestration Cluster REST API
+- **CPU usage and throttling:** High CPU usage or frequent throttling indicates a need for more CPU resources or additional brokers.
+- **Memory usage:** Sustained high memory usage suggests the need for larger memory limits or additional nodes.
+
+<!-- TODO: Define the exact SLO boundary used for "max throughput" in the official benchmark tables (e.g., "max sustainable throughput where backpressure remains below 10% and p99 process duration stays under 1 second"). If the exact boundary is not standardized, document the methodology. -->
@@ -0,0 +1,71 @@
+---
+id: sizing-saas
+title: Size your SaaS cluster
+tags:
+  - Performance
+  - Hardware
+  - Sizing
+  - SaaS
+description: "Select the right Camunda 8 SaaS cluster size based on your needs."
+---
+
+import Tabs from "@theme/Tabs";
+import TabItem from "@theme/TabItem";
+
+Select the right Camunda 8 SaaS cluster size based on your needs. For an overview of the factors that influence sizing, see [Size your environment](./sizing-your-environment.md).
+
+## Determine your cluster size
+
+Camunda 8 defines four [cluster sizes](/components/concepts/clusters.md#cluster-size) (1x, 2x, 3x, and 4x) you can select after choosing your [cluster type](/components/concepts/clusters.md#cluster-type).
+
+To do so, follow these steps:
+
+1. Calculate your throughput and storage requirements using the guidance in [Size your environment](./sizing-your-environment.md).
+2. Use the [sizing tables](#sizing-tables) to find the cluster size that meets your needs.
+
+:::note
+Contact your Customer Success Manager to increase the cluster size beyond 4x. This requires custom sizing and pricing.
+:::
+
+### Sizing tables
+
+| Cluster size                                         |                              1x |                              2x |                              3x |                              4x |
+| :--------------------------------------------------- | ------------------------------: | ------------------------------: | ------------------------------: | ------------------------------: |
+| Max Throughput **Tasks/day** **\***                  |                             9 M |                            18 M |                            27 M |                            36 M |
+| Max Throughput **Tasks/second** **\***               |                             100 |                             200 |                             300 |                             400 |
+| Max Throughput **Process Instances/second** **\*\*** |                               5 |                              10 |                              15 |                              20 |
+| Max Total Number of PI stored (in ES) **\*\*\***     |                           200 k |                           400 k |                           600 k |                           800 k |
+| Approximate resources provisioned **\*\*\*\***       | 11 vCPU, 22 GB mem, 192 GB disk | 22 vCPU, 44 GB mem, 384 GB disk | 33 vCPU, 66 GB mem, 576 GB disk | 44 vCPU, 88 GB mem, 768 GB disk |
+
+<!-- TODO: Validate "with Optimize" numbers against 8.9 benchmarks. The numbers above were measured with Camunda 8.8. Also confirm whether the "max throughput" boundary condition is defined as backpressure < 10% and p99 process duration < 1s, or another SLO. -->
+
+:::note
+The numbers in the tables were measured using Camunda 8 (version 8.8), [the benchmark project](https://github.com/camunda-community-hub/camunda-8-benchmark) running on its own Kubernetes cluster, and using a [realistic process](https://github.com/camunda/camunda/blob/main/load-tests/load-tester/src/main/resources/bpmn/realistic/bankCustomerComplaintDisputeHandling.bpmn) with this [payload](https://github.com/camunda/camunda/blob/main/load-tests/load-tester/src/main/resources/bpmn/realistic/reducedPayload.json) (~1.4 KB). To calculate day-based metrics, an equal distribution over 24 hours is assumed.
+:::
+
+**\*** Tasks (including service, send, and user tasks, among others) completed per day are the primary metric, as this is easy to measure and strongly influences resource consumption. This number assumes a constant load throughout the day. Tasks/day and Tasks/second are scaled linearly.
+
+**\*\*** Because tasks are the primary resource driver, the number of process instances supported by a cluster is calculated assuming an average of 10 tasks per process. As a customers, you can calculate a more accurate process instance estimate using your anticipated number of tasks per process.
+
+**\*\*\*** Maximum total number of historical process instances within the retention period.
+For active process instances, this is limited mostly by Zeebe resources; for historical instances, it is limited mostly by Elasticsearch resources. Calculated assuming a typical set of process variables per process instance. Note that it makes a difference whether you add one or two strings (requiring ~1 KB of space) to your process instances or attach a full JSON document containing 1 MB, as this data must be stored in various places, influencing memory and disk requirements. If this number increases, you can still retain the runtime throughput, but Tasklist, Operate, and/or Optimize may lag behind.
+The provisioned disk size is calculated as the sum of the disk size used by Zeebe and Elasticsearch.
+
+**\*\*\*\*** These are the resource limits configured in the Kubernetes cluster and are subject to change.
+
+## Data retention
+
+The maximum throughput numbers should be considered peak loads, and the data retention configuration considered when defining the amount of data kept for completed instances in your cluster. See [Camunda 8 SaaS data retention](/components/saas/data-retention.md) for the default retention times for Zeebe, Tasklist, Operate, and Optimize.
+
+- If process instances are completed and older than the configured retention time for an application, the data is removed.
+- If a process instance is older than the configured retention time but still active and incomplete, it continues to function at runtime and is _not_ removed.
+
+Camunda can adjust data retention on request (up to certain limits). Consider retention time adjustments and/or storage capacity increases if you plan to run more than \[max PI stored in ES\] / \[configured retention time\].
+
+:::note Why is the total number of process instances stored that low?
+This is related to the limited resources provided to Elasticsearch, which can cause performance problems when too much data is stored there. By increasing the available memory for Elasticsearch, you can also increase that number. At the same time, even with this rather low number, you can always guarantee the throughput of the core workflow engine during peak loads, as this performance is not affected. You can also increase memory for Elasticsearch later if needed.
+:::
+
+## Next steps
+
+Validate your chosen configuration by [running your own benchmarks](sizing-benchmarks.md).