Update execution-backends.md (#1263)

arhamm1 · lbliii · web-flow · commit 0821304cd2ac · 2025-11-25T11:11:04.000-05:00
Signed-off-by: Arham Mehta &lt;141266146+arhamm1@users.noreply.github.com&gt;
Signed-off-by: L.B. &lt;llane@nvidia.com&gt;
Co-authored-by: L.B. &lt;llane@nvidia.com&gt;
diff --git a/docs/reference/infrastructure/execution-backends.md b/docs/reference/infrastructure/execution-backends.md
@@ -12,11 +12,27 @@ modality: "universal"
 
 # Pipeline Execution Backends
 
-Executors run NeMo Curator `Pipeline` workflows across your compute resources. This reference explains the available backends and how to configure them. It applies to all modalities (text, image, video, and audio).
+Configure and optimize execution backends to run NeMo Curator pipelines efficiently across single machines, multi-GPU systems, and distributed clusters.
 
-## How it Works
+## Overview
 
-Build your pipeline by adding stages, then run it with an executor:
+Execution backends (executors) are the engines that run NeMo Curator `Pipeline` workflows across your compute resources. They handle:
+
+- **Task Distribution**: Distribute pipeline stages across available workers and GPUs
+- **Resource Management**: Allocate CPU, GPU, and memory resources to processing tasks
+- **Scaling**: Automatically or manually scale processing based on workload
+- **Data Movement**: Optimize data transfer between pipeline stages
+
+**Choosing the right executor** impacts:
+- Pipeline performance and throughput
+- Resource utilization efficiency
+- Ease of deployment and monitoring
+
+This guide covers all execution backends available in NeMo Curator and applies to all modalities: text, image, video, and audio curation.
+
+## Basic Usage Pattern
+
+All pipelines follow this standard execution pattern:
 
 ```python
 from nemo_curator.pipeline import Pipeline
@@ -28,6 +44,11 @@ pipeline.add_stage(...)
 results = pipeline.run(executor)
 ```
 
+**Key points:**
+- The same pipeline definition works with any executor
+- Executor choice is independent of pipeline stages
+- Switch executors without changing pipeline code
+
 ## Available Backends
 
 ### `XennaExecutor` (recommended)
@@ -87,7 +108,9 @@ results = pipeline.run(executor)
 
 For more details, refer to the official [NVIDIA Cosmos-Xenna project](https://github.com/nvidia-cosmos/cosmos-xenna/tree/main).
 
-### `RayDataExecutor` (experimental)
+### `RayActorPoolExecutor`
+
+Executor using Ray Actor pools for custom distributed processing patterns such as deduplication.
 
 `RayDataExecutor` uses Ray Data, a scalable data processing library built on Ray Core. Ray Data provides a familiar DataFrame-like API for distributed data transformations. This executor is experimental and best suited for large-scale batch processing tasks that benefit from Ray Data's optimized data loading and transformation pipelines.
 
@@ -97,9 +120,9 @@ For more details, refer to the official [NVIDIA Cosmos-Xenna project](https://gi
 - **Experimental status**: API and performance characteristics may change
 
 ```python
-from nemo_curator.backends.experimental.ray_data import RayDataExecutor
+from nemo_curator.backends.experimental.ray_actor_pool import RayActorPoolExecutor
 
-executor = RayDataExecutor()
+executor = RayActorPoolExecutor()
 results = pipeline.run(executor)
 ```
 
@@ -109,9 +132,9 @@ results = pipeline.run(executor)
 ### `RayActorPoolExecutor` (experimental)
 
 ```python
-from nemo_curator.backends.experimental.ray_actor_pool import RayActorPoolExecutor
+from nemo_curator.backends.experimental.ray_data import RayDataExecutor
 
-executor = RayActorPoolExecutor()
+executor = RayDataExecutor()
 results = pipeline.run(executor)
 ```