docs: fix outdated content in documentation (#1385)
* docs: fix outdated content in documentation
- Remove outdated etcd references (etcd backend was removed)
- Update version numbers from old versions to v51.0.0
- Fix executor-slots-policy to task-distribution with correct values
- Remove Sled database references from docker.md
- Update kubernetes.md docker tags and log output format
- Fix Python API: Ballista() -> BallistaBuilder()
- Fix scheduler-policy parameter name and default value
Co-Authored-By: Claude Opus 4.5 <[email protected]>
* chore: add CLAUDE.md to .gitignore
Co-Authored-By: Claude Opus 4.5 <[email protected]>
* style: format markdown with prettier
Co-Authored-By: Claude Opus 4.5 <[email protected]>
* docs: add benchmarking section to contributors guide
Link to benchmarks/README.md for TPC-H and performance testing instructions.
Co-Authored-By: Claude Opus 4.5 <[email protected]>
---------
Co-authored-by: Claude Opus 4.5 <[email protected]>
-| scheduler-policy | Utf8 | pull-staged | Sets the task scheduling policy for the scheduler, possible values: pull-staged, push-staged. |
-| event-loop-buffer-size | UInt32 | 10000 | Sets the event loop buffer size. for a system of high throughput, a larger value like 1000000 is recommended. |
-|executor-slots-policy| Utf8 | bias | Sets the executor slots policy for the scheduler, possible values: bias, round-robin, round-robin-local. For a cluster with single scheduler, round-robin-local is recommended.|
-| finished-job-data-clean-up-interval-seconds | UInt64 | 300 | Sets the delayed interval for cleaning up finished job data, mainly the shuffle data, 0 means the cleaning up is disabled. |
-| finished-job-state-clean-up-interval-seconds | UInt64 | 3600 | Sets the delayed interval for cleaning up finished job state stored in the backend, 0 means the cleaning up is disabled. |
+| scheduler-policy | Utf8 | pull-staged | Sets the task scheduling policy for the scheduler, possible values: pull-staged, push-staged. |
+| event-loop-buffer-size | UInt32 | 10000 | Sets the event loop buffer size. for a system of high throughput, a larger value like 1000000 is recommended. |
+|task-distribution| Utf8 | bias | Sets the task distribution policy for the scheduler, possible values: bias, round-robin, consistent-hash. |
+| finished-job-data-clean-up-interval-seconds | UInt64 | 300 | Sets the delayed interval for cleaning up finished job data, mainly the shuffle data, 0 means the cleaning up is disabled. |
+| finished-job-state-clean-up-interval-seconds | UInt64 | 3600 | Sets the delayed interval for cleaning up finished job state stored in the backend, 0 means the cleaning up is disabled. |
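As a rough illustration of the option change in this hunk, the sketch below checks a set of scheduler options against the values listed in the new `task-distribution` row. The helper name `check_task_distribution` and the dict-based options are hypothetical and are not part of Ballista; only the option names, the allowed values, and the `bias` default come from the table rows above.

```python
# Allowed values copied from the updated `task-distribution` table row.
TASK_DISTRIBUTION_VALUES = {"bias", "round-robin", "consistent-hash"}
# Option name that this change removes from the documentation.
OLD_OPTION = "executor-slots-policy"


def check_task_distribution(options: dict) -> str:
    """Return the task distribution policy (hypothetical helper, not a Ballista API)."""
    if OLD_OPTION in options:
        raise ValueError(f"'{OLD_OPTION}' has been replaced by 'task-distribution'")
    # Fall back to the documented default when the option is absent.
    value = options.get("task-distribution", "bias")
    if value not in TASK_DISTRIBUTION_VALUES:
        raise ValueError(f"unsupported task-distribution value: {value!r}")
    return value
```

A check like this would reject the old `round-robin-local` value, which no longer appears in the documented list.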
docs/source/user-guide/deployment/docker.md: 9 additions & 12 deletions
@@ -67,13 +67,10 @@ Run `docker logs CONTAINER_ID` to check the output from the process:
 
 ```
 $ docker logs a756055576f3
-2024-02-03T14:49:47.904571Z INFO main ThreadId(01) ballista_scheduler::cluster: Initializing Sled database in temp directory
-
-2024-02-03T14:49:47.924679Z INFO main ThreadId(01) ballista_scheduler::scheduler_process: Ballista v0.12.0 Scheduler listening on 0.0.0.0:50050
-2024-02-03T14:49:47.924709Z INFO main ThreadId(01) ballista_scheduler::scheduler_process: Starting Scheduler grpc server with task scheduling policy of PullStaged
-2024-02-03T14:49:47.925261Z INFO main ThreadId(01) ballista_scheduler::cluster::kv: Initializing heartbeat listener
-2024-02-03T14:49:47.925476Z INFO main ThreadId(01) ballista_scheduler::scheduler_server::query_stage_scheduler: Starting QueryStageScheduler
-2024-02-03T14:49:47.925587Z INFO tokio-runtime-worker ThreadId(47) ballista_core::event_loop: Starting the event loop query_stage
+INFO ballista_scheduler::scheduler_process: Ballista v51.0.0 Scheduler listening on 0.0.0.0:50050
+INFO ballista_scheduler::scheduler_process: Starting Scheduler grpc server with task scheduling policy of PullStaged
+INFO ballista_scheduler::scheduler_server::query_stage_scheduler: Starting QueryStageScheduler
+INFO ballista_core::event_loop: Starting the event loop query_stage
 ```
 
 ### Start Executors
@@ -99,11 +96,11 @@ Use `docker logs CONTAINER_ID` to check the output from the executor(s):
 
 ```
 $ docker logs fb8b530cee6d
-2024-02-03T14:50:24.061607Z INFO main ThreadId(01) ballista_executor::executor_process: Running with config:
-2024-02-03T14:50:24.061649Z INFO main ThreadId(01) ballista_executor::executor_process: work_dir: /tmp/.tmpAkP3pZ
-2024-02-03T14:50:24.061655Z INFO main ThreadId(01) ballista_executor::executor_process: concurrent_tasks: 48
-2024-02-03T14:50:24.063256Z INFO tokio-runtime-worker ThreadId(44) ballista_executor::executor_process: Ballista v0.12.0 Rust Executor Flight Server listening on 0.0.0.0:50051
-2024-02-03T14:50:24.063281Z INFO tokio-runtime-worker ThreadId(47) ballista_executor::execution_loop: Starting poll work loop with scheduler
+INFO ballista_executor::executor_process: Running with config:
+INFO ballista_executor::executor_process: work_dir: /tmp/.tmpAkP3pZ
+INFO ballista_executor::executor_process: concurrent_tasks: 48
+INFO ballista_executor::executor_process: Ballista v51.0.0 Rust Executor Flight Server listening on 0.0.0.0:50051
+INFO ballista_executor::execution_loop: Starting poll work loop with scheduler
-You can view the scheduler logs with `kubectl logs ballista-scheduler-0`:
+You can view the scheduler logs with `kubectl logs ballista-scheduler-<pod-id>`:
 
 ```
-$ kubectl logs ballista-scheduler-0
-[2021-02-19T00:24:01Z INFO scheduler] Ballista v0.7.0 Scheduler listening on 0.0.0.0:50050
-[2021-02-19T00:24:16Z INFO ballista::scheduler] Received register_executor request for ExecutorMetadata { id: "b5e81711-1c5c-46ec-8522-d8b359793188", host: "10.1.23.149", port: 50051 }
-[2021-02-19T00:24:17Z INFO ballista::scheduler] Received register_executor request for ExecutorMetadata { id: "816e4502-a876-4ed8-b33f-86d243dcf63f", host: "10.1.23.150", port: 50051 }
+$ kubectl logs ballista-scheduler-<pod-id>
+INFO ballista_scheduler::scheduler_process: Ballista v51.0.0 Scheduler listening on 0.0.0.0:50050
+INFO ballista_scheduler::scheduler_server::grpc: Received register_executor request for ExecutorMetadata { id: "b5e81711-1c5c-46ec-8522-d8b359793188", host: "10.1.23.149", port: 50051 }
+INFO ballista_scheduler::scheduler_server::grpc: Received register_executor request for ExecutorMetadata { id: "816e4502-a876-4ed8-b33f-86d243dcf63f", host: "10.1.23.150", port: 50051 }