WIP: Service rewrite by 1ntEgr8 · Pull Request #99 · erdos-project/erdos-scheduling-simulator

1ntEgr8 · 2024-11-04T18:23:57Z

Depends on #97, hence the larger than expected diff.

…rvice

…ling-simulator into tpch-loader

…s.path hack

…t public

…it in tpch_loader

…placement

…cheduer)

…n at timestep

Fixes an issue where if start-master.sh or start-worker.sh exits with a nonzero code, or more generally if an exception happens in Service.__enter__(), run_service_experiments.py hangs and doesn't report the exception.

When the last application is deregistered from the spark service, execute all remaining events from the simulator. This allows the final LOG_STATS event to be processed so we can calculate the SLO attainment. Unlike normal runs of the simulator, a SIMULATOR_END event is not inserted as some tasks might not have finished in the simulator and it's unclear when they will finish. The simulator is patched to allow an empty event queue in Simulator.simulate().

On a TASK_FINISH event, set the task completion time to the time of the event rather than the last time the task was stepped. Resolves a bug in the service where tasks that finish later than the simulator's profiled runtime predicts get assigned the wrong completion time.

This reverts commit a22e406.

We found that deadlines for task graphs weren't consistent between the simulator and Spark even with the same RNG seed being used, due to the fact that EventTime keeps a global RNG it uses for all of its fuzzing and both deadlines and runtime variances are fuzzed. Since in simulator runs, task deadlines are all calculated at the start and runtime variances are calculated later, and in Spark, task deadlines and runtime variances are calculated throughout the experiment lifecycle, different deadline variances are obtained between simulator and Spark runs on the same experiment. Our solution is to pass a unique RNG used just for calculating deadline variances to the fuzzer. This RNG is hardcoded with a seed of 42; this is fine for experiments but it should probably be changed to the random_seed command line flag.

Originally, the service named job graphs in the form Q<query>[<spark-app-id>], where Spark sets the app id to app-<timestamp>-<index>, while TPC-H data loader named job graphs in the form Q<query>[<index>]. This commit changes how the service names job graphs by passing the index as an argument to the TpchQuery Spark application, which will then be forwarded into the Servicer through RegisterTaskGraph as a part of the query name. RegisterTaskGraph then uses the index to name the job graph. This ensures that the job graph names are always the same between a Spark run and a simulator run, irrespective of when the task graphs are actually released during a Spark run (which can be nondeterministic). The intent is to use these names to generate deadlines for the task graphs, so that deadlines are always consistent between Spark and simulator runs. This change requires a corresponding change to tpch-spark to forward the index to the Servicer.

To avoid needless reruns, TetriSchedScheduler does not run the scheduler if there are no tasks which are not scheduled, not part of a task graph that has been previously considered, and not part of a task graph that has been cancelled. We remove this second condition to account for situations in which a task graph is considered and its tasks scheduled, but the tasks failed to be placed (for instance, if another task on the same worker finished late, taking up resources). In such cases, the task graph would not be cancelled and might still be able to be completed, so we need to run the scheduler again to try to schedule the tasks that could not be placed before.

1ntEgr8 and others added 30 commits September 23, 2024 08:43

Implement TPC-H data loader

956afcd

Bug fix: convert job graph to task graph

a25fbe8

Make loop_timeout configurable

4d06a95

Make profile path configurable

c756d17

release time handling on workload

9f6aa8b

Wrap up tpch loader implementation

be66704

scale runtime based on max number of tasks

a4d0ded

fix bug in runtime calc

8111ba6

rename optimization_passes flag to opt_passes

e172b56

add cloudlab support, fix runtime rounding bug, make rng gen match se…

90e696c

…rvice

restore tpch_utils to main version

06cf4f7

split tpch_replay config files

fcb0180

remove opt_passes flag

9014090

Merge branch 'opt-passes-flag-fix' of github.com:1ntEgr8/erdos-schedu…

fbce571

…ling-simulator into tpch-loader

update tpch_utils.py

86420f3

setup new service.py

2f09d5e

update rpc proto dir hierarchy to resolve module import issue with sy…

eba9a45

…s.path hack

checkout dhruv's version of service.py

e6a364a

make workload_loader optional, make step and get_time_until_next_even…

69876ef

…t public

implement register/deregister framework

d68772d

refactor sim time calculation

320cb34

implement RegisterWorker

43b8331

factor out __get_worker_pool

eccddc5

refactor tpch loader

e94b1f1

implement register task graph

ed510ab

add testing for service

81c4307

implement register environment ready

e1faa7f

init impl for get placements, readme with spark-erdos setup

0025f3c

WIP: service changes to handle first tpch taskgraph

a7f18e3

Fix the tick() function to dequeue all events upto n, pass runtime_un…

942ead7

…it in tpch_loader

Dhruv Garg and others added 4 commits December 2, 2024 23:49

reorder event queue priority to process scheduler events before task …

e2636f6

…placement

improvements to experiment runner

eeb4854

fix profile path in tpch loader

cb96c3e

add spark-master-ip flag

7036fcf

1ntEgr8 force-pushed the service-rewrite branch from e48ac98 to 7036fcf Compare December 3, 2024 20:31

Dhruv Garg and others added 25 commits December 4, 2024 01:21

reinstate previous eventQueue priority order (task_placement before s…

bd16310

…cheduer)

[simulator] Unschedule subtree rooted at task if task is unable to ru…

b4aceeb

…n at timestep

[service] log line to track tasks that get delayed in execution

ce582e0

sleep for some time before signalling shutdown

3831f44

hack analyze pipeline to work with tpch output

2a74838

[simulator] check task state before invoking unschedule on it

f4bbe6a

add support for tpch query partitioning

b958a8a

run_service_experiments: Log service stdout/stderr

038cc7f

run_service_experiments.py: Fix --dry_run

6f84681

run_service_experiments: Timestamp results folder

968b158

run_service_experiments: Fix hang on exception

51d6f46

Fixes an issue where if start-master.sh or start-worker.sh exits with a nonzero code, or more generally if an exception happens in Service.__enter__(), run_service_experiments.py hangs and doesn't report the exception.

remove extraneous print

7e19fd1

fix non-determinism in deadlines

a22e406

Revert "fix non-determinism in deadlines"

1a43c18

This reverts commit a22e406.

run_service_experiments: print args to service and launcher

337de56

Fix some minor errors with deadline fuzzer

ed9f03b

Deterministic task graph deadlines based on task graph names

6a1528b

tpch partitioning based on space-time analysis

f73f4cd

bucketize space-time for tpch-partitioning, cleaned up code

db90d0c

tpch partitions for 100g, 250g and diff max executors

e400726

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Service rewrite#99

WIP: Service rewrite#99
1ntEgr8 wants to merge 129 commits intoerdos-project:mainfrom
1ntEgr8:service-rewrite

1ntEgr8 commented Nov 4, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

1ntEgr8 commented Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1ntEgr8 commented Nov 4, 2024 •

edited

Loading