Skip to content

Commit 009001e

Browse files
author
Helin Wang
committed
add: the master need to use a transaction to update its states to ensure lock is held
1 parent 35b5aa1 commit 009001e

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

doc/design/dist/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -116,7 +116,7 @@ When the master is started by the Kubernetes, it executes the following steps at
116116
1. Grabs a unique *master* lock in etcd, which prevents concurrent master instantiations.
117117
1. Recovers the task queues from etcd if they already exist, otherwise, the master will create them.
118118
1. Watches the trainer prefix keys `/trainer/` on etcd to find the live trainers.
119-
1. Starts dispatching the tasks to the trainers.
119+
1. Starts dispatching the tasks to the trainers, and updates task queue using an etcd transaction to ensure lock is held during the update.
120120

121121
The master process will kill itself if its etcd lease expires.
122122

0 commit comments

Comments
 (0)