Commit aff8aed

Merge pull request kubernetes#2810 from MikeSpreitzer/update-shuffle-sharding
KEP-1040: Generalize shuffle sharding for req width and extra latency
2 parents 0f32977 + 234398a commit aff8aed

File tree

1 file changed

+6
-1
lines changed
  • keps/sig-api-machinery/1040-priority-and-fairness

keps/sig-api-machinery/1040-priority-and-fairness/README.md

Lines changed: 6 additions & 1 deletion
@@ -394,7 +394,12 @@ list of queue indices excluding I[0]. I[2] is the A[2]’th entry in
 the list of queue indices excluding I[0] and I[1]. And so on.
 
 The lengths of the queues identified by I[0], I[1], … I[H-1] are
-examined, and the request is put in one of the shortest queues.
+examined, and the request is put in one of the queues holding the
+least amount of work. Originally this was just a matter of examining
+queue length. With the generalizations for width and extra latency,
+the work in a queue is the sum of the work in its waiting requests.
+The work in a request is the product of its width and its total
+estimated execution duration (including extra latency).
 
 For example, if a RequestPriority has numQueues=128 and handSize=6,
 the hash value V is converted into 6 unique queue indices plus
