You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
optimizations to global_id_offset/local_size implicit args
Compute workloads add following implicit arguments:
1. payloadHeader - 8 x i32 packing global_id_offset (3 x i32),
local_size (3 x i32) and 2 x i32 reserved.
2. enqueued_local_size - 3 x i32
local_size is never used in favour of enqueued_local_size. In the end,
payloadHeader has unused 20 bytes.
This change introduces following optimizations:
1. Reduces payloadHeader to 3 x i32, packing only global_id_offset.
Controlled with ShortImplicitPayloadHeader, disabled by default.
2. Removes global_id_offset and enqueued_local_size from finalizer
and zeinfo if arguments are unused. Controlled with
RemoveUnusedIdImplicitArguments, disabled by default.
0 commit comments