[AIE2P] Create dummy 256-bit accumulator and compose it with 512-bit to mimic vector register hierarchy
The subregister indices for vectors and accumulators do not cover the
correspondingly sized registers, which leads to undefined uses if we use the
same subregister indices for both. Having separate subregister indices solves
this problem. However, with that approach we cannot allocate a vector register
for an accumulator (or vice versa). The idea is to mimic the vector register
composition hierarchy for accumulators (only for the smaller types, because
2048-bit does not matter) by creating dummy 256-bit accumulators and reverting
the separate subregister indices. Vector and accumulator registers now have the
same lane masks.
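
The lane-mask argument can be illustrated with a toy model. This is only a
sketch under simplifying assumptions: every index name below is hypothetical
and not taken from the AIE2P backend, and each subregister index is given one
lane bit, which is simpler than what the real register info generator does.

```python
# Toy model of subregister indices and lane masks.
# All names are hypothetical; this is not the AIE2P register description.

def assign_lane_masks(index_names):
    """Give each subregister index its own single-bit lane mask."""
    return {name: 1 << bit for bit, name in enumerate(index_names)}

def register_lane_mask(subreg_indices, lane_masks):
    """A register's lane mask is the union of its subregister lanes."""
    mask = 0
    for idx in subreg_indices:
        mask |= lane_masks[idx]
    return mask

# Separate indices per register kind: the two 512-bit registers end up
# with disjoint lane masks, so they do not look interchangeable.
separate = assign_lane_masks(
    ["sub_vec_lo", "sub_vec_hi", "sub_acc_lo", "sub_acc_hi"])
vec512_separate = register_lane_mask(["sub_vec_lo", "sub_vec_hi"], separate)
acc512_separate = register_lane_mask(["sub_acc_lo", "sub_acc_hi"], separate)

# Shared indices, with a dummy 256-bit accumulator mirroring the 256-bit
# vector: both 512-bit registers are built from the same two lanes.
shared = assign_lane_masks(["sub_256_lo", "sub_256_hi"])
vec512_shared = register_lane_mask(["sub_256_lo", "sub_256_hi"], shared)
acc512_shared = register_lane_mask(["sub_256_lo", "sub_256_hi"], shared)

print(bin(vec512_separate), bin(acc512_separate))  # masks differ
print(bin(vec512_shared), bin(acc512_shared))      # masks match
```

In this model the shared indices only cover the accumulator properly because a
256-bit accumulator level exists to sit under the 512-bit one, which is the
role the dummy registers play in this commit.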