feat(ggml-cpu): Add partial implementation of scale for f16
The scale op is used to zero out the state in build_rs, so F16 support is
required for F16 cache states in recurrent models. The bias route is not
hit in that case, but it would need to be implemented if used elsewhere.
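
For illustration only, a minimal standalone sketch of what a "partial" F16
scale could look like: each half-precision value is widened to F32,
multiplied, and narrowed back, while a non-zero bias is rejected as
unimplemented. The names scale_f16_sketch, f16_to_f32 and f32_to_f16 are
hypothetical and are not the ggml API; the conversions are simplified
(round half up) and the error return is an assumption about how the
unsupported bias route might be signalled.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: widen an IEEE 754 binary16 bit pattern to float. */
static float f16_to_f32(uint16_t h) {
    uint32_t sign = (uint32_t)(h & 0x8000u) << 16;
    uint32_t exp  = (h >> 10) & 0x1Fu;
    uint32_t man  = h & 0x3FFu;
    uint32_t bits;
    if (exp == 0) {
        if (man == 0) {
            bits = sign;                       /* signed zero */
        } else {
            exp = 113;                         /* normalize a subnormal */
            while ((man & 0x400u) == 0) { man <<= 1; exp--; }
            bits = sign | (exp << 23) | ((man & 0x3FFu) << 13);
        }
    } else if (exp == 31) {
        bits = sign | 0x7F800000u | (man << 13); /* inf or NaN */
    } else {
        bits = sign | ((exp + 112u) << 23) | (man << 13);
    }
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}

/* Hypothetical helper: narrow a float to binary16 (simplified rounding). */
static uint16_t f32_to_f16(float f) {
    uint32_t x;
    memcpy(&x, &f, sizeof x);
    uint32_t sign = (x >> 16) & 0x8000u;
    uint32_t e32  = (x >> 23) & 0xFFu;
    uint32_t man  = x & 0x7FFFFFu;
    int32_t  exp  = (int32_t)e32 - 127 + 15;
    if (e32 == 0xFFu) return (uint16_t)(sign | 0x7C00u | (man ? 0x200u : 0u));
    if (exp >= 31)    return (uint16_t)(sign | 0x7C00u);   /* overflow to inf */
    if (exp <= 0) {
        if (exp < -10) return (uint16_t)sign;              /* underflow to zero */
        man |= 0x800000u;                                  /* implicit leading 1 */
        uint32_t shift = (uint32_t)(14 - exp);
        uint32_t half  = man >> shift;
        if ((man >> (shift - 1)) & 1u) half++;             /* round half up */
        return (uint16_t)(sign | half);
    }
    uint32_t h = sign | ((uint32_t)exp << 10) | (man >> 13);
    if (man & 0x1000u) h++;        /* round half up; a carry bumps the exponent */
    return (uint16_t)h;
}

/* Partial scale: supports dst = src * scale only. Returns -1 (assumed
 * convention) when a non-zero bias is requested, since that route is
 * not implemented in this sketch. */
static int scale_f16_sketch(uint16_t *data, size_t n, float scale, float bias) {
    if (bias != 0.0f) return -1;
    for (size_t i = 0; i < n; i++) {
        data[i] = f32_to_f16(f16_to_f32(data[i]) * scale);
    }
    return 0;
}
```

Calling scale_f16_sketch(buf, n, 0.0f, 0.0f) zeroes a buffer of finite
values, which mirrors the state-reset use described above. Negative inputs
become negative zero, and inf or NaN inputs do not become zero.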
Branch: Mamba2SSD
Signed-off-by: Gabe Goodhart <[email protected]>