Releases: calad0i/HGQ2
Releases · calad0i/HGQ2
v0.1.8
Bug fixes, add SALT attn:
- mask leak in stable softmax impl
- ebops scale factor handling
- allow q/k/v proj fusing for mha (effect is minor - not needed in general)
- beta setter fix: skip layers disabled ebops computation
- DAIS tracer fixes
Full Changelog: v0.1.7...v0.1.8
v0.1.7
v0.1.6
- pf support for pooling layers
- add back dummy pf for einsum dense layer
- allow ebops scaling factor per layer
- test cov of all layers in da4ml
- explicit pf computation in conv for max pf case
- convlut fix
- breaking shuffle is off by default now in datasets.
Full Changelog: v0.1.5...v0.1.6
v0.1.5
- QDenseT/QConvT
- Dataloader shuffle support
- int bits decay fix in WRAP quantizer
Full Changelog: v0.1.4...v0.1.5