You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add support for layer norm and bias folding (#1044)
* Split weights instead of logits for models with joint QKV activation
* Adjust tests accordingly
* Set split_qkv_matrix function inside init
* Remove debugging print statements
* Add support for folding layer norm and folding value biases
* Enable layer norm folding by default in compatibility mode
* Remove old parameters
* Remove hardcoded filepath
* Make sure conversion rules are not none
* Make sure conversion rules are not none
* ran format
* optimized tests a bit
* remvoed extra files
* resolved test
* fixed test
* removed extra block
* removed extra variable
* restored hooks
* cleaned up imports
* Remove conversions out of layer norm folding
* Add configuration dictionary during initialization
* Fix typing error
* Do not use weights and biases if weights are folded
* Add uses_rms_norm configuration parameter
---------
Co-authored-by: Bryce Meyer <[email protected]>
0 commit comments