You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Summary: the move from simple_gpt to gpt-fast altered some things. This
unbreaks eval and GPTQ.
Note GPTQ still is broken due to kv cache issue in model. Needs either
non-public pytorch functionality or a change to GPTQ implementation. see
next PR in stack for a fix.
Test Plan:
python eval.py --checkpoint_path checkpoints/$MODEL_REPO/model.pth
python quantize.py --checkpoint_path checkpoints/$MODEL_REPO/model.pth --mode int4-gptq --calibration_tasks wikitext --calibration_limit 5
Reviewers:
Subscribers:
Tasks:
Tags:
ghstack-source-id: 7e73383
Pull Request resolved: #93
0 commit comments