v0.1.0 Release #9
Praful932
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This project started off as adding a small adapter over
scikit-learnto do hyperparameter search over LLMs but evolved into a more comprehensive module, encompassing the features below.generation_seed,tfs&top_aas part of HFmodel.generatefunctionThe future will be powered by S(Small)LMs that run on edge devices and sampling techniques (generation parameters) will play a huge role in proving their effectiveness. This belief is reaffirming to see when new sampling techniques keep getting adding to transformers and llama-3 itself comes with a predefined set of generation parameters instead of the default setting.
llmsearch v0.1.0still has a long way to go. There are certain deficiencies of the package that need to be addressed so that users using it get more value out of it. For example the way the package is currently structured where it usesscikit-learnto treat the generation parameters as a hyperparameter search is not optimal as there is no particularfitmethod. Then there is the reproducibility problem, where a particular generation parameter set is selected when it has a better score (which could itself be a artifact of reproducibility).Excited to make this open source and gather feedback from the community.
The
v1.0.0aims to remedy these issues and some more.Beta Was this translation helpful? Give feedback.
All reactions