v0.0.0beta13
·
388 commits
to main
since this release
What's Changed
- Update README.md by @saiatmakuri in #236
- Ianmacleod/fix download artifact gateway by @ian-scale in #237
- add peft config documentation by @saiatmakuri in #238
- Update client completion timeout by @seanshi-scale in #239
- Add nvidia.com/gpu in requests by @yunfeng-scale in #240
- Add new image to image cache by @seanshi-scale in #242
- Remove plugins from endpoint containers by @song-william in #241
- Add vLLM as an inference framework by @yunfeng-scale in #228
- Change max_input_length to half of max_total_tokens to work around potential tokenizer loading issue by @seanshi-scale in #244
- Validate Fine-tuning CSV headers by @saiatmakuri in #243
- Sync scale from zero part 2 by @seanshi-scale in #230
- Completions for vLLM endpoints by @yunfeng-scale in #245
- Download bin files for TGI also by @yunfeng-scale in #247
- update team label for fine-tunes by @saiatmakuri in #246
- Ianmacleod/completion sync error throws 4xx by @ian-scale in #234
- Some fixes by @yunfeng-scale in #248
- Higher concurrency limit for gunicorn by @yunfeng-scale in #249
- Pass labels to job config by @saiatmakuri in #251
- Bump python client version from 0.0.0beta12 to 0.0.0beta13 by @seanshi-scale in #253
Full Changelog: v0.0.0.beta12...v0.0.0beta13