v0.0.0beta13

seanshi-scale released this 02 Sep 01:05

· 388 commits to main since this release

6030f4d

What's Changed

Update README.md by @saiatmakuri in #236
Ianmacleod/fix download artifact gateway by @ian-scale in #237
add peft config documentation by @saiatmakuri in #238
Update client completion timeout by @seanshi-scale in #239
Add nvidia.com/gpu in requests by @yunfeng-scale in #240
Add new image to image cache by @seanshi-scale in #242
Remove plugins from endpoint containers by @song-william in #241
Add vLLM as an inference framework by @yunfeng-scale in #228
Change max_input_length to half of max_total_tokens to work around potential tokenizer loading issue by @seanshi-scale in #244
Validate Fine-tuning CSV headers by @saiatmakuri in #243
Sync scale from zero part 2 by @seanshi-scale in #230
Completions for vLLM endpoints by @yunfeng-scale in #245
Download bin files for TGI also by @yunfeng-scale in #247
update team label for fine-tunes by @saiatmakuri in #246
Ianmacleod/completion sync error throws 4xx by @ian-scale in #234
Some fixes by @yunfeng-scale in #248
Higher concurrency limit for gunicorn by @yunfeng-scale in #249
Pass labels to job config by @saiatmakuri in #251
Bump python client version from 0.0.0beta12 to 0.0.0beta13 by @seanshi-scale in #253

Full Changelog: v0.0.0.beta12...v0.0.0beta13

Contributors

song-william, seanshi-scale, and 3 other contributors

Assets 2