You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,7 +21,7 @@ HoML brings the best of both worlds together, offering:
21
21
***One-Line Installation:** A simple, one-line script for easy installation and upgrades across a wide range of machines.
22
22
***Simple CLI:** An intuitive command-line interface for managing and running models.
23
23
***Easy Model Management:** A `pull` command to download models from the Hugging Face Hub.
24
-
***Automatic Model Loading/Unloading:** Models are loaded and unloaded from memory as needed, with an option to disable this feature.
24
+
***Automatic GPU Memory Management:**HoML intelligently manages your GPU memory. Models are automatically loaded when requested via the OpenAI-compatible API and unloaded when another model is requested. To free up resources for other applications, models are also automatically unloaded after a configurable idle period (defaulting to 10 minutes).
25
25
***Interactive Chat:** A `run` command to start an interactive chat session with a model.
26
26
***OpenAI-Compatible API:** A built-in server that exposes an OpenAI-compatible API for seamless integration with existing tools.
27
27
***Curated Model List:** A website and a curated list of tested and verified models, with clear version compatibility.
0 commit comments