@@ -13,10 +13,10 @@ Roughly speaking, you could
1313* Use remote APIs to run models with your own code.
1414
1515 * An OpenAI API
16- * Models hosted locally at Aalto *good for data security *
16+ * Models hosted locally at Aalto: ` Aalto LLM API < https://github.com/AaltoSciComp/llm-examples/tree/main/aalto-llm-api >`__ *good for data security *
1717
1818* Running locally-installable models on your own
19- computer/Triton. * best for data security * .
19+ computer/Triton. Examples can be found ` here < https://github.com/AaltoSciComp/llm-examples >`__ .
2020
2121The first category is good for easy interactive or occasional use
2222(they can only be run manually), the second is good for scripted use
@@ -97,6 +97,10 @@ of text inputs.
9797 for a price. Without an Aalto contract, data security can not be
9898 guaranteed.
9999
100+ * Tools like `vllm <https://docs.vllm.ai/en/v0.4.1/serving/openai_compatible_server.html >`__ can be used to
101+ quickly host an OpenAI compatible API with local models. This is an option if you need a model
102+ that is not available in the Aalto LLM API. `This example <https://github.com/AaltoSciComp/llm-examples/tree/main/server-via-vllm >`__
103+ can help you try it out on triton.
100104
101105
102106Locally installed models
@@ -118,9 +122,7 @@ analysis (especially when combined with HPC clusters).
118122* :doc: `Aalto's Triton </triton/index >` cluster has many models
119123 pre-downloaded and software ready for your use.
120124
121- * See :doc: `/triton/apps/llms ` for instructions (there are more
122- model than listed on that page, and it is easy for us to
123- download more as soon as they are released).
125+ * See :doc: `/triton/apps/llms ` for instructions.
124126 * Installing the required software can sometimes be tricky, but
125127 Aalto SciComp has done this many times: come to our :ref: `garage
126128 help session <garage>` and ask us rather than waste your own time.
0 commit comments