Skip to content

Commit d041b67

Browse files
authored
update GenAI related pages (#763)
1 parent 002c176 commit d041b67

File tree

4 files changed

+11
-9
lines changed

4 files changed

+11
-9
lines changed

aalto/generative-ai-tools.rst

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -13,10 +13,10 @@ Roughly speaking, you could
1313
* Use remote APIs to run models with your own code.
1414

1515
* An OpenAI API
16-
* Models hosted locally at Aalto *good for data security*
16+
* Models hosted locally at Aalto: `Aalto LLM API <https://github.com/AaltoSciComp/llm-examples/tree/main/aalto-llm-api>`__ *good for data security*
1717

1818
* Running locally-installable models on your own
19-
computer/Triton. *best for data security*.
19+
computer/Triton. Examples can be found `here <https://github.com/AaltoSciComp/llm-examples>`__.
2020

2121
The first category is good for easy interactive or occasional use
2222
(they can only be run manually), the second is good for scripted use
@@ -97,6 +97,10 @@ of text inputs.
9797
for a price. Without an Aalto contract, data security can not be
9898
guaranteed.
9999

100+
* Tools like `vllm <https://docs.vllm.ai/en/v0.4.1/serving/openai_compatible_server.html>`__ can be used to
101+
quickly host an OpenAI compatible API with local models. This is an option if you need a model
102+
that is not available in the Aalto LLM API. `This example <https://github.com/AaltoSciComp/llm-examples/tree/main/server-via-vllm>`__
103+
can help you try it out on triton.
100104

101105

102106
Locally installed models
@@ -118,9 +122,7 @@ analysis (especially when combined with HPC clusters).
118122
* :doc:`Aalto's Triton </triton/index>` cluster has many models
119123
pre-downloaded and software ready for your use.
120124

121-
* See :doc:`/triton/apps/llms` for instructions (there are more
122-
model than listed on that page, and it is easy for us to
123-
download more as soon as they are released).
125+
* See :doc:`/triton/apps/llms` for instructions.
124126
* Installing the required software can sometimes be tricky, but
125127
Aalto SciComp has done this many times: come to our :ref:`garage
126128
help session <garage>` and ask us rather than waste your own time.

aalto/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -28,3 +28,4 @@ for Research page
2828
python
2929
remotejupyter
3030
zulip
31+
generative-ai-tools

aalto/llm-web-apis.rst

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ Local LLM web APIs
22
==================
33

44
As a pilot service, :doc:`Aalto RSE </rse/index>` has a service
5-
running some common open-source LLMS (llama2, mistral, etc.) available
5+
running some common open-source LLMS (llama3, deepseek, etc.) available
66
via the web. This can be used for lightweight purposes via
77
programming, but shouldn't replace batch usage (use
88
:doc:`/triton/apps/llms`) or interactive chatting (use Aalto GPT).
@@ -18,8 +18,7 @@ provide development access. Chat with us in the #llms stream on
1818

1919
The API doesn't have it's own detailed documentation (ask us), but the
2020
API should be OpenAI compatible (for chat models) so many existing
21-
libraries work automatically.
22-
21+
libraries work automatically. Here are some simple `examples. <https://github.com/AaltoSciComp/llm-examples/tree/main/aalto-llm-api>`__
2322

2423
Intended use and resource availability
2524
--------------------------------------

triton/apps/llms.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -60,7 +60,7 @@ In the following sbatch script, we request computational resources, load the nec
6060
6161
python your_script.py
6262
63-
The `your_script.py` Python script uses a HuggingFace model `mistralai/Mistral-7B-Instruct-v0.1` for conversations and instructions.
63+
The ``your_script.py`` Python script uses a HuggingFace model ``mistralai/Mistral-7B-Instruct-v0.1`` for conversations and instructions.
6464

6565
.. code-block:: python
6666

0 commit comments

Comments
 (0)