Commit 354ce23

Update Blog “deploying-a-hugging-face-llm-in-hpe-private-cloud-ai”
1 parent 5ae6eb9 commit 354ce23

2 files changed

+4
-4
lines changed

content/blog/deploying-a-hugging-face-llm-in-hpe-private-cloud-ai.md

Lines changed: 4 additions & 4 deletions
@@ -1,5 +1,5 @@
 ---
-title: Deploying a Small Language Model in HPE Private Cloud AI Using a Jupyter
+title: Deploying a Small Language Model in HPE Private Cloud AI using a Jupyter
 Notebook
 date: 2025-02-20T20:03:50.971Z
 author: Dave Wright and Elias Alagna
@@ -11,7 +11,7 @@ tags:
 - vllm
 - SLM
 ---
-Deploying new large language models for for users to interact with can be challenging for beginners. HPE has developed Private Cloud AI to help users get to productive AI work quickly and easily.
+Deploying new language models for for users to interact with can be challenging for beginners. HPE has developed Private Cloud AI to help users get to productive AI work quickly and easily.
 
 Here we will show how to use HPE Machine Learning Inference Service as a part of HPE Private Cloud AI to add a new packaged model from a Hugging Face repository and create an endpoint to query the model. This is done in a Jupyter Notebook.
 
@@ -45,12 +45,12 @@ This will create an endpoint for use in the notebook and provide an API token.
 
 ![New deployment for AI model](/img/new-deployment.png)
 
+When the process is complete an endpoint will be provided.
 
+![Endpoint provided by MLIS system](/img/endpoint.png)
 
 Next up let's take the now deployed model that's ready for inference and connect to it and interact with it from a Jupyter Notebook.
 
-
-
 ### Building the Jupyter Notebook
 
 First install `openai` if you do not already have it and import.

static/img/endpoint.png

-38.5 KB