How to use Vertex AI Llama 405B API (whilst it is still free) #3835
Firstly, I've become a massive fan of LibreChat, to the point that I've seamlessly connected our Azure GPT-4o, Google Gemini 1.5 Flash & Pro, plus numerous offline open-source models via Ollama (Phi, Llama 3, Gemma 2).
However, I'm stuck on how to connect (or maybe I can't?) the Llama 3.1 405B API (currently free) from Google / Vertex AI.
Note: I already have a valid, billable Google Cloud / Vertex AI account.
I have an existing PROJECT_ID.
The instructions, which work with Postman etc., are as follows:
Try Llama 3.1 API Service
To use Llama 3.1 API service with the command line interface (CLI), do the following:
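
For reference, the request those instructions boil down to looks roughly like the sketch below. This is only my reconstruction: the us-central1 region, the `/endpoints/openapi/chat/completions` path and the `meta/llama3-405b-instruct-maas` model id are what the Vertex AI docs showed at the time (and may change), and the bearer token comes from `gcloud auth print-access-token`, which expires after about 60 minutes.

```python
# Sketch only: endpoint path and model id taken from the Vertex AI Llama 3.1
# MaaS docs at the time of writing and may have changed since.
import subprocess
import requests

PROJECT_ID = "your-project-id"  # replace with your own PROJECT_ID
REGION = "us-central1"

# Short-lived OAuth access token, valid for roughly 60 minutes.
token = subprocess.run(
    ["gcloud", "auth", "print-access-token"],
    capture_output=True, text=True, check=True,
).stdout.strip()

url = (
    f"https://{REGION}-aiplatform.googleapis.com/v1beta1/"
    f"projects/{PROJECT_ID}/locations/{REGION}/endpoints/openapi/chat/completions"
)

resp = requests.post(
    url,
    headers={"Authorization": f"Bearer {token}"},
    json={
        "model": "meta/llama3-405b-instruct-maas",
        "messages": [{"role": "user", "content": "Hello from a LibreChat test"}],
        "stream": False,
    },
    timeout=60,
)
resp.raise_for_status()

# Assuming an OpenAI-style response shape.
print(resp.json()["choices"][0]["message"]["content"])
```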
Is there any way to integrate this into LibreChat? (Even if it does mean I have to get a new bearer token every 60 mins.)
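
To make the question concrete, this is the kind of custom endpoint entry I imagine in librechat.yaml. It's completely untested: the field names follow LibreChat's custom endpoint docs as I understand them, the baseURL and model id are my guesses from the Vertex AI docs, and the ${VERTEX_AI_TOKEN} environment variable would have to be refreshed with a new gcloud access token roughly every hour.

```yaml
# librechat.yaml (sketch only, untested)
version: 1.1.4   # match whatever config version your LibreChat release expects
endpoints:
  custom:
    - name: "Vertex AI Llama 3.1"
      # Read from the environment; I'd refresh it with
      # `gcloud auth print-access-token` roughly every 60 minutes.
      apiKey: "${VERTEX_AI_TOKEN}"
      # Guessing the OpenAI-compatible MaaS base URL from the Vertex AI docs;
      # replace PROJECT_ID with your own project.
      baseURL: "https://us-central1-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/us-central1/endpoints/openapi"
      models:
        default: ["meta/llama3-405b-instruct-maas"]
        fetch: false
      titleConvo: true
      titleModel: "meta/llama3-405b-instruct-maas"
```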
Many thanks for any advice or guidance.
Replies: 1 comment

- i was wondering the same