@lionello lionello commented May 20, 2025

Changing the managed-llm-provider sample to use the "generated" environment variables, <service>_URL and <service>_MODEL.

This needs DefangLabs/defang#1194 to work.
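
For illustration only, here is a minimal sketch of how application code might read the generated variables. The helper name `llm_settings`, the service name `llm`, the placeholder URL, and the rule for deriving the variable prefix (upper-casing the service name and replacing hyphens with underscores) are all assumptions, not something this PR confirms.

```python
import os
from collections.abc import Mapping


def llm_settings(service: str, env: Mapping[str, str] = os.environ) -> tuple[str, str]:
    """Return (url, model) read from <service>_URL and <service>_MODEL."""
    # Assumption: the prefix is the service name upper-cased, with hyphens
    # replaced by underscores (e.g. "llm" -> "LLM"). Adjust if the real
    # naming rule differs.
    prefix = service.upper().replace("-", "_")
    try:
        return env[f"{prefix}_URL"], env[f"{prefix}_MODEL"]
    except KeyError as missing:
        # Fail early with the name of the variable that was not provided.
        raise RuntimeError(f"{missing.args[0]} is not set") from None


if __name__ == "__main__":
    # Placeholder values for local experimentation only.
    fake_env = {"LLM_URL": "http://llm.example/api", "LLM_MODEL": "ai/llama3.2"}
    url, model = llm_settings("llm", fake_env)
    print(url, model)
```

Passing the environment in as a mapping keeps the lookup easy to exercise without touching the real process environment.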

Samples Checklist

✅ All good!

@lionello lionello had a problem deploying to deploy-changed-samples May 20, 2025 23:00 — with GitHub Actions Failure
environment:
- ENDPOINT_URL=http://llm/api/v1/chat/completions # endpoint to the gateway service
- MODEL=anthropic.claude-3-haiku-20240307-v1:0 # LLM model ID used for the gateway
- MODEL=us.amazon.nova-micro-v1:0 # LLM model ID used for the gateway

Let's use a cloud-agnostic model.

@lionello lionello May 20, 2025

So either a (hidden) default, or Llama or Mistral. (Llama is cheaper than Mistral.)

Yeah, I think ai/llama3.2 is available everywhere.

commit111 commented May 21, 2025

Be mindful of the changes in #378, where the whole Configuration section in the readme is removed and the compose.yaml file has an extra comment about model-mapping. It's mostly just a matter of resolving merge conflicts, though.

@commit111 commit111 had a problem deploying to deploy-changed-samples May 21, 2025 00:25 — with GitHub Actions Failure
@commit111 commit111 had a problem deploying to deploy-changed-samples May 21, 2025 00:31 — with GitHub Actions Failure
@jordanstephens jordanstephens self-requested a review May 21, 2025 21:09
@jordanstephens jordanstephens merged commit c4147c8 into main May 21, 2025
5 of 6 checks passed