Skip to content

docs: update & add new AWS DLCs#2571

Open
ehcalabres wants to merge 1 commit into
huggingface:mainfrom
ehcalabres:add-new-aws-dlcs
Open

docs: update & add new AWS DLCs#2571
ehcalabres wants to merge 1 commit into
huggingface:mainfrom
ehcalabres:add-new-aws-dlcs

Conversation

@ehcalabres

@ehcalabres ehcalabres commented Jun 17, 2026

Copy link
Copy Markdown
Contributor

Description

This PR adds docs for 2 new AWS DLCs:

  • Llama.cpp
  • vLLM Omni

Additionaly, it updates the vLLM version to the latest available container


Note

Low Risk
Documentation-only changes to DLC URIs and tables; no runtime or application code affected.

Overview
Updates the SageMaker Available DLCs inference page with newer container references and two new serving options.

The vLLM GPU row is bumped from 0.17.0 to 0.21.0, with an updated ECR image (transformers5.8.1, cu130). New sections document vLLM Omni (0.20.0, multimodal on GPU) and Llama.cpp (b9522, separate CPU and GPU images). SGLang, TEI, and Neuron vLLM entries are unchanged.

Reviewed by Cursor Bugbot for commit 66c2541. Bugbot is set up for automated code reviews on this repo. Configure here.

@HuggingFaceDocBuilderDev

Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ehcalabres ehcalabres requested a review from pagezyhf June 17, 2026 17:25

@pagezyhf pagezyhf left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!
I think we should also update the FAQ:

@ehcalabres

Copy link
Copy Markdown
Contributor Author

Hey @pagezyhf! Yes, maybe I can update the How to find the URI of my container? section with the changes from aws/sagemaker-python-sdk#5960 and we just keep one section for finding the right container URI

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants