Conversation
# With all this in place, we are ready to define our high-performance, low-latency
# LFM 2 inference server.

app = modal.App("examples-lfm-snapshot")
🚩 App name uses examples- prefix instead of example-
The app is named "examples-lfm-snapshot" at lfm_snapshot.py:268, but every other app in the llm-serving/ directory uses the "example-" prefix (singular): "example-vllm-inference", "example-vllm-low-latency", "example-sglang-snapshot", etc. The CLAUDE.md guidelines also specify example- prefix with kebab-case. The __main__ block at lfm_snapshot.py:507 correctly references the same "examples-lfm-snapshot" string, so this won't cause a runtime mismatch, but it breaks the naming convention.
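A minimal sketch of the suggested rename, assuming the constant-plus-two-call-sites structure described above (`APP_NAME` is a hypothetical helper introduced here for illustration, not a name from the PR):

```python
# Rename to match the repo's singular "example-" kebab-case convention.
APP_NAME = "example-lfm-snapshot"  # was: "examples-lfm-snapshot"

# Both call sites (the App constructor and the __main__ block) could then
# reference the same constant, so they cannot drift apart:
# app = modal.App(APP_NAME)
```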
MINUTES = 60

MODEL_NAME = os.environ.get("MODEL_NAME", "LiquidAI/LFM2-8B-A1B")
🚩 Model revision is not pinned, unlike other examples
The internal CLAUDE.md guidelines explicitly state: "Always pin model revisions to avoid surprises when upstream repos update". The vllm_low_latency.py example pins MODEL_REVISION and passes --revision to the vLLM CLI (vllm_low_latency.py:69-71, vllm_low_latency.py:276-277). This example does not pin a revision and does not pass --revision in the vLLM command at lines 296-317. If LiquidAI pushes a breaking update to their HuggingFace repo, this example could break silently.
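A hedged sketch of what the pin could look like, modeled on the `MODEL_REVISION` pattern in vllm_low_latency.py; the `MODEL_REVISION` env var and the placeholder SHA below are assumptions for illustration, not values from this PR:

```python
import os

# Pin the model revision so upstream pushes to the HuggingFace repo cannot
# change behavior silently. The default below is a placeholder, not a real
# LiquidAI commit SHA.
MODEL_NAME = os.environ.get("MODEL_NAME", "LiquidAI/LFM2-8B-A1B")
MODEL_REVISION = os.environ.get("MODEL_REVISION", "<pinned-commit-sha>")

# Thread the pin through to the serve command, mirroring vllm_low_latency.py.
vllm_cmd = [
    "vllm", "serve", MODEL_NAME,
    "--revision", MODEL_REVISION,
]
```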
| "--max-cudagraph-capture-size", | ||
| f"{MAX_INPUTS}", |
🚩 --max-cudagraph-capture-size CLI flag may not exist in vLLM v0.15.1
The vLLM serve command at lines 314-315 uses --max-cudagraph-capture-size, but this flag name doesn't appear in any other vLLM CLI invocation in the repo — only as a config dictionary key in gpt_oss_inference.py:136. Other vLLM examples pass cudagraph-related settings differently (e.g., through --compilation-config or not at all). Since the base image is vllm/vllm-openai:v0.15.1 (a future version I can't verify), I can't confirm whether this CLI flag is valid. If it isn't recognized, vLLM will fail to start.
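If the flag is rejected by this vLLM build, one possible fallback (an assumption, not verified against v0.15.1) is to route the setting through `--compilation-config` as JSON, similar to how gpt_oss_inference.py uses a config dictionary; the `cudagraph_capture_sizes` key name below should be checked against the vLLM docs for the pinned version:

```python
import json

MAX_INPUTS = 32  # illustrative value, not taken from this PR

# Hedged alternative: serialize cudagraph settings into the compilation
# config rather than relying on a possibly-nonexistent dedicated CLI flag.
compilation_config = json.dumps({"cudagraph_capture_sizes": [MAX_INPUTS]})
fallback_args = ["--compilation-config", compilation_config]
```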
|
lgtm