Sync Swift implementation of Qwen with mlx-lm by ronaldmannak · Pull Request #137 · ml-explore/mlx-swift-lm

ronaldmannak · 2026-03-07T05:41:40Z

Proposed changes

I noticed that Qwen 3.5 can sometimes get stuck in infinite repetition of one or more paragraphs. This is mentioned in Qwen 3.5 readme of the 0.8B and 2B versions, but I've seen it happening with larger Qwen 3.5 models as well. This PR does not fix that issue, but while investigating it I found a few discrepancies with the Python implementation. This change updates the Swift version to match mlx-lm. I'm creating a draft pull request and will continue to investigate the repetition issue (if the issue is the swift implementation)

Checklist

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

Upcast gate and normalized x to float32 before silu+multiply to match upstream Python fix and prevent numerical degradation during generation. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ronaldmannak and others added 6 commits March 6, 2026 21:34

narrow to just non conforming floats

7b4670e

swift lint

79b692f

Fix RMSNormGated float32 precision in VLM Qwen35 (Python PR #951)

dbeacb7

Upcast gate and normalized x to float32 before silu+multiply to match upstream Python fix and prevent numerical degradation during generation. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Align Qwen 3.5 VLM RMSNormGated with upstream mlx-vlm

75e4798

Optional maxKVSize

a1d9784

swift lint

91470b5

davidkoski mentioned this pull request Mar 9, 2026

[BUG] first token, then repeated exclamation marks with long context #138

Closed

davidkoski added the swift-format Swift format failure in CI label Mar 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync Swift implementation of Qwen with mlx-lm#137

Sync Swift implementation of Qwen with mlx-lm#137
ronaldmannak wants to merge 6 commits intoml-explore:mainfrom
PicoMLX:qwen

ronaldmannak commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ronaldmannak commented Mar 7, 2026

Proposed changes

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants