batch : fix consistency checks for the input positions #16890

ggerganov · 2025-10-31T09:16:55Z

Update sanity checks in llama-batch to allow repeating positions for mrope embedding inputs.

ngxson

Thanks!

Signed-off-by: JamePeng <[email protected]>

batch : fix consistency checks for the input positions

4b85715

ggerganov mentioned this pull request Oct 31, 2025

Eval bug: QwenVL models "have inconsistent sequence positions" decode failure when n_img_batches > 1 #16876

Closed

ggerganov requested a review from ngxson October 31, 2025 09:22

ngxson approved these changes Oct 31, 2025

View reviewed changes

JamePeng mentioned this pull request Oct 31, 2025

Feature Request: support qwen3-vl series abetlen/llama-cpp-python#2080

Open

JamePeng referenced this pull request in JamePeng/llama-cpp-python Oct 31, 2025

feat: Add Qwen3VLChatHandler into llama_chat_format.py

33b31be

Signed-off-by: JamePeng <[email protected]>

ggerganov merged commit 8da3c0e into master Oct 31, 2025
68 checks passed

ggerganov deleted the gg/batch-fix-pos-check branch October 31, 2025 11:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

batch : fix consistency checks for the input positions #16890

batch : fix consistency checks for the input positions #16890

ggerganov commented Oct 31, 2025

Uh oh!

ngxson left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

batch : fix consistency checks for the input positions #16890

batch : fix consistency checks for the input positions #16890

Conversation

ggerganov commented Oct 31, 2025

Uh oh!

ngxson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants