Skip to content

Eval bug: Qwen 3 VL provides incorrect bounding boxes on sizes that are not 1000x1000px / non-squareΒ #16880

@ayayakirara

Description

@ayayakirara

Name and Version

./llama-b6887-bin-ubuntu-x64/build/bin/llama-cli: /lib64/libcurl.so.4: no version information available (required by ./llama-b6887-bin-ubuntu-x64/build/bin/llama-cli)
load_backend: loaded RPC backend from /var/home/user/LLMs/llama-b6887-bin-ubuntu-x64/build/bin/libggml-rpc.so
load_backend: loaded CPU backend from /var/home/user/LLMs/llama-b6887-bin-ubuntu-x64/build/bin/libggml-cpu-icelake.so
version: 6887 (d261223)
built with cc (Ubuntu 11.4.0-1ubuntu1~22.04.2) 11.4.0 for x86_64-linux-gnu

(Using 6887 from releases)

Operating systems

Linux

GGML backends

CPU

Hardware

Ryzen 5 8640HS

Models

https://huggingface.co/unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF/tree/main

unsloth-Qwen3-30B-A3B-Instruct-2507-UD-Q4_K_XL.gguf
unsloth-Qwen3-VL-30B-A3B-Instruct-mmproj-F16.gguf

Problem description & steps to reproduce

Steps to Reproduce

Run ./{{llama_path}}/llama-server -m unsloth-Qwen3-VL-30B-A3B-Instruct-UD-Q4_K_XL.gguf --mmproj unsloth-Qwen3-VL-30B-A3B-Instruct-mmproj-F16.gguf -c 32768 --jinja --no-context-shift --slot-prompt-similarity 0.0 --keep 0 --cache-ram 0 --no-perf --temp 0 -np 1

Use test image here:
Image
Image

Use test HTML provided here: #16207 (comment)

Expected Results

Expectation is that bounding boxes are accurate without any adjustments of the pixel coordinates. However in reality for the square image we need to resize the image to be 1.25x so that the bounding box coordinates match the bounding box output.

Actual results
Image

After scaling by 1.25x
Image

For the non-square image, bounding boxes have incorrect aspect ratio for bounding boxes.

Image

Related

Probably relates to #16842

First Bad Commit

6887 and 6891 have the same observed behavior

Relevant log output

N/a

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions