Inaccurate architecture description in README: encoder-decoder vs. decoder-only #5

I noticed that the README and the Technical Report describe the model architecture inconsistently.

README: mentions "...unified encoder-decoder architecture..."
Technical Report: states "...adopts a decoder-only vision–language architecture following the design principles of Qwen3-VL."

To keep the README technically accurate and consistent with both the Technical Report and Qwen3-VL, it should be updated to describe the model as decoder-only.