Skip to content

[Tracking] Qwen3.5/Qwen3-Next Optimizations #18590

@hlu1

Description

@hlu1

Hybrid Liear Attention

GDN Kernels

Not arch specific

SM90

SM100

MoE

Full Attention

Communications

NVFP4 kv cache

Runtime

Qwen3-Next

Qwen3.5

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions