Skip to content

ONNX backend does not support BFLOAT16 inputs #8464

@MatthieuToulemont

Description

@MatthieuToulemont

Description:

I am trying to run a model with Bfloat16 inputs in ONNX but I get the following message:

“Invalid argument: load failed for model ‘FluxEasyControlKVCacheApplyTokenDrop’: version 1 is at UNAVAILABLE state: Internal: unsupported datatype TYPE_BF16 for input ‘double_blocks_k_cache’ for model ‘FluxEasyControlKVCacheApplyTokenDrop’;\n”

It's a bit frustrating because I am able to run the model outside of Triton with Bfloat16 inputs.

I am running in the 25.02 triton ngc container

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions