GPT-OSS fine-tuning:
While loading the gpt-oss-20b model quantized on an H200 GPU, with the config below:
```python
from transformers import Mxfp4Config

# Keep the weights in MXFP4 rather than dequantizing them
quantization_config = Mxfp4Config(dequantize=False)

model_kwargs = dict(
    attn_implementation="eager",
    torch_dtype="auto",
    quantization_config=quantization_config,
    use_cache=False,
    device_map="auto",
)
```
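For reference, a minimal loading call using these kwargs might look like the sketch below; the `openai/gpt-oss-20b` repo id is an assumption based on the model name mentioned above.

```python
from transformers import AutoModelForCausalLM

# Hypothetical loading call with the kwargs above; adjust the repo id if needed.
model = AutoModelForCausalLM.from_pretrained("openai/gpt-oss-20b", **model_kwargs)
```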
Error:
```
MXFP4 quantization requires triton >= 3.4.0 and kernels installed, we will default to dequantizing the model to bf16
```
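One way to see whether the fallback actually happened is to inspect the loaded weights; a rough sketch, assuming `model` was loaded as above (the exact attributes may vary by transformers release):

```python
# If the fallback kicked in, the weights will have been dequantized to bf16.
print(model.config.quantization_config)  # active quantization settings, if any
print(next(model.parameters()).dtype)    # torch.bfloat16 suggests the bf16 fallback
```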
Note:
I have also installed triton.
Why can't I still load the MXFP4-quantized version of the model?
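Since the warning names both a triton minimum and the `kernels` package, it may help to verify what the running environment actually sees; a minimal check, using only the package names taken from the warning text:

```python
import importlib.metadata

# The warning requires triton >= 3.4.0 plus the `kernels` package;
# print what the current environment actually provides.
for pkg in ("triton", "kernels"):
    try:
        print(f"{pkg}: {importlib.metadata.version(pkg)}")
    except importlib.metadata.PackageNotFoundError:
        print(f"{pkg}: not installed")
```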