
Add traceable mistral and mistral3 classes#1343

Closed
anmarques wants to merge 17 commits into main from traceable_mistral3

Conversation


@anmarques anmarques commented Apr 9, 2025

SUMMARY:
This PR adds traceable versions of Mistral and Mistral3.

NOTE:
The code fails quality and style tests, but I think we should ignore them. The failures occur in the model definitions ported from transformers, and keeping the changes to a minimum helps maintain these files. I added ignore commands to the file headers to automatically skip linting.

TEST PLAN:

llmcompressor.trace \
    --model_id "mistralai/Mistral-Small-3.1-24B-Instruct-2503" \
    --model_class "TraceableMistral3ForConditionalGeneration" \
    --sequential_targets "MistralDecoderLayer" \
    --ignore "language_model.lm_head" "re:vision_tower.*" "re:multi_modal_projector.*" \
    --modality vision
llmcompressor.trace \
    --model_id "mistralai/Mistral-Small-3.1-24B-Instruct-2503" \
    --model_class "TraceableMistral3ForConditionalGeneration" \
    --sequential_targets "MistralDecoderLayer" \
    --ignore "language_model.lm_head" "re:vision_tower.*" "re:multi_modal_projector.*" \
    --modality text

@anmarques anmarques requested a review from kylesayrs April 9, 2025 20:36

github-actions bot commented Apr 9, 2025

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite; please only add the label once the PR is code complete and local testing has been performed.

@anmarques anmarques requested a review from dsikka April 9, 2025 20:36
@anmarques anmarques added the ready When a PR is ready for review label Apr 9, 2025
Collaborator

@dsikka dsikka left a comment

I think if we want to ignore linting/style, we should add lines to skip linting, e.g.:

@anmarques
Collaborator Author

> I think if we want to ignore linting/style, we should add lines to skip linting, e.g.:

I wasn't aware this was an option. Done. Thanks!

@anmarques anmarques requested a review from dsikka April 10, 2025 21:37
@anmarques anmarques marked this pull request as draft April 11, 2025 21:27
@anmarques
Collaborator Author

Converted to draft because things are still in flux until we upgrade transformers support.

@anmarques anmarques removed the ready When a PR is ready for review label Apr 11, 2025
@kylesayrs
Collaborator

A traceable definition for this model is no longer required now that #1411 has been implemented. The accompanying example and test for this model are implemented in #1490.

@kylesayrs kylesayrs closed this May 29, 2025
kylesayrs added a commit that referenced this pull request Jun 3, 2025
## Purpose ##
* Add support for mistral3
* Related: #1343

## Prerequisites ##
* #1479

## Changes ##
* Added mistral3 example
* This model does not automatically change the dtype of pixel_values to
match the dtype of the model, so I had to do so manually in the collator
and sample generation
* This model has a [very verbose chat template by
default](https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/blob/main/chat_template.json),
which may be less conducive to calibration, so I added a custom
shortened version

## Testing ##
* Ran example to completion:
[nm-testing/Mistral-Small-3.1-24B-Instruct-2503-W4A16-G128](https://huggingface.co/nm-testing/Mistral-Small-3.1-24B-Instruct-2503-W4A16-G128)

---------

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
aireilly pushed a commit to aireilly/llm-compressor that referenced this pull request Jul 30, 2025