@hthadicherla
Contributor

What does this PR do?

Type of change: Bug fix

Overview: Updated torch and transformers to the latest versions for the standard quantization examples.
For Whisper quantization, updated the README with steps to install and enable the latest torch and torchaudio.

Testing

Tested quantization and the MMLU benchmarks with the updated torch and transformers versions. Both worked as expected.

Signed-off-by: Hrishith Thadicherla <[email protected]>
@hthadicherla hthadicherla requested a review from a team as a code owner December 1, 2025 08:07
@codecov

codecov bot commented Dec 1, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 74.64%. Comparing base (fa84955) to head (135d93e).

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #622   +/-   ##
=======================================
  Coverage   74.64%   74.64%           
=======================================
  Files         183      183           
  Lines       18542    18542           
=======================================
  Hits        13840    13840           
  Misses       4702     4702           

Comment on lines 11 to 12
torch==2.7.0+cu128
torchaudio==2.7.0+cu128
Collaborator

Can we update these torch dependencies to 2.9 as well?

Contributor Author

We kept these torch dependencies at 2.7 because torchaudio >= 2.8 requires ffmpeg binaries, which cannot be installed with pip and have to be installed manually. Since torchaudio >= 2.8 also requires torch >= 2.8, we cannot upgrade torch either.

Contributor Author

I have written steps to upgrade torch dependencies in the README though.
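
For anyone following the README upgrade path, the constraint described in this thread can be sketched as a small pre-install check. This is only an illustration: `release_tuple` and `check_pins` are hypothetical helper names, not part of this repository, and the two rules encoded below are taken from the comment above rather than from the torchaudio documentation.

```python
import re
import shutil


def release_tuple(pin: str) -> tuple[int, int]:
    """Extract (major, minor) from an exact pin such as 'torch==2.7.0+cu128'."""
    match = re.search(r"==\s*(\d+)\.(\d+)", pin)
    if match is None:
        raise ValueError(f"not an exact '==' pin: {pin!r}")
    return int(match.group(1)), int(match.group(2))


def check_pins(torch_pin: str, torchaudio_pin: str, ffmpeg_on_path: bool) -> list[str]:
    """Return a list of problems; an empty list means the pins look consistent.

    Assumption (from the review thread): torchaudio >= 2.8 needs torch >= 2.8
    and an ffmpeg binary that pip cannot provide.
    """
    problems = []
    torch_rel = release_tuple(torch_pin)
    audio_rel = release_tuple(torchaudio_pin)
    if audio_rel >= (2, 8):
        if torch_rel < (2, 8):
            problems.append("torchaudio >= 2.8 needs torch >= 2.8")
        if not ffmpeg_on_path:
            problems.append("torchaudio >= 2.8 needs ffmpeg installed manually")
    return problems


if __name__ == "__main__":
    # shutil.which only tells us whether an ffmpeg executable is on PATH.
    ffmpeg_found = shutil.which("ffmpeg") is not None
    print(check_pins("torch==2.7.0+cu128", "torchaudio==2.7.0+cu128", ffmpeg_found))
```

With the 2.7 pins currently in the requirements file, the check reports no problems whether or not ffmpeg is present, which is why those pins stay the default.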

Comment on lines +7 to 8
onnx==1.18.0
onnxruntime-gpu==1.20.1
Collaborator

Can we use onnx 1.19 and ort 1.23?

Contributor Author

I don't think there should be an issue upgrading onnx to 1.19, but I need to test the scripts with the newer onnxruntime-gpu first. I will update this comment if it works.
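
Before rerunning the scripts, it may help to confirm which versions are actually installed in the test environment. A minimal sketch using only the standard library follows; the `PROPOSED` mapping mirrors the versions suggested in this thread, and `installed_version` and `report` are hypothetical helper names.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions proposed in the review comment (minor releases only).
PROPOSED = {"onnx": "1.19", "onnxruntime-gpu": "1.23"}


def installed_version(package: str):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def report(proposed: dict) -> list:
    """Build one status line per package comparing installed and proposed versions."""
    lines = []
    for name, wanted in proposed.items():
        found = installed_version(name)
        if found is None:
            status = "not installed"
        elif found == wanted or found.startswith(wanted + "."):
            status = "matches"
        else:
            status = "differs"
        lines.append(f"{name}: proposed {wanted}.x, found {found} ({status})")
    return lines


if __name__ == "__main__":
    for line in report(PROPOSED):
        print(line)
```

This only compares version strings; it does not replace running the quantization scripts against the new onnxruntime-gpu build.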

@kevalmorabia97
Collaborator

If we no longer use ort-dml for Windows, please update the requirements in setup.py as well: https://github.com/NVIDIA/TensorRT-Model-Optimizer/blob/main/setup.py#L53
