
Conversation

@clefourrier (Member) commented Nov 20, 2025

The PR does two main things:

  • adds a del on the created objects to force the memory release of attached resources (see the first sketch below)
  • constrains the max generation size with the user-provided value, which was previously skipped and otherwise ignored (we should be careful with this: generation size management is heavily duplicated across the code base, so I suspect the fix here will need to be ported to other places in the code or moved into a better system; see the second sketch below)

The rest of the modifications are nits (duplicated code and legacy functions that I removed). They could go in another PR, but they were thematically linked.
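As a minimal sketch of the cleanup pattern the first bullet refers to (the function, variable names, and model call are illustrative, not the actual lighteval code):

```python
import gc

import torch

def score_batch(model, batch, device):
    # Hypothetical example of the del pattern described above.
    inputs = batch.to(device)
    with torch.no_grad():
        outputs = model(inputs)
    scores = outputs.logits.cpu()

    # Drop the references to the GPU tensors explicitly: without the del,
    # they stay alive until the end of the scope and their memory cannot
    # be reclaimed by the allocator.
    del inputs, outputs
    gc.collect()
    torch.cuda.empty_cache()
    return scores
```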

…items, plus updated the generation size logic to respect what the user asks for
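To illustrate the generation size constraint with a hypothetical helper (not the actual lighteval code): the generation budget is bounded both by what the user asked for and by the room left in the model's context window.

```python
def effective_max_new_tokens(user_max_gen_size, max_model_length, context_length):
    # Hypothetical helper: respect the user-provided cap when it is set,
    # but never overflow the model's context window.
    room_left = max_model_length - context_length
    if user_max_gen_size is None:
        return room_left
    return min(user_max_gen_size, room_left)
```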
```python
if config.model_parallel is False and self.config.dtype not in ["4bit", "8bit"]:
    logger.info(f"Using Data Parallelism, putting model on device {self._device}")
    self.model = self.model.to(self._device)
if config.compile:
```
@clefourrier (Member Author):

Duplicate code, this already exists in `_create_auto_model`.

```python
)
# model.to(self.device)
model.eval()
torch.set_grad_enabled(False)
```
@clefourrier (Member Author):

Now set once at the module level.
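For illustration, a module-level setting amounts to something like this (a sketch, not the exact lighteval code):

```python
import torch

# Executed once at import time: disables autograd globally, replacing
# the per-model torch.set_grad_enabled(False) calls.
torch.set_grad_enabled(False)
```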

```python
        continuation = continuation.lstrip()
        return continuation

    def _model_call(self, inputs: torch.Tensor) -> torch.Tensor:
```
@clefourrier (Member Author):

Legacy function, removed.

@clefourrier requested a review from @NathanHB on Nov 20, 2025 at 12:52
@HuggingFaceDocBuilderDev (Collaborator) commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@clefourrier (Member Author) commented:

@pcuenca this should fix the issue you had with auto batch size, can you take a look?

I'm not sure it's 100% perfect, as I'm still seeing some memory that is not deallocated in the model, but I suspect it should already be helpful for your use case.
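For context, automatic batch size detection typically probes downward on OOM, which is why leaked allocations matter: memory left over from a failed attempt makes every following attempt fail too. A minimal sketch of that pattern (the names and structure are illustrative, not lighteval's actual implementation):

```python
import gc

import torch

def find_max_batch_size(forward_fn, start: int = 512) -> int:
    # Illustrative probe: try a batch size and halve it on OOM, cleaning
    # up between attempts so a failed try does not poison the next one.
    batch_size = start
    while batch_size >= 1:
        try:
            forward_fn(batch_size)
            return batch_size
        except torch.cuda.OutOfMemoryError:
            gc.collect()
            torch.cuda.empty_cache()
            batch_size //= 2
    raise RuntimeError("no batch size fits in memory")
```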

@clefourrier merged commit 2236e17 into main on Nov 20, 2025
5 checks passed
@NathanHB added the bug label on Nov 24, 2025
