[tx] Compute prompt logprobs on per-micro-batch basis #727
base: main
Conversation
/gemini review
Code Review
This pull request refactors prompt logprob computation to happen on a per-micro-batch basis rather than for the entire batch. This is a sensible optimization that avoids unnecessary work when only some requests in a batch require logprobs. The changes to the test suite correctly validate the new behavior by including a mix of requests with and without logprobs enabled. The implementation is mostly correct, but I've identified a potential robustness issue where a TypeError could occur.
```python
all_prompt_logprobs.extend(
    result.prompt_logprobs[i] if request_logprobs else None
    for i, request_logprobs in enumerate(all_request_logprobs[batch_start:batch_end])
)
```
The current implementation for extending all_prompt_logprobs could lead to a TypeError if model.generate returns None for prompt_logprobs even when they were requested for the micro-batch (needs_prompt_logprobs is True). The previous code included a check for result.prompt_logprobs being truthy, which was removed, making the access to result.prompt_logprobs[i] potentially unsafe.
To make the code more robust, it's better to reintroduce a check for result.prompt_logprobs being non-None before attempting to index it. The suggested change below does this in a compact way.
```diff
 all_prompt_logprobs.extend(
-    result.prompt_logprobs[i] if request_logprobs else None
+    result.prompt_logprobs[i] if request_logprobs and result.prompt_logprobs else None
     for i, request_logprobs in enumerate(all_request_logprobs[batch_start:batch_end])
 )
```
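To make the failure mode concrete, here is a minimal, self-contained sketch (using a hypothetical `SimpleNamespace` result object rather than the repo's actual types) of why the extra guard avoids the TypeError:

```python
from types import SimpleNamespace

# Hypothetical stand-in for a generation result whose prompt_logprobs came back as None.
result = SimpleNamespace(prompt_logprobs=None)
request_logprobs = True

# Unguarded form: indexing None raises TypeError ('NoneType' object is not subscriptable).
# value = result.prompt_logprobs[0] if request_logprobs else None

# Guarded form: the `and` short-circuits on a falsy prompt_logprobs and yields None instead.
value = result.prompt_logprobs[0] if request_logprobs and result.prompt_logprobs else None
print(value)  # -> None
```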
In response to the comment here: #577 (comment)
Prior to this PR, we check whether any request in a batch has requested prompt logprobs and, if so, compute prompt logprobs for all samples in the batch. This PR moves that check to the per-micro-batch level, avoiding unnecessary computation when a micro batch contains no logprob requests.
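A rough sketch of the per-micro-batch control flow described above; the function name, `micro_batch_size` parameter, and the `model.generate` signature below are hypothetical stand-ins, not the actual implementation:

```python
def generate_with_microbatching(prompts, all_request_logprobs, micro_batch_size, model):
    """Sketch: compute prompt logprobs only for micro batches that need them."""
    all_prompt_logprobs = []
    for batch_start in range(0, len(prompts), micro_batch_size):
        batch_end = batch_start + micro_batch_size
        micro_request_logprobs = all_request_logprobs[batch_start:batch_end]

        # Previously this check spanned the whole batch; it is now scoped to the
        # current micro batch, so micro batches with no logprob requests skip
        # the extra computation entirely.
        needs_prompt_logprobs = any(micro_request_logprobs)

        result = model.generate(
            prompts[batch_start:batch_end],
            prompt_logprobs=needs_prompt_logprobs,
        )

        all_prompt_logprobs.extend(
            result.prompt_logprobs[i] if request_logprobs and result.prompt_logprobs else None
            for i, request_logprobs in enumerate(micro_request_logprobs)
        )
    return all_prompt_logprobs
```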
Tested with the existing micro-batching test: `test_sample_prompt_logprobs_with_microbatching`.