[CI] Add mtp_proposer ut #4397

dragondream-chen · 2025-11-24T09:06:15Z

What this PR does / why we need it?

Add unit tests (UT) for mtp_proposer.

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.11.0
vLLM main: vllm-project/vllm@2918c1b

github-actions · 2025-11-24T09:06:22Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling out the PR description, to help reviewers and future developers understand the change.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

gemini-code-assist

Code Review

This pull request adds unit tests for the MtpProposer. The tests cover initialization, model loading, and various helper methods for token generation. I've found several critical issues in the new tests that make them fragile or incorrect. One test has a fragile assertion on an exception message. Another test builds its expected array from a list containing a literal ellipsis, which does not produce the intended values, and also has a logical flaw in its assertions. These issues need to be addressed to ensure the tests are reliable and correctly validate the code's behavior.

gemini-code-assist · 2025-11-24T09:07:49Z

tests/ut/spec_decode/test_mtp_proposer.py

+        with pytest.raises(AssertionError) as excinfo:
+            proposer.load_model(mock_model)
+
+        assert str(excinfo.value) == ""


The assertion assert str(excinfo.value) == "" is fragile. The string representation of an AssertionError is the message attached to the failing assert, so it is empty only when load_model raises from a bare assert with no message. If that assert carries a message, or if the module is processed by pytest's assertion rewriting (which attaches the failing expression, e.g., 'assert 0 == 1'), the string is not empty and the test fails. If you only care that an AssertionError is raised, remove the assertion on the exception value.

Suggested change

-        with pytest.raises(AssertionError) as excinfo:
-            proposer.load_model(mock_model)
-
-        assert str(excinfo.value) == ""
+        with pytest.raises(AssertionError):
+            proposer.load_model(mock_model)
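To make the point about exception messages concrete, here is a minimal sketch in plain Python. The two `load_*` functions are hypothetical stand-ins, not the real `load_model`; they only show how the string of an `AssertionError` depends on whether the failing `assert` has a message.

```python
def load_with_bare_assert(model):
    # Hypothetical stand-in for load_model: a bare assert with no message.
    assert model is not None


def load_with_message(model):
    # Same check, but the assert carries an explicit message.
    assert model is not None, "model must not be None"


def assertion_message(func, arg):
    """Return str() of the AssertionError raised by func(arg), or None if none is raised."""
    try:
        func(arg)
    except AssertionError as exc:
        return str(exc)
    return None


if __name__ == "__main__":
    print(repr(assertion_message(load_with_bare_assert, None)))
    print(repr(assertion_message(load_with_message, None)))
```

Run as a plain script, outside pytest, the bare assert yields an empty string and the second call yields the explicit message. An equality check against "" therefore ties the test to how the assert in the code under test is written, which is why dropping that check is the more robust choice.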

gemini-code-assist · 2025-11-24T09:07:49Z

tests/ut/spec_decode/mtp_proposer.py

+
+        mock_backup_output = proposer.backup_next_token_ids
+
+        expected_backup_cpu = np.array([1000, 2000, 3000, 4000, 0, 0, ...])[:10]


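This line is broken, although not by a syntax error. The ellipsis ... is a valid Python expression (the Ellipsis object), so np.array builds an object-dtype array whose last element is Ellipsis rather than an integer array padded with zeros. The list has only 7 elements, and slicing with [:10] does not extend it to 10. The expected array therefore never matches the real buffer contents. Write out the expected values explicitly.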

Suggested change

-        expected_backup_cpu = np.array([1000, 2000, 3000, 4000, 0, 0, ...])[:10]
+        expected_backup_cpu = np.array([1000, 2000, 3000, 4000, 0, 0, 0, 0, 0, 0])
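The behaviour of a literal `...` inside a list can be checked without NumPy. The sketch below uses plain Python lists to stay dependency-free; `pad_with_zeros` is a hypothetical helper introduced only for illustration, and the target length of 10 is taken from the suggested change above.

```python
# A literal `...` is the Ellipsis singleton, stored like any other element.
raw = [1000, 2000, 3000, 4000, 0, 0, ...]
last_is_ellipsis = raw[-1] is Ellipsis

# Slicing past the end of a list does not pad it: 7 elements in, 7 out.
sliced_len = len(raw[:10])


def pad_with_zeros(values, length):
    """Hypothetical helper: right-pad `values` with zeros up to `length`.

    If `values` is already at least `length` long, it is returned unchanged.
    """
    return list(values) + [0] * (length - len(values))


# Explicit padding gives the intended 10-element expectation.
expected = pad_with_zeros([1000, 2000, 3000, 4000], 10)

if __name__ == "__main__":
    print(last_is_ellipsis, sliced_len, expected)
```

Spelling out the zeros, or padding explicitly as in the helper, keeps the expected values integer-only and of the intended length.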

gemini-code-assist · 2025-11-24T09:07:49Z

tests/ut/spec_decode/mtp_proposer.py

+        expected_next_tokens = torch.tensor([103, 2, 3, 4], dtype=torch.int32, device="cpu")
+        assert torch.equal(next_token_ids, expected_next_tokens)


The expected value for next_token_ids is incorrect. The backup tokens are calculated from the mocked requests to be [1000, 2000, 3000, 4000]. The method under test correctly calculates the backup tokens and copies them to the GPU buffer. For discarded requests, it should fall back to these values. However, the test asserts against [2, 3, 4], which seems to be based on a stale mock value. The assertion should check against the correctly calculated backup tokens.

Suggested change

-        expected_next_tokens = torch.tensor([103, 2, 3, 4], dtype=torch.int32, device="cpu")
-        assert torch.equal(next_token_ids, expected_next_tokens)
+        expected_next_tokens = torch.tensor([103, 2000, 3000, 4000], dtype=torch.int32, device="cpu")
+        assert torch.equal(next_token_ids, expected_next_tokens)
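The fallback described in this comment can be sketched in plain Python. This is an illustration under stated assumptions, not the proposer's actual implementation: `select_next_tokens` is a hypothetical helper, and the sampled values and the set of discarded request indices are assumed so that the result matches the suggested expectation.

```python
def select_next_tokens(sampled, backup, discarded):
    """Hypothetical sketch of the fallback rule.

    For each request index i, use backup[i] if the request was discarded,
    otherwise keep the sampled token.
    """
    return [
        backup[i] if i in discarded else token
        for i, token in enumerate(sampled)
    ]


# Assumed inputs: request 0 is valid, requests 1-3 are discarded.
sampled = [103, 2, 3, 4]           # entries 1-3 are placeholders (assumption)
backup = [1000, 2000, 3000, 4000]  # backup tokens quoted in the review comment
discarded = {1, 2, 3}

next_tokens = select_next_tokens(sampled, backup, discarded)

if __name__ == "__main__":
    print(next_tokens)
```

Under these assumptions the result is [103, 2000, 3000, 4000]: the valid request keeps its sampled token and every discarded request takes its backup token.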

Signed-off-by: chenmenglong <[email protected]>

github-actions bot added the module:tests label Nov 24, 2025

gemini-code-assist bot reviewed Nov 24, 2025


dragondream-chen force-pushed the ut_mtp_proposer branch from 34c61ee to 560c144 Compare November 24, 2025 09:26

[CI] Add mtp_proposer ut

397fb64

Signed-off-by: chenmenglong <[email protected]>

dragondream-chen force-pushed the ut_mtp_proposer branch from 560c144 to 397fb64 Compare November 25, 2025 08:31



