@JennyLiu-nv
Jenny Liu added 14 commits July 1, 2025 08:50
…put_len:2048,128-reqs:32-con:1 parameters

- Added 12 model variants with different precision configurations
- Includes LLaMA 3.1 8B, LLaMA 3.3 Nemotron Super 49B, LLaMA 3.3 70B, and Mixtral 8x7B variants
- Added fp8, fp4, float16, and bfloat16 precision variants
- All configurations use the PyTorch backend with the specified performance parameters
Signed-off-by: Jenny Liu <[email protected]>
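
For readers unfamiliar with the parameter suffix in the commit title, here is a minimal sketch of how a fragment such as `reqs:32-con:1` could be split into named values. It assumes the fragment is a `-`-separated list of `key:value` pairs and that comma-separated values (such as `2048,128`) form a list of integers. The function name `parse_perf_params` and the key `example_len` are hypothetical and are not taken from this PR or from the test harness.

```python
def parse_perf_params(fragment: str) -> dict:
    """Parse a 'key:value-key:value' fragment into a dict.

    Hypothetical helper for illustration only: assumes '-' separates
    pairs, ':' separates key from value, and ',' separates integers.
    """
    params = {}
    for part in fragment.split("-"):
        key, _, raw = part.partition(":")
        values = [int(v) for v in raw.split(",")]
        # A single value is stored as an int, several values as a list.
        params[key] = values if len(values) > 1 else values[0]
    return params


# The visible tail of the commit title:
print(parse_perf_params("reqs:32-con:1"))
# A hypothetical key carrying the two comma-separated lengths:
print(parse_perf_params("example_len:2048,128-reqs:32"))
```

This only covers the parameter fragment. Model names themselves contain hyphens, so a real test identifier would need the model-name portion separated out before this kind of split.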