Skip to content

[AutoDeploy]: Nemotron-6 8K/16K (ISL/OSL) on H100; BF16Β #8435

@nzmora-nvidia

Description

@nzmora-nvidia

πŸš€ The feature, motivation and pitch

capture nsys; compare to vLLM and identify issues.

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.

Metadata

Metadata

Assignees

Labels

AutoDeploy<NV> AutoDeploy Backend

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions