Skip to content

[E2E] Add dynamo profile for E2E testing framework #659

@Xunzhuo

Description

@Xunzhuo

Description

Add a new dynamo profile to the E2E testing framework to test Semantic Router integration with Nvidia Dynamo.

Background

The E2E testing framework introduced in #655 provides an extensible profile-based architecture. We need to add a dynamo profile to test Semantic Router deployment and functionality with Nvidia Dynamo optimization platform.

Tasks

  • Create e2e/profiles/dynamo/ directory structure
  • Implement Profile interface for dynamo
    • Setup: Deploy Nvidia Dynamo components
    • Setup: Deploy Semantic Router with Dynamo integration
    • Setup: Configure Dynamo optimization settings
    • Teardown: Clean up Dynamo resources
  • Implement test cases:
    • Basic health check with Dynamo
    • Optimized inference verification
    • Performance comparison (with/without Dynamo)
    • Dynamic batching testing
    • GPU utilization monitoring
  • Add documentation for dynamo profile usage
  • Update CI workflow to run dynamo tests

Implementation Details

Profile Structure

type Profile struct {
    verbose bool
}

func (p *Profile) Setup(ctx context.Context, opts *framework.SetupOptions) error {
    // 1. Deploy Nvidia Dynamo runtime
    // 2. Deploy Semantic Router with Dynamo backend
    // 3. Configure optimization settings (dynamic batching, kernel fusion)
    // 4. Configure GPU resource allocation
}

Test Cases

  1. Dynamo Health Check: Verify Dynamo runtime is healthy
  2. Optimized Inference: Test inference with Dynamo optimizations
  3. Performance Benchmark: Compare performance with/without Dynamo
  4. Dynamic Batching: Verify dynamic batching functionality
  5. GPU Utilization: Monitor GPU usage and efficiency

Acceptance Criteria

  • Dynamo profile can be run with make e2e-test PROFILE=dynamo
  • All test cases pass successfully
  • Documentation is complete and clear
  • CI integration works correctly
  • Performance improvements are documented

References

Related Issues

Part of the E2E testing framework expansion effort.

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions