Sv2 long running pipeline #4112
Pull Request Overview
This PR adds a new long-running test stage to the SwiftV2 pipeline that repeatedly creates and deletes PodNetwork (pn), PodNetworkInstance (pni), and Pod objects to test the datapath over extended periods.
Key changes:
- New Ginkgo test suite that runs indefinitely, cycling through resource creation and deletion every 35 minutes
- Helper functions for Azure resource queries and Kubernetes operations
- Pipeline configuration updates to run the new test stage with unlimited timeout
- Refactored VNet creation scripts to use loops and dynamic cluster naming
Reviewed Changes
Copilot reviewed 12 out of 13 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| test/integration/swiftv2/longRunningCluster/datapath_test.go | New test suite for long-running datapath tests with resource lifecycle management |
| test/integration/swiftv2/longRunningCluster/datapath.go | Helper functions to create Kubernetes resources from templates |
| test/integration/swiftv2/helpers/az_helpers.go | New helper functions for Azure CLI operations and Kubernetes resource management |
| test/integration/manifests/swiftv2/long-running-cluster/*.yaml | Kubernetes resource templates for PodNetwork, PodNetworkInstance, and Pods |
| hack/aks/Makefile | New targets for delegated subnet creation and dummy cluster provisioning |
| go.mod & go.sum | Updated Go version and dependencies, removed version pinning for Ginkgo/Gomega |
| .pipelines/swiftv2-long-running/template/long-running-pipeline-template.yaml | New test stage with unlimited timeout |
| .pipelines/swiftv2-long-running/scripts/create_vnets.sh | Refactored to use loops and create delegation clusters dynamically |
| .pipelines/swiftv2-long-running/scripts/create_aks.sh | Added kubeconfig export for test consumption |
| .pipelines/swiftv2-long-running/pipeline.yaml | Updated default VM size |
    for {
        iteration++
Copilot AI, Nov 10, 2025
The infinite loop lacks any graceful termination mechanism or error handling for catastrophic failures. If the test encounters repeated failures, it will continue indefinitely. Consider adding a context with cancellation or a maximum iteration count to allow controlled shutdown.
- Implemented scheduled pipeline running every 1 hour with persistent infrastructure
- Split test execution into 2 jobs: Create (with 20min wait) and Delete
- Added 8 test scenarios across 2 AKS clusters, 4 VNets, different subnets
- Implemented two-phase deletion strategy to prevent PNI ReservationInUse errors
- Added context timeouts on kubectl commands with force delete fallbacks
- Resource naming uses RG name as BUILD_ID for uniqueness across parallel setups
- Added SkipAutoDeleteTill tags to prevent automatic resource cleanup
- Conditional setup stages controlled by runSetupStages parameter
- Auto-generate RG name from location or allow custom names for parallel setups
- Added comprehensive README with setup instructions and troubleshooting
- Node selection by agentpool labels with usage tracking to prevent conflicts
- Kubernetes naming compliance (RFC 1123) for all resources
e690c0f to b07b697
23d306f to 84ef131
Reason for Change:
Adds a new stage to the long-running cluster pipeline that repeatedly creates and deletes PodNetwork (pn), PodNetworkInstance (pni), and Pod objects.
Long running test cluster pipeline
Requirements:
Notes: