150398: cli(debug.zip): improve tsdump upload speed r=aa-joshi a=aa-joshi
Previously, uploading a tsdump to Datadog took longer than staging the same tsdump in roachprod. This was inadequate because it increased the MTTD (Mean Time To Detect) for an issue. This change introduces the `upload-workers` flag, which sets the number of Datadog upload workers. The default value is 50. This change also adjusts the retry configuration to further improve performance.
Epic: None
Part of: CRDB-52094
Release note: None
-----
tsdump size: 11.19GB
roachprod upload time:
<img width="980" height="77" alt="Screenshot 2025-07-17 at 12 04 01 PM" src="https://github.com/user-attachments/assets/718d5f2b-142e-438c-9008-6739295e6930" />
Tsdump upload time (before changes):
<img width="904" height="17" alt="Screenshot 2025-07-17 at 11 52 37 AM" src="https://github.com/user-attachments/assets/9b844a58-7b5b-4e67-bad0-142f6c37673b" />
Tsdump upload time with default (after changes):
<img width="903" height="20" alt="Screenshot 2025-07-17 at 11 51 34 AM" src="https://github.com/user-attachments/assets/e6556e2a-d9bd-411b-acd1-591cc9971784" />
Co-authored-by: Akshay Joshi <[email protected]>
pkg/cli/debug.go: 1 addition & 0 deletions
@@ -1617,6 +1617,7 @@ func init() {
 	f.StringVar(&debugTimeSeriesDumpOpts.userName, "user-name", "", "name of the user to perform datadog upload")
 	f.StringVar(&debugTimeSeriesDumpOpts.storeToNodeMapYAMLFile, "store-to-node-map-file", "", "yaml file path which contains the mapping of store ID to node ID for datadog upload.")
 	f.BoolVar(&debugTimeSeriesDumpOpts.dryRun, "dry-run", false, "run in dry-run mode without making any actual uploads")
+	f.IntVar(&debugTimeSeriesDumpOpts.noOfUploadWorkers, "upload-workers", 50, "number of workers to upload the time series data in parallel")