Skip to content

Commit 360a14e

Browse files
author
Himani Anil Deshpande
committed
[Gb200] Add NVIDIA-IMEX verbose and Stats logs to Cloudwatch Group
[Gb200] Adding specific OS that we support for Gb200
1 parent 6c17bff commit 360a14e

File tree

1 file changed

+34
-0
lines changed

1 file changed

+34
-0
lines changed

cookbooks/aws-parallelcluster-environment/files/cloudwatch/cloudwatch_agent_config.json

Lines changed: 34 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -520,6 +520,40 @@
520520
],
521521
"feature_conditions": []
522522
},
523+
{
524+
"timestamp_format_key": "default",
525+
"file_path": "/var/log/nvidia-imex-stats.log",
526+
"log_stream_name": "nvidia-imex-stats",
527+
"schedulers": [
528+
"slurm"
529+
],
530+
"platforms": [
531+
"amazon",
532+
"rocky",
533+
"ubuntu"
534+
],
535+
"node_roles": [
536+
"ComputeFleet"
537+
],
538+
"feature_conditions": []
539+
},
540+
{
541+
"timestamp_format_key": "default",
542+
"file_path": "/var/log/nvidia-imex-verbose.log",
543+
"log_stream_name": "nvidia-imex-verbose",
544+
"schedulers": [
545+
"slurm"
546+
],
547+
"platforms": [
548+
"amazon",
549+
"rocky",
550+
"ubuntu"
551+
],
552+
"node_roles": [
553+
"ComputeFleet"
554+
],
555+
"feature_conditions": []
556+
},
523557
{
524558
"timestamp_format_key": "json",
525559
"file_path": "/var/log/parallelcluster/slurm_health_check.events",

0 commit comments

Comments
 (0)