 | Plugin | Collection | Analysis | DataModel | Collector | Analyzer |
 | --- | --- | --- | --- | --- | --- |
-| AmdSmiPlugin | amd-smi firmware --json<br>amd-smi list --json<br>amd-smi partition --json<br>amd-smi process --json<br>amd-smi static -g all --json<br>amd-smi version --json | **Analyzer Args:**<br>- `check_static_data`: bool<br>- `expected_gpu_processes`: Optional[int]<br>- `expected_max_power`: Optional[int]<br>- `expected_driver_version`: Optional[str]<br>- `expected_memory_partition_mode`: Optional[str]<br>- `expected_compute_partition_mode`: Optional[str]<br>- `expected_pldm_version`: Optional[str]<br>- `l0_to_recovery_count_error_threshold`: Optional[int]<br>- `l0_to_recovery_count_warning_threshold`: Optional[int]<br>- `vendorid_ep`: Optional[str]<br>- `vendorid_ep_vf`: Optional[str]<br>- `devid_ep`: Optional[str]<br>- `devid_ep_vf`: Optional[str]<br>- `sku_name`: Optional[str] | [AmdSmiDataModel](#AmdSmiDataModel-Model) | [AmdSmiCollector](#Collector-Class-AmdSmiCollector) | [AmdSmiAnalyzer](#Data-Analyzer-Class-AmdSmiAnalyzer) |
+| AmdSmiPlugin | firmware --json<br>list --json<br>partition --json<br>process --json<br>ras --cper --folder={folder}<br>static -g all --json<br>static -g {gpu_id} --json<br>version --json | **Analyzer Args:**<br>- `check_static_data`: bool<br>- `expected_gpu_processes`: Optional[int]<br>- `expected_max_power`: Optional[int]<br>- `expected_driver_version`: Optional[str]<br>- `expected_memory_partition_mode`: Optional[str]<br>- `expected_compute_partition_mode`: Optional[str]<br>- `expected_pldm_version`: Optional[str]<br>- `l0_to_recovery_count_error_threshold`: Optional[int]<br>- `l0_to_recovery_count_warning_threshold`: Optional[int]<br>- `vendorid_ep`: Optional[str]<br>- `vendorid_ep_vf`: Optional[str]<br>- `devid_ep`: Optional[str]<br>- `devid_ep_vf`: Optional[str]<br>- `sku_name`: Optional[str]<br>- `expected_xgmi_speed`: Optional[list[float]]<br>- `analysis_range_start`: Optional[datetime.datetime]<br>- `analysis_range_end`: Optional[datetime.datetime] | [AmdSmiDataModel](#AmdSmiDataModel-Model) | [AmdSmiCollector](#Collector-Class-AmdSmiCollector) | [AmdSmiAnalyzer](#Data-Analyzer-Class-AmdSmiAnalyzer) |
 | BiosPlugin | sh -c 'cat /sys/devices/virtual/dmi/id/bios_version'<br>wmic bios get SMBIOSBIOSVersion /Value | **Analyzer Args:**<br>- `exp_bios_version`: list[str]<br>- `regex_match`: bool | [BiosDataModel](#BiosDataModel-Model) | [BiosCollector](#Collector-Class-BiosCollector) | [BiosAnalyzer](#Data-Analyzer-Class-BiosAnalyzer) |
 | CmdlinePlugin | cat /proc/cmdline | **Analyzer Args:**<br>- `required_cmdline`: Union[str, list]<br>- `banned_cmdline`: Union[str, list] | [CmdlineDataModel](#CmdlineDataModel-Model) | [CmdlineCollector](#Collector-Class-CmdlineCollector) | [CmdlineAnalyzer](#Data-Analyzer-Class-CmdlineAnalyzer) |
 | DeviceEnumerationPlugin | lscpu \| grep Socket \| awk '{ print $2 }'<br>powershell -Command "(Get-WmiObject -Class Win32_Processor \| Measure-Object).Count"<br>lspci -d {vendorid_ep}: \| grep -i 'VGA\\ | Display\\ | 3D' \| wc -l<br>powershell -Command "(wmic path win32_VideoController get name \| findstr AMD \| Measure-Object).Count"<br>lspci -d {vendorid_ep}: \| grep -i 'Virtual Function' \| wc -l<br>powershell -Command "(Get-VMHostPartitionableGpu \| Measure-Object).Count" | **Analyzer Args:**<br>- `cpu_count`: Optional[list[int]]<br>- `gpu_count`: Optional[list[int]]<br>- `vf_count`: Optional[list[int]] | [DeviceEnumerationDataModel](#DeviceEnumerationDataModel-Model) | [DeviceEnumerationCollector](#Collector-Class-DeviceEnumerationCollector) | [DeviceEnumerationAnalyzer](#Data-Analyzer-Class-DeviceEnumerationAnalyzer) |
 | JournalPlugin | journalctl --no-pager --system --output=short-iso | - | [JournalData](#JournalData-Model) | [JournalCollector](#Collector-Class-JournalCollector) | - |
 | KernelPlugin | sh -c 'uname -a'<br>wmic os get Version /Value | **Analyzer Args:**<br>- `exp_kernel`: Union[str, list]<br>- `regex_match`: bool | [KernelDataModel](#KernelDataModel-Model) | [KernelCollector](#Collector-Class-KernelCollector) | [KernelAnalyzer](#Data-Analyzer-Class-KernelAnalyzer) |
 | KernelModulePlugin | cat /proc/modules<br>wmic os get Version /Value | **Analyzer Args:**<br>- `kernel_modules`: dict[str, dict]<br>- `regex_filter`: list[str] | [KernelModuleDataModel](#KernelModuleDataModel-Model) | [KernelModuleCollector](#Collector-Class-KernelModuleCollector) | [KernelModuleAnalyzer](#Data-Analyzer-Class-KernelModuleAnalyzer) |
-| MemoryPlugin | free -b<br>wmic OS get FreePhysicalMemory /Value; wmic ComputerSystem get TotalPhysicalMemory /Value | - | [MemoryDataModel](#MemoryDataModel-Model) | [MemoryCollector](#Collector-Class-MemoryCollector) | [MemoryAnalyzer](#Data-Analyzer-Class-MemoryAnalyzer) |
+| MemoryPlugin | free -b<br>/usr/bin/lsmem<br>wmic OS get FreePhysicalMemory /Value; wmic ComputerSystem get TotalPhysicalMemory /Value | - | [MemoryDataModel](#MemoryDataModel-Model) | [MemoryCollector](#Collector-Class-MemoryCollector) | [MemoryAnalyzer](#Data-Analyzer-Class-MemoryAnalyzer) |
 | NvmePlugin | nvme smart-log {dev}<br>nvme error-log {dev} --log-entries=256<br>nvme id-ctrl {dev}<br>nvme id-ns {dev}{ns}<br>nvme fw-log {dev}<br>nvme self-test-log {dev}<br>nvme get-log {dev} --log-id=6 --log-len=512<br>nvme telemetry-log {dev} --output-file={dev}_{f_name} | - | [NvmeDataModel](#NvmeDataModel-Model) | [NvmeCollector](#Collector-Class-NvmeCollector) | - |
 | OsPlugin | sh -c '( lsb_release -ds \|\| (cat /etc/*release \| grep PRETTY_NAME) \|\| uname -om ) 2>/dev/null \| head -n1'<br>cat /etc/*release \| grep VERSION_ID<br>wmic os get Version /value<br>wmic os get Caption /Value | **Analyzer Args:**<br>- `exp_os`: Union[str, list]<br>- `exact_match`: bool | [OsDataModel](#OsDataModel-Model) | [OsCollector](#Collector-Class-OsCollector) | [OsAnalyzer](#Data-Analyzer-Class-OsAnalyzer) |
 | PackagePlugin | dnf list --installed<br>dpkg-query -W<br>pacman -Q<br>cat /etc/*release<br>wmic product get name,version | **Analyzer Args:**<br>- `exp_package_ver`: Dict[str, Optional[str]]<br>- `regex_match`: bool | [PackageDataModel](#PackageDataModel-Model) | [PackageCollector](#Collector-Class-PackageCollector) | [PackageAnalyzer](#Data-Analyzer-Class-PackageAnalyzer) |
@@ -42,25 +42,29 @@ Class for collection of inband tool amd-smi data.

 - **AMD_SMI_EXE**: `amd-smi`
 - **SUPPORTED_OS_FAMILY**: `{<OSFamily.LINUX: 3>}`
-- **CMD_VERSION**: `amd-smi version --json`
-- **CMD_LIST**: `amd-smi list --json`
-- **CMD_PROCESS**: `amd-smi process --json`
-- **CMD_PARTITION**: `amd-smi partition --json`
-- **CMD_FIRMWARE**: `amd-smi firmware --json`
-- **CMD_STATIC**: `amd-smi static -g all --json`
+- **CMD_VERSION**: `version --json`
+- **CMD_LIST**: `list --json`
+- **CMD_PROCESS**: `process --json`
+- **CMD_PARTITION**: `partition --json`
+- **CMD_FIRMWARE**: `firmware --json`
+- **CMD_STATIC**: `static -g all --json`
+- **CMD_STATIC_GPU**: `static -g {gpu_id} --json`
+- **CMD_RAS**: `ras --cper --folder={folder}`

 ### Provides Data

 AmdSmiDataModel

 ### Commands

-- amd-smi firmware --json
-- amd-smi list --json
-- amd-smi partition --json
-- amd-smi process --json
-- amd-smi static -g all --json
-- amd-smi version --json
+- firmware --json
+- list --json
+- partition --json
+- process --json
+- ras --cper --folder={folder}
+- static -g all --json
+- static -g {gpu_id} --json
+- version --json

 ## Collector Class BiosCollector
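In the `AmdSmiCollector` hunk above, the `amd-smi` prefix is dropped from every command constant, while `AMD_SMI_EXE` still holds the executable name and two constants gain `{gpu_id}` and `{folder}` placeholders. One plausible reading is that the collector joins the executable and the filled-in template at call time. The sketch below illustrates that idea only: the constant names come from the diff, but `build_command` is a hypothetical helper, not a function confirmed to exist in nodescraper, and `/tmp/cper` is an arbitrary example path.

```python
# Constants copied from the documented class variables.
AMD_SMI_EXE = "amd-smi"
CMD_VERSION = "version --json"
CMD_STATIC_GPU = "static -g {gpu_id} --json"
CMD_RAS = "ras --cper --folder={folder}"


def build_command(template: str, **params: object) -> str:
    """Hypothetical helper: fill a subcommand template and prefix the executable."""
    return f"{AMD_SMI_EXE} {template.format(**params)}"


print(build_command(CMD_VERSION))                  # amd-smi version --json
print(build_command(CMD_STATIC_GPU, gpu_id=0))     # amd-smi static -g 0 --json
print(build_command(CMD_RAS, folder="/tmp/cper"))  # amd-smi ras --cper --folder=/tmp/cper
```

Keeping the executable separate from the subcommand strings would let a single override of `AMD_SMI_EXE` (for example, an absolute path) apply to every command.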
@@ -300,6 +304,7 @@ Collect memory usage details

 - **CMD_WINDOWS**: `wmic OS get FreePhysicalMemory /Value; wmic ComputerSystem get TotalPhysicalMemory /Value`
 - **CMD**: `free -b`
+- **CMD_LSMEM**: `/usr/bin/lsmem`

 ### Provides Data

@@ -308,6 +313,7 @@ MemoryDataModel
 ### Commands

 - free -b
+- /usr/bin/lsmem
 - wmic OS get FreePhysicalMemory /Value; wmic ComputerSystem get TotalPhysicalMemory /Value

 ## Collector Class NvmeCollector
@@ -646,10 +652,15 @@ Data model for amd-smi data.
 - **gpu_list**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.AmdSmiListItem]]`
 - **partition**: `Optional[nodescraper.plugins.inband.amdsmi.amdsmidata.Partition]`
 - **process**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.Processes]]`
+- **topology**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.Topo]]`
 - **firmware**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.Fw]]`
 - **bad_pages**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.BadPages]]`
 - **static**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.AmdSmiStatic]]`
 - **metric**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.AmdSmiMetric]]`
+- **xgmi_metric**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.XgmiMetrics]]`
+- **xgmi_link**: `Optional[list[nodescraper.plugins.inband.amdsmi.amdsmidata.XgmiLinks]]`
+- **cper_data**: `Optional[list[nodescraper.models.datamodel.FileModel]]`
+- **amdsmitst_data**: `nodescraper.plugins.inband.amdsmi.amdsmidata.AmdSmiTstData`

 ## BiosDataModel Model

@@ -763,6 +774,7 @@ Data model for journal logs

 - **mem_free**: `str`
 - **mem_total**: `str`
+- **lsmem_output**: `Optional[dict]`

 ## NvmeDataModel Model
@@ -915,7 +927,11 @@ Data model for in band syslog logs

 ## Data Analyzer Class AmdSmiAnalyzer

-**Bases**: ['DataAnalyzer']
+### Description
+
+Check AMD SMI Application data for PCIe, ECC errors, CPER data, and analyze amdsmitst metrics
+
+**Bases**: ['CperAnalysisTaskMixin', 'DataAnalyzer']

 **Link to code**: [amdsmi_analyzer.py](https://github.com/amd/node-scraper/blob/HEAD/nodescraper/plugins/inband/amdsmi/amdsmi_analyzer.py)

@@ -1213,6 +1229,9 @@ Check sysctl matches expected sysctl details
 - **devid_ep**: `Optional[str]`
 - **devid_ep_vf**: `Optional[str]`
 - **sku_name**: `Optional[str]`
+- **expected_xgmi_speed**: `Optional[list[float]]`
+- **analysis_range_start**: `Optional[datetime.datetime]`
+- **analysis_range_end**: `Optional[datetime.datetime]`

 ## Analyzer Args Class BiosAnalyzerArgs
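The analyzer-args hunk above adds three optional fields: `expected_xgmi_speed`, `analysis_range_start`, and `analysis_range_end`. The diff gives only their types, so the following is a rough sketch of how such fields might be populated and how a start/end pair could bound a time window. `AnalyzerArgsSketch` and `in_analysis_range` are hypothetical stand-ins, the inclusive-window behaviour is an assumption, and the speed values are placeholders with no claimed unit.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass
class AnalyzerArgsSketch:
    """Hypothetical stand-in holding only the three fields added in this diff."""

    expected_xgmi_speed: Optional[list[float]] = None
    analysis_range_start: Optional[datetime] = None
    analysis_range_end: Optional[datetime] = None


def in_analysis_range(ts: datetime, args: AnalyzerArgsSketch) -> bool:
    """Assumed semantics: an unset bound is open, set bounds are inclusive."""
    if args.analysis_range_start is not None and ts < args.analysis_range_start:
        return False
    if args.analysis_range_end is not None and ts > args.analysis_range_end:
        return False
    return True


args = AnalyzerArgsSketch(
    expected_xgmi_speed=[32.0, 36.0],  # placeholder values for illustration
    analysis_range_start=datetime(2025, 1, 1, tzinfo=timezone.utc),
    analysis_range_end=datetime(2025, 1, 2, tzinfo=timezone.utc),
)

print(in_analysis_range(datetime(2025, 1, 1, 12, tzinfo=timezone.utc), args))  # True
print(in_analysis_range(datetime(2025, 1, 3, tzinfo=timezone.utc), args))      # False
```

Using timezone-aware datetimes on both sides avoids the `TypeError` Python raises when comparing naive and aware values.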