Metrics Details
This page lists and describes all metrics collected by AIBooster. The panel library provides 101 panels organized into the following categories.
A unit of "-" denotes a dimensionless value.
GPU Metrics (DCGM)
Basic GPU Information (11 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
DCGM_FI_DEV_GPU_UTIL | GPU Utilization | GPU utilization percentage | % |
DCGM_FI_DEV_GPU_TEMP | GPU Temperature | GPU temperature | ℃ |
DCGM_FI_DEV_POWER_USAGE | GPU Power Usage | GPU power consumption | Watt |
DCGM_FI_DEV_FB_USED | GPU Memory Used | Framebuffer memory used | Bytes |
DCGM_FI_DEV_FB_FREE | GPU Memory Free | Framebuffer memory free | Bytes |
DCGM_FI_DEV_MEM_CLOCK | GPU Memory Clock | Memory clock frequency | MHz |
DCGM_FI_DEV_SM_CLOCK | GPU SM Clock | Streaming multiprocessor clock frequency | MHz |
DCGM_FI_DEV_MEMORY_TEMP | GPU Memory Temperature | GPU memory temperature | ℃ |
DCGM_FI_DEV_MEM_COPY_UTIL | GPU Memory Copy Utilization | GPU memory copy utilization | % |
DCGM_FI_DEV_ROW_REMAP_FAILURE | GPU Row Remap Failure | Whether GPU memory row remapping has failed | - |
DCGM_FI_DEV_VGPU_LICENSE_STATUS | vGPU License Status | vGPU license status | - |
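The two framebuffer metrics above can be combined into a single usage percentage. The following is a minimal sketch, assuming both samples are in the same unit and that used plus free approximates the total (any reserved framebuffer memory is ignored). `fb_used_percent` is a hypothetical helper for illustration, not part of AIBooster.

```python
def fb_used_percent(fb_used: float, fb_free: float) -> float:
    """Return framebuffer usage as a percentage of (used + free).

    Inputs are samples of DCGM_FI_DEV_FB_USED and DCGM_FI_DEV_FB_FREE
    for the same GPU, taken at the same time and in the same unit.
    """
    total = fb_used + fb_free
    if total <= 0:
        # No usable samples; report 0 rather than dividing by zero.
        return 0.0
    return 100.0 * fb_used / total
```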
Profiling Information (6 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
DCGM_FI_PROF_DRAM_ACTIVE | DRAM Active | DRAM utilization | % |
DCGM_FI_PROF_PCIE_RX_BYTES | PCIe RX Bytes | PCIe receive bytes | Bytes/sec |
DCGM_FI_PROF_PCIE_TX_BYTES | PCIe TX Bytes | PCIe transmit bytes | Bytes/sec |
DCGM_FI_PROF_SM_ACTIVE | SM Active | Streaming multiprocessor utilization | % |
DCGM_FI_PROF_SM_OCCUPANCY | SM Occupancy | Streaming multiprocessor occupancy | % |
DCGM_FI_PROF_PIPE_TENSOR_ACTIVE | Tensor Core Active | Tensor core utilization | % |
System Metrics (Node Exporter)
CPU & Load (7 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_load1 | Load Average 1m | 1-minute system load average | - |
node_load5 | Load Average 5m | 5-minute system load average | - |
node_load15 | Load Average 15m | 15-minute system load average | - |
node_cpu_frequency_max_hertz | CPU Frequency Max | Maximum CPU frequency | Hz |
node_cpu_frequency_min_hertz | CPU Frequency Min | Minimum CPU frequency | Hz |
node_cpu_scaling_frequency_hertz | CPU Scaling Frequency | Current CPU operating frequency | Hz |
node_cpu_scaling_governor | CPU Governor | CPU governor setting status | - |
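Load averages are not normalized by core count, so the same value can mean very different things on different nodes. A small sketch of the usual normalization follows. The `cpu_count` argument is an assumption: it is not one of the metrics listed above and must be supplied by the caller.

```python
def load_per_core(load_avg: float, cpu_count: int) -> float:
    """Normalize a load average (e.g. node_load1) by the number of CPU cores.

    Values persistently above 1.0 suggest more runnable work than cores.
    """
    if cpu_count <= 0:
        raise ValueError("cpu_count must be positive")
    return load_avg / cpu_count
```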
Memory (9 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_memory_MemTotal_bytes | Node Memory Total | Total memory capacity | Bytes |
node_memory_MemAvailable_bytes | Node Memory Available | Available memory capacity | Bytes |
node_memory_MemFree_bytes | Node Memory Free | Free memory capacity | Bytes |
node_memory_Active_bytes | Node Memory Active | Active memory usage | Bytes |
node_memory_Inactive_bytes | Node Memory Inactive | Inactive memory usage | Bytes |
node_memory_Cached_bytes | Node Memory Cached | Cache memory usage | Bytes |
node_memory_Buffers_bytes | Node Memory Buffers | Buffer memory usage | Bytes |
node_memory_SwapTotal_bytes | Node Swap Total | Total swap capacity | Bytes |
node_memory_SwapFree_bytes | Node Swap Free | Free swap capacity | Bytes |
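Memory and swap usage are not exported directly but can be derived from the metrics above. This is a sketch under the common convention that "used" means total minus available. The helper names are hypothetical.

```python
def memory_used_percent(mem_total: float, mem_available: float) -> float:
    """Percentage of memory in use, from node_memory_MemTotal_bytes
    and node_memory_MemAvailable_bytes."""
    if mem_total <= 0:
        return 0.0
    return 100.0 * (mem_total - mem_available) / mem_total


def swap_used_bytes(swap_total: float, swap_free: float) -> float:
    """Swap in use, from node_memory_SwapTotal_bytes and
    node_memory_SwapFree_bytes. Clamped at zero to absorb sampling skew."""
    return max(swap_total - swap_free, 0.0)
```

MemAvailable is used here rather than MemFree because it accounts for reclaimable cache and buffers.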
Filesystem (5 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_filesystem_size_bytes | Filesystem Size | Total filesystem capacity | Bytes |
node_filesystem_avail_bytes | Filesystem Available | Available filesystem capacity | Bytes |
node_filesystem_free_bytes | Filesystem Free | Free filesystem capacity | Bytes |
node_filesystem_files | Filesystem Files Total | Total inode count | - |
node_filesystem_files_free | Filesystem Files Free | Free inode count | - |
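Usage percentages for space and inodes can be derived from the metrics above. The sketch below assumes the samples belong to the same mount point. It uses node_filesystem_avail_bytes (space available to unprivileged users) rather than node_filesystem_free_bytes (which also includes blocks reserved for root). The helper names are hypothetical.

```python
def filesystem_used_percent(size_bytes: float, avail_bytes: float) -> float:
    """Percentage of capacity not available to unprivileged users."""
    if size_bytes <= 0:
        return 0.0
    return 100.0 * (size_bytes - avail_bytes) / size_bytes


def inode_used_percent(files: float, files_free: float) -> float:
    """Percentage of inodes in use, from node_filesystem_files and
    node_filesystem_files_free."""
    if files <= 0:
        # Some filesystems report zero inodes; treat as "not applicable".
        return 0.0
    return 100.0 * (files - files_free) / files
```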
Network (4 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_network_info | Network Info | Network interface information | - |
node_network_up | Network Up | Network interface operational status (1 = up, 0 = down) | - |
node_network_speed_bytes | Network Speed | Network speed | Bytes/sec |
node_network_mtu_bytes | Network MTU | Maximum Transmission Unit | Bytes |
Processes (2 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_procs_running | Processes Running | Running process count | - |
node_procs_blocked | Processes Blocked | Blocked process count | - |
File Descriptors & ARP (3 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_filefd_allocated | File Descriptors Allocated | Allocated file descriptor count | - |
node_filefd_maximum | File Descriptors Maximum | Maximum file descriptor count | - |
node_arp_entries | ARP Entries | ARP table entry count | - |
System Boot Time (1 panel)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_boot_time_seconds | Boot Time | System boot time (Unix timestamp) | Seconds |
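node_boot_time_seconds is exported as a Unix timestamp rather than a duration, so uptime has to be computed against the current time. A minimal sketch follows. `uptime_seconds` and its `now` parameter are hypothetical names used for illustration.

```python
import time
from typing import Optional


def uptime_seconds(boot_time_seconds: float, now: Optional[float] = None) -> float:
    """Return seconds elapsed since boot.

    `now` defaults to the local clock; pass the sample timestamp instead
    when the collector and the monitored node may have clock skew.
    """
    if now is None:
        now = time.time()
    return max(now - boot_time_seconds, 0.0)
```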
ZFS Related (16 panels)
ARC Cache (10 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_zfs_arc_size | ZFS ARC Size | Current ARC cache size | Bytes |
node_zfs_arc_c | ZFS ARC C | ARC target size | Bytes |
node_zfs_arc_c_max | ZFS ARC C Max | ARC maximum size | Bytes |
node_zfs_arc_c_min | ZFS ARC C Min | ARC minimum size | Bytes |
node_zfs_arc_hits | ZFS ARC Hits | ARC cache hit count | - |
node_zfs_arc_misses | ZFS ARC Misses | ARC cache miss count | - |
node_zfs_arc_mfu_hits | ZFS ARC MFU Hits | Most Frequently Used hit count | - |
node_zfs_arc_mru_hits | ZFS ARC MRU Hits | Most Recently Used hit count | - |
node_zfs_arc_demand_data_hits | ZFS ARC Demand Data Hits | Demand data hit count | - |
node_zfs_arc_demand_data_misses | ZFS ARC Demand Data Misses | Demand data miss count | - |
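The hit and miss metrics above are cumulative counters, so a hit ratio is more meaningful when computed from the change between two samples than from the raw totals. The following sketch assumes two samples of node_zfs_arc_hits and node_zfs_arc_misses taken some interval apart. `arc_hit_ratio` is a hypothetical helper.

```python
from typing import Optional


def arc_hit_ratio(hits_prev: float, misses_prev: float,
                  hits_now: float, misses_now: float) -> Optional[float]:
    """Return the ARC hit ratio (0.0 to 1.0) over the sampling interval.

    Returns None when there was no ARC activity or when a counter went
    backwards (for example after a reboot reset it).
    """
    d_hits = hits_now - hits_prev
    d_misses = misses_now - misses_prev
    total = d_hits + d_misses
    if d_hits < 0 or d_misses < 0 or total == 0:
        return None
    return d_hits / total
```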
ZFS Pool (6 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
node_zfs_zpool_state | ZFS Pool State | ZFS pool status | - |
node_zfs_zpool_dataset_nread | ZFS Dataset Reads | Bytes read from the dataset | Bytes |
node_zfs_zpool_dataset_nwritten | ZFS Dataset Writes | Bytes written to the dataset | Bytes |
node_zfs_zpool_dataset_reads | ZFS Dataset Read Bytes | Dataset read operation count | - |
node_zfs_zpool_dataset_writes | ZFS Dataset Write Bytes | Dataset write operation count | - |
node_zfs_zpool_dataset_nunlinks | ZFS Dataset Unlinks | Dataset unlink count | - |
Process & Application Metrics
Go Applications (18 panels)
Basic Information (4 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
go_info | Go Info | Go language version information | - |
go_goroutines | Go Goroutines | Number of goroutines that currently exist | - |
go_threads | Go Threads | Number of OS threads created | - |
go_sched_gomaxprocs_threads | Go MAXPROCS | GOMAXPROCS setting value | - |
Garbage Collection (2 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
go_gc_gogc_percent | Go GC Percent | GOGC setting value | % |
go_gc_gomemlimit_bytes | Go Memory Limit | GOMEMLIMIT setting value | Bytes |
Memory Statistics (12 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
go_memstats_alloc_bytes | Go Memory Allocated | Heap memory allocated and still in use | Bytes |
go_memstats_sys_bytes | Go Memory System | Total memory obtained from the OS | Bytes |
go_memstats_heap_alloc_bytes | Go Heap Allocated | Heap allocated memory | Bytes |
go_memstats_heap_sys_bytes | Go Heap System | Heap system memory | Bytes |
go_memstats_heap_idle_bytes | Go Heap Idle | Heap idle memory | Bytes |
go_memstats_heap_inuse_bytes | Go Heap In Use | Heap in-use memory | Bytes |
go_memstats_heap_released_bytes | Go Heap Released | Heap released memory | Bytes |
go_memstats_heap_objects | Go Heap Objects | Heap object count | - |
go_memstats_stack_inuse_bytes | Go Stack In Use | Stack in-use memory | Bytes |
go_memstats_stack_sys_bytes | Go Stack System | Stack system memory | Bytes |
go_memstats_mspan_inuse_bytes | Go MSpan In Use | MSpan in-use memory | Bytes |
go_memstats_mspan_sys_bytes | Go MSpan System | MSpan system memory | Bytes |
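Some useful heap indicators are differences or ratios of the metrics above. As a sketch: idle heap minus released heap estimates memory the Go runtime is retaining but could return to the OS, and in-use heap over heap system memory indicates how much of the heap's reservation holds live spans. The helper names are hypothetical.

```python
def heap_retained_idle_bytes(heap_idle: float, heap_released: float) -> float:
    """Idle heap memory still held by the runtime, from
    go_memstats_heap_idle_bytes and go_memstats_heap_released_bytes."""
    return max(heap_idle - heap_released, 0.0)


def heap_inuse_fraction(heap_inuse: float, heap_sys: float) -> float:
    """Fraction of heap system memory that is in use, from
    go_memstats_heap_inuse_bytes and go_memstats_heap_sys_bytes."""
    if heap_sys <= 0:
        return 0.0
    return heap_inuse / heap_sys
```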
Process Information (6 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
process_resident_memory_bytes | Process Resident Memory | Process physical memory usage | Bytes |
process_virtual_memory_bytes | Process Virtual Memory | Process virtual memory usage | Bytes |
process_virtual_memory_max_bytes | Process Virtual Memory Max | Process maximum virtual memory | Bytes |
process_open_fds | Process Open FDs | Process open file descriptor count | - |
process_max_fds | Process Max FDs | Process maximum file descriptor count | - |
process_start_time_seconds | Process Start Time | Process start time (Unix timestamp) | Seconds |
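A process that approaches its file descriptor limit will start failing to open files and sockets, so the ratio of the two descriptor metrics above is a common thing to watch. A minimal sketch, with `fd_usage_ratio` as a hypothetical helper:

```python
from typing import Optional


def fd_usage_ratio(open_fds: float, max_fds: float) -> Optional[float]:
    """Return process_open_fds / process_max_fds, or None if the limit
    is missing or non-positive."""
    if max_fds <= 0:
        return None
    return open_fds / max_fds
```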
Scraping Statistics (5 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
scrape_duration_seconds | Scrape Duration | Time taken to scrape the target | Seconds |
scrape_samples_scraped | Scrape Samples Scraped | Collected sample count | - |
scrape_samples_post_metric_relabeling | Scrape Samples Post Relabeling | Post-relabeling sample count | - |
scrape_series_added | Scrape Series Added | Added time series count | - |
up | Scrape Up | Scrape success status (1 = success, 0 = failure) | - |
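The difference between the scraped and post-relabeling sample counts shows how many samples metric relabeling rules dropped in a scrape. A small sketch, with a hypothetical helper name:

```python
def samples_dropped_by_relabeling(scraped: int, post_relabeling: int) -> int:
    """Samples removed by metric relabeling in one scrape, from
    scrape_samples_scraped and scrape_samples_post_metric_relabeling."""
    return max(scraped - post_relabeling, 0)
```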
Other System Metrics
Memory Cache (6 panels)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
go_memstats_mcache_inuse_bytes | Go MCache In Use | MCache in-use memory | Bytes |
go_memstats_mcache_sys_bytes | Go MCache System | MCache system memory | Bytes |
go_memstats_gc_sys_bytes | Go GC System | GC system memory | Bytes |
go_memstats_other_sys_bytes | Go Other System | Other system memory | Bytes |
go_memstats_buck_hash_sys_bytes | Go Bucket Hash System | Bucket hash system memory | Bytes |
go_memstats_next_gc_bytes | Go Next GC | Next GC threshold | Bytes |
GC Statistics (1 panel)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
go_memstats_last_gc_time_seconds | Go Last GC Time | Time of the last garbage collection (Unix timestamp) | Seconds |
HTTP Statistics (1 panel)
Metric Name | Panel Name | Description | Unit |
---|---|---|---|
promhttp_metric_handler_requests_in_flight | HTTP Requests In Flight | In-flight HTTP request count | - |