accelerator-nvidia-bad-envs: Tracks any bad environment variables that are globally set for the NVIDIA GPUs.accelerator-nvidia-clock-speed: Tracks the per-GPU clock speed.accelerator-nvidia-ecc: Tracks the NVIDIA per-GPU ECC errors and other ECC related information.accelerator-nvidia-error-sxid: Tracks the NVIDIA GPU SXid errors scanning the kmsg -- see fabric manager documentation.accelerator-nvidia-error-xid: Tracks the NVIDIA GPU Xid errors scanning the kmsg and using the NVIDIA Management Library (NVML) -- see Xid messages.accelerator-nvidia-fabric-manager: Tracks the fabric manager version and its activeness.accelerator-nvidia-gpm: Monitors the NVIDIA per-GPU GPM metrics.accelerator-nvidia-gsp-firmware: Tracks the GSP firmware mode.accelerator-nvidia-hw-slowdown: Monitors NVIDIA GPU hardware slowdown clock events of all GPUs.accelerator-nvidia-infiniband: Monitors the infiniband status of the system and Mellanox kernel events. Optional, enabled if the host has NVIDIA GPUs.accelerator-nvidia-memory: Monitors the NVIDIA per-GPU memory usage.accelerator-nvidia-nccl: Monitors the NCCL (NVIDIA Collective Communications Library) status. Optional, enabled if the host has NVIDIA GPUs.accelerator-nvidia-nvlink: Monitors the NVIDIA per-GPU nvlink devices.accelerator-nvidia-peermem: Monitors the peermem module status. Optional, enabled if the host has NVIDIA GPUs.accelerator-nvidia-persistence-mode: Tracks the NVIDIA persistence mode.accelerator-nvidia-power: Tracks the NVIDIA per-GPU power usage.accelerator-nvidia-processes: Tracks the NVIDIA per-GPU processes.accelerator-nvidia-remapped-rows: Tracks the NVIDIA per-GPU remapped rows (which indicates whether to reset the GPU or not).accelerator-nvidia-temperature: Tracks the NVIDIA per-GPU temperatures.accelerator-nvidia-utilization: Tracks the NVIDIA per-GPU utilization.containerd: Tracks the current containerd status.cpu: Tracks the combined usage of all CPUs (not per-CPU).disk: Tracks the disk usage of all the mount points specified in the configuration.docker: Tracks the current containers from the docker runtime.fuse: Tracks the FUSE connections.kernel-module: Monitors the FUSE (Filesystem in Userspace).kubelet: Tracks the kubelet status.library: Checks system libraries such as "libnvidia-ml.so" and "libcuda.so", if applicable.memory: Tracks the memory usage of the host.network-latency: Tracks global network connectivity statistics.nfs: Tracks the NFS volume healthiness.os: Queries the host OS information (e.g., kernel version, file descriptor usage).pci: Tracks the PCI devices and their Access Control Services (ACS) status.tailscale: Tracks the tailscale state (e.g., version) if available.