Showing 3 of 3 projects
Exposes NVIDIA GPU metrics for Prometheus monitoring using the NVIDIA Data Center GPU Manager (DCGM).
Prometheus exporter for Lustre parallel filesystem metrics, enabling monitoring of OST, MDT, MGS, MDS, client, generic, LNET, and health data.
A Prometheus exporter that collects metrics from Slurm workload manager via its REST API.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.