Showing 3 of 3 projects
An open-source machine learning platform for distributed training, hyperparameter tuning, experiment tracking, and resource management.
A Python module for programmatically retrieving NVIDIA GPU status and selecting available GPUs based on memory and load.
A Terraform plugin for managing machine learning compute resources across AWS, GCP, Azure, and Kubernetes with spot instance recovery and auto-termination.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.