A fast GPU-accelerated library for training Gradient Boosting Decision Trees (GBDT) and Random Forests.
ThunderGBM is a high-performance machine learning library that trains Gradient Boosting Decision Trees (GBDT) and Random Forests using GPU acceleration. It solves the computational bottleneck of these popular tree-based ensemble methods, delivering speedups often 10x faster than other libraries for large-scale data. The library provides a scikit-learn compatible Python interface and supports classification, regression, and ranking tasks.
Data scientists and machine learning practitioners working with large datasets who need to train GBDT or Random Forest models quickly, particularly those in competitive data science (e.g., Kaggle) or real-world applications requiring fast experimentation and deployment. It is also suitable for researchers and developers seeking GPU-accelerated implementations of these algorithms.
Developers choose ThunderGBM for its significant GPU-accelerated performance gains, often 10x faster than CPU-based alternatives, making it ideal for large-scale model training. Its seamless scikit-learn compatibility allows easy integration into existing Python workflows, while its optimized parallel computing implementations efficiently handle classification, regression, and ranking problems.
ThunderGBM: Fast GBDTs and Random Forests on GPUs
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Leverages CUDA-capable GPUs to achieve training speeds often 10x faster than CPU-based libraries, as demonstrated in benchmarks and the project's documentation.
Provides a Python interface fully compatible with scikit-learn for classifiers, regressors, and rankers, enabling seamless integration into existing machine learning pipelines.
Runs on both Linux and Windows with CUDA support, catering to a wide range of users in research and production environments.
Handles classification, regression, and ranking tasks with optimized implementations for GBDT and Random Forests, making it versatile for various predictive modeling needs.
Does not support MacOS and requires specific CUDA versions and build tools, which can exclude users on unsupported systems or those without GPU access.
For Windows, installation involves manually downloading wheel files, and building from source requires cmake and CUDA setup, making it less user-friendly compared to one-command pip installs for other libraries.
Only supports GBDT and Random Forests, lacking other machine learning algorithms, which may necessitate additional libraries for broader modeling tasks.