A Python library for time series forecasting using scikit-learn compatible machine learning models.
Skforecast is a Python library for time series forecasting using machine learning models. It provides a comprehensive toolkit that works with any estimator compatible with the scikit-learn API, such as LightGBM, XGBoost, and CatBoost. The library solves the problem of building and deploying reliable forecasting models by offering flexible workflows, feature engineering tools, and production-ready validation methods.
Data scientists, machine learning engineers, and developers who need to build and deploy time series forecasting models using Python and scikit-learn compatible algorithms. It's particularly useful for those working on projects requiring scalable and interpretable predictions.
Developers choose Skforecast for its seamless integration with the scikit-learn ecosystem, which allows them to use familiar machine learning models for forecasting without learning a new API. Its comprehensive tooling for model validation, backtesting, and production deployment provides a significant advantage over building custom forecasting pipelines from scratch.
Time series forecasting with machine learning models
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Works seamlessly with any scikit-learn compatible estimator, such as LightGBM, XGBoost, and CatBoost, allowing data scientists to use familiar ML models for forecasting without a steep learning curve.
Supports both recursive and direct forecasting methods for single and multi-series data, as shown in the detailed forecaster comparison table, catering to diverse project needs.
Includes comprehensive tools for backtesting, hyperparameter tuning, and model validation, enabling robust performance evaluation and deployment in real-world scenarios.
Offers specialized forecasters for various approaches, including deep learning RNNs and statistical SARIMAX models, providing versatility beyond standard ML estimators.
While it includes statistical models, the library is optimized for machine learning, so users relying heavily on pure statistical methods may find integration with libraries like statsmodels less seamless or efficient.
Full functionality requires managing additional dependencies for specific estimators (e.g., deep learning frameworks), which can complicate installation and increase environment setup time.
The library is geared towards batch processing and may not handle ultra-high-frequency or irregularly spaced time series out-of-the-box without custom preprocessing.