An AutoML implementation and tutorial for automating machine learning pipelines on both static datasets and dynamic data streams, with a focus on IoT anomaly detection.
AutoML-Implementation-for-Static-and-Dynamic-Data-Analytics is an open-source project that provides an Automated Machine Learning (AutoML) implementation for both batch (static) and online (dynamic) data analytics. It automates the entire machine learning pipeline, including data pre-processing, feature engineering, model selection, and hyperparameter optimization, with a focus on IoT anomaly detection. The project also serves as a tutorial to help researchers and practitioners automatically obtain optimized models for specific tasks.
Machine learning researchers, data analysts, and industrial users working on IoT data analytics, anomaly detection, or those seeking to automate their ML workflows. It is particularly useful for those dealing with both static datasets and dynamic data streams.
Developers choose this project because it offers a comprehensive, hands-on AutoML implementation with a strong focus on IoT applications and support for both batch and online learning. It combines practical code with educational resources, including a detailed case study and links to a highly cited hyperparameter optimization tutorial.
Implementation/Tutorial of using Automated Machine Learning (AutoML) methods for static/batch and online/continual learning
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Automates the entire ML pipeline from data pre-processing to model updating, as outlined in the AutoML procedures sections, reducing manual effort in model development.
Implements both batch learning for static datasets and online learning for dynamic data streams, with specific notebooks and algorithms like Adaptive Random Forest for concept drift handling.
Links to a highly cited hyperparameter optimization tutorial with 1,200+ GitHub stars, enhancing its value as a learning tool for ML researchers and practitioners.
Includes practical case studies using datasets like CICIDS2017 and IoTID20 for intrusion detection, providing concrete examples of AutoML application in IoT contexts.
Code is primarily in Jupyter notebooks, which can hinder version control, collaboration, and production deployment compared to modular, package-based solutions.
Case studies and optimizations are tailored for IoT anomaly detection, requiring significant adaptation for use in other domains without built-in generalizability.
Requires installation of multiple libraries like Keras, hyperopt, and River, leading to potential setup issues and maintenance overhead in diverse environments.