Course materials for GWU's Data Mining and Machine Learning classes covering preprocessing, modeling, and practical Kaggle applications.
GWU_data_mining is a collection of educational materials for George Washington University's Data Mining and Machine Learning courses. It provides structured learning resources covering data preprocessing, statistical modeling, machine learning algorithms, and practical applications through Kaggle competitions. The repository serves as a complete curriculum for students and professionals looking to build data science skills.
Graduate students in data science or analytics programs, particularly those at GWU, and self-learners seeking structured, university-level materials on data mining and machine learning.
It offers a comprehensive, applied curriculum with real-world project integration (Kaggle), support for multiple software tools, and a pre-configured Docker environment for easy setup, making it a practical resource for hands-on learning.
Materials for GWU DNSC 6279 and DNSC 6290.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Covers 11 sections from basic data prep to model interpretability, providing a full university-level education in data mining and machine learning.
Includes hands-on workshops for ongoing Kaggle competitions like Advanced Regression and Digit Recognizer, emphasizing real-world application.
Designed to work with Python, R, SAS, H2O, PySpark, TensorFlow, Keras, and XGBoost, offering flexibility across different tools.
Provides a pre-configured Docker image with Anaconda Python 3.5, H2O, XGBoost, and GraphViz for easy and consistent environment replication.
Relies on Anaconda Python 3.5 and may not be updated for newer library releases, potentially causing compatibility issues.
Some materials are restricted to Blackboard for GWU students, limiting full availability for external learners.
Includes grading schemes, exams, and university-specific policies that are irrelevant for self-learners focused solely on skills.