A machine learning project comparing topological and statistical feature extraction for classifying human activities from smartphone and smartwatch sensor data.
Human Activity Recognition is a machine learning research project that classifies human activities using accelerometer and gyroscope data from smartphones and smartwatches. It compares topological data analysis against statistical feature extraction methods to preprocess sensor data, then evaluates multiple classifiers including Decision Trees, Random Forest, SVM, and CNN. The project aims to determine which preprocessing approach yields better activity recognition accuracy.
Machine learning researchers and data scientists working on human activity recognition, time-series classification, or sensor data analysis. Particularly relevant for those exploring topological data analysis applications or comparing preprocessing techniques for HAR systems.
Provides a comprehensive comparison of topological versus statistical preprocessing methods for HAR using real-world sensor data. The project offers reproducible experiments with multiple classifiers and sensor types, helping researchers understand which approaches work best for different activity recognition scenarios.
This project aims to classify human activities using data obtained from accelerometer and gyroscope sensors from phone and watch.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Directly evaluates topological data analysis against traditional statistical feature extraction, providing empirical evidence on preprocessing efficacy for HAR, as detailed in the results tables.
Tests five classifiers—Decision Tree, k-NN, Random Forest, SVM, and CNN—across different preprocessing methods, offering broad insights into model performance for sensor data.
Analyzes data from both smartphone (pocket) and smartwatch (wrist) sensors, highlighting how sensor type and placement impact accuracy, especially for hand-oriented vs. non-hand-oriented activities.
Trains both user-specific and generalized models, revealing significant performance gaps and the challenges of creating universal HAR systems, as shown in the accuracy tables.
Solely uses the WISDM dataset without extension to other datasets, limiting generalizability and requiring substantial modification for real-world applications beyond the studied 18 activities.
Relies on niche libraries like scikit-tda and giotto-tda for topological analysis, which are harder to install and maintain compared to standard Python ML stacks, increasing setup complexity.
Admits that CNN models performed poorly on impersonal (generalized) models, indicating a limitation in deep learning approaches for cross-user HAR without personalization, as noted in the results discussion.