Showing 14 of 14 projects
A drop-in replacement for pandas that scales data analysis workflows to use all CPU cores and handle out-of-memory datasets.
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
A GPU-accelerated DataFrame library for tabular data processing, part of the RAPIDS data science suite.
A suite of GPU-accelerated machine learning algorithms with scikit-learn compatible APIs for 10-50x faster performance on large datasets.
A Python package for working with labeled multi-dimensional arrays, inspired by pandas and tailored for scientific data.
A Python package that automatically accelerates pandas and Modin DataFrame apply operations by choosing the fastest available method.
An open-source Python library for low-code data preparation, offering fast EDA, data cleaning, and collection from APIs and databases.
A Python library for agile data preparation workflows that works with Pandas, Dask, cuDF, Dask-cuDF, Vaex, and PySpark.
A Python framework for scalable time series forecasting using machine learning models, designed for production environments.
An engine for ML/data tracking, visualization, explainability, drift detection, and dashboards, integrated with Polyaxon.
A distributed web interface for collaborative memory forensics analysis using Volatility 3.
A Python library for reading, writing, and converting microscopy image formats with support for OME-TIFF, CZI, ND2, and more.
A Python histogram library offering updateable, semantic histogram objects with multiple visualization backends and data source support.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.