Showing 36 of 243 projects
A comprehensive Python library for creating static, animated, and interactive visualizations and publication-quality figures.
A workflow orchestration framework for building resilient data pipelines in Python.
A repository of examples, utilities, and best practices for building and deploying production-ready recommendation systems.
A free, self-taught curriculum following undergraduate Data Science guidelines using MOOCs from top universities.
A reactive Python notebook that's reproducible, git-friendly, and deployable as scripts or apps.
An automatic forecasting procedure for time series data with multiple seasonality and linear or non-linear growth.
An open-source forecasting tool for time series data with multiple seasonality and linear or non-linear growth.
Comprehensive cheatsheets and refreshers covering all key concepts from Stanford's CS 229 Machine Learning course.
An elegant and simple Python library for fetching financial data from various sources, designed for quantitative research.
A curated list of resources dedicated to Natural Language Processing (NLP), including libraries, datasets, tutorials, and research.
A topic-wise curated list of machine learning and deep learning tutorials, articles, and resources for developers and data scientists.
A curated collection of tutorials, articles, and resources for learning machine learning and deep learning topics.
A curated index of the latest and best machine learning and AI courses available on YouTube, organized by topic.
A community-maintained list of entry-level software engineering, product management, quant, and tech jobs for new graduates.
A Python library for topic modeling, document indexing, and similarity retrieval with large corpora.
A Python library for topic modeling, document indexing, and similarity retrieval with large text corpora.
An orchestration platform for developing, deploying, and monitoring data pipelines and assets.
A metapackage for installing and documenting the Jupyter ecosystem of interactive computing tools.
An extensible, next-generation web-based interface for interactive computing and data science, based on the Jupyter Notebook architecture.
A Python distribution for the browser and Node.js based on WebAssembly, enabling Python to run in web environments.
A curated list of awesome big data frameworks, resources, and tools across various categories.
A Python visualization library based on matplotlib for creating attractive statistical graphics with a high-level interface.
Generate comprehensive data quality profiling and exploratory data analysis reports for Pandas and Spark DataFrames with a single line of code.
Generate comprehensive data quality profiles and exploratory data analysis reports for Pandas and Spark DataFrames with a single line of code.
A comprehensive study plan and resource collection for preparing for machine learning engineering interviews at top tech companies.
A Python library that explains predictions of any machine learning classifier using local interpretable model-agnostic explanations.
A Python library for data quality testing and validation using expressive, extensible Expectations.
A curated guide to learning machine learning with Python and Jupyter Notebook, featuring courses, notebooks, and practical resources.
A curated guide to learning machine learning with Python and Jupyter Notebook, featuring hands-on tutorials, courses, and ethical considerations.
A Python framework for creating reproducible, maintainable, and modular data engineering and data science pipelines.
A drop-in replacement for pandas that scales data analysis workflows to use all CPU cores and handle larger-than-memory datasets.
A practical booklet covering the four main steps of designing machine learning systems, with 27 interview questions.
A declarative statistical visualization library for Python built on Vega-Lite.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.