A comprehensive Python library for natural language processing, providing modules, datasets, and tutorials for NLP research and development.
A full-featured computer algebra system (CAS) written in pure Python for symbolic mathematics.
A Python library for video editing, processing, and custom effects creation through code.
A simple Python framework for state-of-the-art natural language processing (NLP) tasks like named entity recognition and sentiment analysis.
A framework-agnostic Python package for deep learning on graphs, optimized for performance and scalability.
Generate comprehensive data quality profiling and exploratory data analysis reports for Pandas and Spark DataFrames with a single line of code.
An open-source library for rapid development of software dealing with 3D data, with support for C++ and Python.
A Python library that provides reliable, validated JSON outputs from any LLM using Pydantic models.
A Python-based interactive packet manipulation program and library for network analysis, scanning, and security testing.
A Python library that explains predictions of any machine learning classifier using local interpretable model-agnostic explanations.
An open-source data-centric AI library for automatically detecting and fixing data quality issues in machine learning datasets.
A Python library for data quality testing and validation using expressive, extensible Expectations.
A CLI utility that pipes video streams from services like Twitch and YouTube into video players or files.
A Python library for language-vision intelligence research, providing unified access to state-of-the-art models, datasets, and tasks.
A Python library for building custom machine learning models for tasks like image classification, object detection, and recommendations.
An automated machine learning library that trains and deploys high-accuracy models for tabular, text, image, and time series data with minimal code.
A pure-Python PDF library for splitting, merging, cropping, transforming, and extracting data from PDF files.
A Python library for performing data science and machine learning on data you cannot access directly, by working through remote datasites.
An open-source Python toolkit for speaker diarization with state-of-the-art pretrained models and pipelines.
A Python library for flexible and readable tensor operations across NumPy, PyTorch, JAX, TensorFlow, and other frameworks.
A Python library for user-friendly forecasting and anomaly detection on time series, from ARIMA to deep neural networks.
An AutoML library for deep learning that automates model selection and hyperparameter tuning using Keras and TensorFlow.
Deep Lake is a multimodal data lake and vector store optimized for AI, enabling scalable data management, retrieval, and training for LLM and deep learning applications.
A flexible, scalable deep probabilistic programming library built on PyTorch for universal probabilistic modeling.
A high-performance gradient boosting library with best-in-class handling of categorical features and support for CPU/GPU training.
A fast online machine learning system with advanced techniques like hashing, reductions, and contextual bandits.
A Python library for building production-ready model inference APIs, job queues, and multi-model serving systems for AI applications.
A CLI tool and Python library that converts command output, files, and strings to JSON, YAML, or dictionaries for easier parsing.
A Python library for music and audio analysis, providing tools for feature extraction, visualization, and transformation.
A fast, correct JSON library for Python with native support for dataclasses, datetimes, and NumPy.
A semantic cache library for LLM queries that, according to the project, can cut API costs by 10x and speed up responses by 100x.
A Python NLP library from Stanford for tokenization, sentence segmentation, NER, and dependency parsing across 60+ languages.
An open-source Python library for automated feature engineering using Deep Feature Synthesis.
Automatically differentiate native Python and NumPy code for gradient-based optimization and machine learning.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.