Showing 22 of 22 projects
A Python library for topic modeling, document indexing, and similarity retrieval with large text corpora.
A Python library for topic modeling, document indexing, and similarity retrieval with large corpora.
A comprehensive collection of machine learning algorithms implemented exclusively in NumPy for educational purposes and prototyping.
A toolkit for distributed machine learning featuring parameter server framework, topic modeling, gradient boosting, and word embedding.
A Python NLP library built on spaCy for text preprocessing, feature extraction, and analysis tasks.
A dedicated OCaml system for scientific and engineering computing, providing n-dimensional arrays, linear algebra, algorithmic differentiation, and neural networks.
An efficient R package for text analysis and NLP with fast vectorization, topic modeling, and word embeddings.
A modern C++ toolkit for text retrieval and analysis, featuring indexing, ranking, topic modeling, classification, and language models.
A fast, open-source platform for topic modeling using Additive Regularization of Topic Models (ARTM).
A deep learning system that classifies food images into 230 categories and retrieves matching recipes using convolutional neural networks.
An R package for creating interactive web-based visualizations of Latent Dirichlet Allocation (LDA) topic models.
A Scala toolkit for deployable probabilistic modeling using imperatively-defined factor graphs.
A Go library implementing selected machine learning algorithms for natural language processing and semantic analysis.
A Julia package providing standard tools and models for text analysis and natural language processing.
Python implementations of various topic modeling algorithms including LDA, collaborative topic models, and hierarchical Dirichlet processes.
A curated collection of learning resources, R packages, and practical examples for understanding and applying topic modeling techniques.
A generic numerical library for D providing sparse tensors, linear algebra, and machine learning components.
Interactive topic model visualization and interpretation library for Python, compatible with sklearn, Gensim, BERTopic, and Turftopic.
A Python toolbox using deep belief networks for topic modeling on document data, producing latent representations for content-based recommendation.
A Ruby wrapper for Latent Dirichlet Allocation (LDA) that clusters documents into topics with native, Rust, and pure Ruby backends.
A high-level Python toolbox for topic modeling with easy-to-use functions and command-line interface.
A Python pipeline for multilingual text clustering using Latent Dirichlet Allocation with stop words removal, n-gram features, and inverse stemming.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.