Showing 36 of 268 projects
An open-source study on neural question generation using transformers, providing simplified training and inference pipelines.
A Rust library for natural language detection using trigram models, focusing on simplicity and performance.
A curated list of awesome resources, libraries, and tools for natural language processing (NLP) in Ruby.
A curated list of awesome resources, libraries, and tools for natural language processing (NLP) in Ruby.
A GPT-2 variant that generates plausible fake words, definitions, and usage examples from scratch.
A TensorFlow implementation of QANet for machine reading comprehension on the SQuAD dataset.
A pretrained modeling library for Keras 3 offering simple, flexible, and fast access to models for text, image, and audio tasks.
A collaboratively maintained, reverse-chronological list of datasets and corpora for natural language processing tasks.
A Python natural language processing library for pre-modern languages like Latin, Ancient Greek, and Sanskrit.
TensorFlow implementation of an attention-based neural image caption generator that focuses on relevant image parts while generating words.
An R package for the quantitative analysis of textual data, providing comprehensive tools for natural language processing and text management.
An efficient R package for text analysis and NLP with fast vectorization, topic modeling, and word embeddings.
Swift SDK for integrating IBM Watson AI services like speech, language, and assistant into iOS and Linux applications.
An AI-powered terminal assistant that uses OpenAI ChatGPT to generate and run commands from natural language descriptions.
An open-source suite featuring financial large language models (FinMA), instruction datasets (FIT), and evaluation benchmarks (FinBen) for financial AI.
An open-source Python framework for building chat-ops bots that connect chat services, natural language APIs, and third-party services.
Catalyst is a high-performance C# NLP library inspired by spaCy, offering pre-trained models, entity recognition, and embedding training.
A JupyterLab extension that integrates GPT-4 as a code interpreter, translating natural language to Python and executing it automatically.
A Go library for naive Bayesian classification and TF-IDF calculations on string sets.
A Ruby gem for calculating text similarity using tf*idf and BM25 vector space models.
A curated list of resources for Question Answering (QA), covering machine learning, deep learning, datasets, and research.
Pre-trained BERT models fine-tuned on clinical text from MIMIC for medical natural language processing tasks.
A tool for automatically annotating mentions of DBpedia resources in text, linking entities to their global identifiers.
A pre-trained BERT model designed for DNA sequence analysis, enabling genome understanding tasks like classification and motif discovery.
A Ruby library for text classification with Bayesian, LSI, logistic regression, k-NN, and TF-IDF algorithms.
A modern C++ toolkit for text retrieval and analysis, featuring indexing, ranking, topic modeling, classification, and language models.
Pre-trained biomedical language representation model for biomedical text mining tasks like named entity recognition and relation extraction.
A framework for building AI agents that run in the browser, with support for Angular and React.
A natural language detection library for Go that identifies 84 languages and scripts with no external dependencies.
A fast, open-source platform for topic modeling using Additive Regularization of Topic Models (ARTM).
A curated list of awesome Torch tutorials, projects, libraries, and communities for deep learning.
A curated list of deep learning resources for video-text retrieval, including papers, implementations, and datasets.
A cookiecutter template for deploying spaCy NLP models as FastAPI services compatible with Azure Search Custom Skills.
A .NET wrapper for Stanford CoreNLP providing natural language processing capabilities including tokenization, parsing, and named entity recognition.
A Python wrapper and JSON-RPC server for Stanford CoreNLP, providing NLP tools like parsing, tagging, and coreference resolution.
A rule-based sentence boundary detection gem for Ruby that works out-of-the-box across many languages.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.