Showing 32 of 32 projects
TensorFlow implementation and pre-trained models for BERT, a bidirectional Transformer for language understanding.
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications in Python.
An open-source NLP framework for building and deploying deep learning dialog systems and chatbots with PyTorch and transformers.
A domain-specific generative language model pre-trained on biomedical literature for text generation and mining tasks.
A PyTorch system for open-domain question answering by retrieving and reading documents, originally applied to Wikipedia.
Give ChatGPT long-term memory by uploading custom knowledge base files (PDF, txt, epub) and asking questions via a React frontend.
A Rust-native port of Hugging Face Transformers providing ready-to-use NLP pipelines and transformer models like BERT, GPT2, and T5.
Implementations of memory-augmented neural networks for language modeling, dialogue systems, and question answering tasks.
A deep learning model for machine comprehension that uses bi-directional attention flow to answer questions about text passages.
A curated list of 'Ask Me Anything' (AMA) repositories from open-source developers and organizations.
Script to generate question/answer pairs from CNN and Daily Mail articles for machine reading comprehension research.
A TensorFlow implementation of QANet for machine reading comprehension on the SQuAD dataset.
A collaboratively maintained, reverse-chronological list of datasets and corpora for natural language processing tasks.
An open-source suite featuring financial large language models (FinMA), instruction datasets (FIT), and evaluation benchmarks (FinBen) for financial AI.
A curated list of resources for Question Answering (QA), covering machine learning, deep learning, datasets, and research.
A Python tool that uses GPT-3.5 to read, summarize, and answer questions about academic PDF papers locally.
Pre-trained biomedical language representation model for biomedical text mining tasks like named entity recognition and relation extraction.
TensorFlow implementation of R-Net for machine reading comprehension on the SQuAD dataset.
A reading comprehension dataset with Wikipedia summaries, full stories, and question-answer pairs for narrative understanding.
A tool-augmented LLM that uses NCBI Web APIs to answer biomedical questions with high accuracy and reduced hallucinations.
A PyTorch implementation of the DrQA model for reading comprehension and open-domain question answering.
A TensorFlow implementation of End-To-End Memory Networks with a scikit-learn-like interface for bAbI tasks.
A pure Go package for running inference with pre-trained Transformer models from Hugging Face, enabling NLP tasks without external languages.
Scripts and tools to recreate the ELI5 dataset for long-form question answering research.
Extract and index knowledge from websites, PDFs, docs, and YouTube to power Q&A sessions using GPT and other language models.
Tools for compiling and using the Maluuba NewsQA dataset, a machine reading comprehension dataset based on CNN articles.
A Python library that simplifies using, finetuning, and deploying state-of-the-art machine learning models for various AI tasks.
A Keras implementation of Microsoft's R-NET neural network for question answering on the SQuAD dataset.
A public Q&A repository where Sindre Sorhus answers personal, technical, and life questions.
A public Q&A repository where Sindre Sorhus answers questions about code, work, life, and anything else.
A characteristic-rich dataset for factoid question answering with explicit question specifications to enable fine-grained QA system evaluation.
A collection of tools, datasets, and approaches for building natural language interfaces to query the Web of Data.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.