A BERT model pre-trained on PubMed abstracts and clinical notes for biomedical natural language processing tasks.
A curated collection of open-source machine learning models compatible with Apple's Core ML framework.
TensorFlow implementation of R-Net for machine reading comprehension on the SQuAD dataset.
An R package for creating interactive web-based visualizations of Latent Dirichlet Allocation (LDA) topic models.
A Unity SDK for integrating IBM Watson AI services such as speech, language, and vision into games and applications.
A Scala toolkit for deployable probabilistic modeling using imperatively defined factor graphs.
A curated list of open-access resources and tools for Natural Language Processing (NLP) focused on the German language.
A reading comprehension dataset with Wikipedia summaries, full stories, and question-answer pairs for narrative understanding.
A Go library implementing word embedding models (Word2Vec, GloVe, LexVec) from scratch, with a CLI and an SDK.
An open-source benchmark toolkit for Natural Language Generation in spoken dialogue systems, featuring multiple RNN-based models and datasets.
A Torch implementation of a VIS+LSTM model for answering questions about images using deep learning.
A command-line tool that translates plain English requests into terminal commands using AI.
A comprehensive suite of Java NLP libraries and tools for text annotation, feature extraction, and language processing tasks.
A Ruby wrapper for Ginger Proofreader that corrects spelling and grammar mistakes using contextual sentence analysis.
A Go library implementing selected machine learning algorithms for natural language processing and semantic analysis.
A multilingual command-line sentence tokenizer written in Go, ported from NLTK's Punkt system.
A Ruby gem for simple sentiment analysis that classifies text as positive, negative, or neutral based on configurable thresholds.
A curated list of resources, tools, datasets, and communities for linguistics and natural language processing.
A curated list of resources for Biomedical Information Extraction (BioIE), including datasets, tools, libraries, and research.
An iOS app that uses ChatGPT to generate ARKit code from spoken prompts, placing and manipulating 3D objects in augmented reality.
A medical text mining and information extraction framework built on spaCy for rapid prototyping and training of predictive NLP models.
Ruby bindings for the Stanford CoreNLP natural language processing toolkit, supporting English, French, and German.
A Neo4j extension for document and text classification using graph-based hierarchical pattern recognition.
A dataset of millions of news articles labeled by credibility type for training fake news detection algorithms.
A PyTorch implementation of the DrQA model for reading comprehension and open-domain question answering.
A Naive Bayes machine learning implementation in Elixir with multiple models and storage options.
A Julia package providing standard tools and models for text analysis and natural language processing.
A functional programming library for JavaScript/Node.js focused on string processing, regular expressions, and linear algebra.
Python implementations of various topic modeling algorithms including LDA, collaborative topic models, and hierarchical Dirichlet processes.
A Ruby natural language parser for elapsed time that converts human-readable durations to seconds and vice versa.
An open-source toolkit for building end-to-end trainable task-oriented dialogue models with neural networks.
A curated collection of linguistic resources, tools, and datasets for Natural Language Processing and Computational Linguistics on Spanish.
A curated collection of linguistic resources, datasets, and tools for Natural Language Processing and Computational Linguistics on Spanish.
A Python library for interpretable text classification using the SS3 model, with built-in visualization tools for explainable AI.
A pure Go package for running inference with pre-trained Transformer models from Hugging Face, enabling NLP tasks without external languages.
Scripts and tools to recreate the ELI5 dataset for long-form question answering research.