A Java library implementing various string similarity and distance algorithms like Levenshtein, Jaro-Winkler, and n-gram methods.
A curated collection of hands-on data science project ideas and resources for learning machine learning and AI concepts.
A high-performance Golang port of the Jieba Chinese text segmentation library.
A friendly English-like interface for your command line that translates natural language phrases into executable commands.
A JAX/Flax-based framework for easy and scalable pre-training, fine-tuning, evaluation, and serving of large language models.
A natural language processor powered by plugins that transforms and analyzes text using syntax trees.
A modular toolkit for machine learning, natural language processing, and text generation with TensorFlow and PyTorch versions.
A Python NLP library built on spaCy for text preprocessing, feature extraction, and analysis tasks.
A curated collection of R tutorials, packages, and resources for Data Science, NLP, and Machine Learning.
A spaCy pipeline and models specifically designed for processing scientific and biomedical documents.
A self-contained machine learning and natural language processing library written in pure Go with a dynamic computational graph.
A Python package for fine-tuning and generating text with GPT-2 and GPT Neo models using PyTorch and Hugging Face Transformers.
A library for creating TensorFlow models that handle structured data with dynamic computation graphs using dynamic batching.
A Go library and CLI tool for converting Chinese characters to Hanyu Pinyin with tone support.
Implementations of memory-augmented neural networks for language modeling, dialogue systems, and question answering tasks.
A collection of models, callbacks, and datasets to extend PyTorch Lightning for applied AI/ML research and production.
An easy-to-use, state-of-the-art named-entity recognition (NER) tool based on neural networks.
A BERT language model pre-trained on a large corpus of scientific papers for natural language processing tasks in scientific domains.
A deep learning model for machine comprehension that uses bi-directional attention flow to answer questions about text passages.
A curated list of resources for Document Understanding (DU), covering research, datasets, tools, and applications in Intelligent Document Processing.
A Node.js client library for accessing IBM Watson AI services like Assistant, Speech-to-Text, and Natural Language Understanding.
A natural language date/time parser with pluggable rules and merge strategies for Go applications.
A Python client library for interacting with IBM Watson AI services, available via pip as ibm-watson.
A Python library for evaluating natural language generation models using multiple unsupervised automated metrics.
A PHP Chinese text segmentation module offering precise, full, and search engine modes with support for Traditional Chinese and CJK languages.
A comprehensive natural language processing framework for Ruby with support for text extraction, parsing, and machine learning.
A natural language detection library for Go that aims for high accuracy, particularly on short text and mixed-language content.
A Clojure library that parses natural language text into structured data like dates, times, and durations.
A curated collection of resources for deep learning applications in natural language processing.
A script that generates question/answer pairs from CNN and Daily Mail articles for machine reading comprehension research.
A curated collection of Ruby libraries, tools, and resources for Natural Language Processing (NLP).
A curated repository of famous Vision-Language Models (VLMs) detailing their architectures, training procedures, and datasets.
A command-line tool that performs semantic searches on text using word embeddings to find words with similar meaning to the query.
A curated list of awesome resources for information retrieval and web search, including books, courses, datasets, and software.
A curated list of recent research papers and resources on Vision and Language Pre-trained Models (VL-PTMs).
A method to steer topic and attributes of GPT-2 language models without fine-tuning, enabling controlled text generation.