Showing 36 of 158 projects
A modular natural language processing library for Node.js and React Native, designed for building multilingual chatbots and language utilities.
A fast and comprehensive machine learning framework for Java, Scala, and Kotlin with state-of-the-art algorithms and data visualization.
An open-source RPA tool that automates repetitive tasks on websites, desktop apps, and the command line using a simple language.
A high-performance neural network training interface for TensorFlow, optimized for speed and research flexibility.
A high-performance neural network training interface for TensorFlow focused on speed, flexibility, and reproducible research.
An open-source offline translation library and toolkit written in Python, supporting over 30 languages via downloadable language models.
A Python library and CLI tool for web crawling, scraping, and extracting main text, metadata, and comments from web pages.
A curated collection of Python tutorials and resources for data science, machine learning, and natural language processing.
A TensorFlow implementation of a convolutional neural network for sentence classification based on Yoon Kim's paper.
Detect the language of text with support for up to 419 languages, more than any other library.
A state-of-the-art Natural Language Processing library built on Apache Spark, offering 100,000+ pretrained models and pipelines in 200+ languages.
An open-source solution for continuous validation of machine learning models and data, from research to production.
A Python library and CLI tool for automatic text summarization using extractive methods like LexRank, LSA, Luhn, and Edmundson.
A visual roadmap and keyword mind map for students learning Natural Language Processing, from basics to SOTA models.
A Rust-native port of Hugging Face Transformers providing ready-to-use NLP pipelines and transformer models like BERT, GPT2, and T5.
A lightweight deep learning library with a functional API for composing models, compatible with PyTorch, TensorFlow, and MXNet.
A Go library for efficient multilingual text segmentation and NLP, supporting English, Chinese, Japanese, and more.
A natural language processor powered by plugins that transforms and analyzes text using syntax trees.
A Python NLP library built on spaCy for text preprocessing, feature extraction, and analysis tasks.
A powerful client-side JavaScript library for interacting with the ChatGPT DOM.
A spaCy pipeline and models specifically designed for processing scientific and biomedical documents.
An easy-to-use, state-of-the-art named-entity recognition (NER) tool based on neural networks.
A BERT language model pre-trained on a large corpus of scientific papers for natural language processing tasks in scientific domains.
A robust yet lenient forced aligner built on Kaldi for aligning speech audio with text transcripts.
A Python library that automatically extracts schema, statistics, and sensitive entities (PII/NPI) from datasets.
A deep learning model for machine comprehension that uses bi-directional attention flow to answer questions about text passages.
A persistent, network resilient, full-text search library for both browser and Node.js environments.
A Python library for evaluating natural language generation models using multiple unsupervised automated metrics.
A PHP Chinese text segmentation module offering precise, full, and search engine modes with support for Traditional Chinese and CJK languages.
The most accurate natural language detection library for Go, excelling with short text and mixed-language content.
A Clojure library that parses natural language text into structured data like dates, times, and durations.
A minimalist neural network library optimized for sparse data and single-machine environments.
A method to steer topic and attributes of GPT-2 language models without fine-tuning, enabling controlled text generation.
An open-source study on neural question generation using transformers, providing simplified training and inference pipelines.
An open-source Java framework for rapid development of machine learning and statistical applications with large dataset support.
A Rust library for natural language detection using trigram models, focusing on simplicity and performance.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.