Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Nlp

Nlp

165 projects

Showing 36 of 165 projects

compare-mt
compare-mtPython

A command-line tool for holistic comparison and error analysis of language generation systems like machine translation and summarization.

#evaluation-metrics#machine-translation#summarization
Stars471
Forks58
Last commit8 months ago
Sentimental
SentimentalRuby

A Ruby gem for simple sentiment analysis that classifies text as positive, negative, or neutral based on configurable thresholds.

#text-classification#text-analysis#ruby-gem
Stars464
Forks72
Last commit7 years ago
tone-analyzer-nodejs
tone-analyzer-nodejsCSS

A Node.js sample application demonstrating the IBM Watson Tone Analyzer service for detecting emotional and language tones in text.

#sample-app#text-analysis#kubernetes
Stars452
Forks270
Last commit4 years ago
Biomedical Information Extraction
Biomedical Information Extraction

A curated list of resources for Biomedical Information Extraction (BioIE), including datasets, tools, libraries, and research.

#biomedical-language#biomedical-nlp#biomedical-data
Stars445
Forks39
Last commit13 days ago
Graphify
GraphifyJava

A Neo4j extension for document and text classification using graph-based hierarchical pattern recognition.

#semantic-analysis#text-classification#neo4j-extension
Stars433
Forks100
Last commit6 years ago
FakeNewsCorpus
FakeNewsCorpus

A dataset of millions of news articles labeled by credibility type for training fake news detection algorithms.

#database#data-scraping#text-corpus
Stars413
Forks98
Last commit6 years ago
tiktoken-rs
tiktoken-rsRust

A Rust implementation of OpenAI's tiktoken tokenizer for working with GPT models and token counting.

#tiktoken#openai#text-processing
Stars394
Forks69
Last commit6 days ago
Neural Machine Translation Implementations
Neural Machine Translation Implementations

A curated list of open-source neural machine translation implementations across various deep learning frameworks.

#sequence-to-sequence#transformer-models#framework-comparison
Stars364
Forks66
Last commit3 years ago
PySS3
PySS3Python

A Python library for interpretable text classification using the SS3 model, with built-in visualization tools for explainable AI.

#hyperparameter-optimization#explainable-artificial-intelligence#python-library
Stars348
Forks44
Last commit
End-To-End Memory Networks
End-To-End Memory NetworksPython

A TensorFlow implementation of End-To-End Memory Networks with a scikit-learn-like interface for bAbI tasks.

#babi-dataset#neural-networks#question-answering
Stars340
Forks131
Last commit9 years ago
awesome-nlp-polish
awesome-nlp-polish

A curated list of resources for Natural Language Processing (NLP) in Polish, including datasets, models, and tools.

#nlp-tools#nlp-datasets#natural-language-processing
Stars307
Forks34
Last commit4 years ago
knowledge-gpt
knowledge-gptPython

Extract and index knowledge from websites, PDFs, docs, and YouTube to power Q&A sessions using GPT and other language models.

#youtube-transcription#semantic-search#knowledge-extraction
Stars290
Forks53
Last commit
Flare
FlareClojure

A Clojure library for dynamic neural network graphs with pluggable tensor backends, inspired by PyTorch.

#dynamic-graph#intel-mkl#neural-networks
Stars287
Forks18
Last commit7 years ago
VeritasGraph
VeritasGraphPython

An enterprise-grade Graph RAG framework combining hierarchical tree navigation with knowledge graph reasoning for verifiable, on-premise AI.

#multi-hop-reasoning#information-retrieval#document-navigation
Stars282
Forks33
Last commit
awesome-hungarian-nlp
awesome-hungarian-nlp

A curated list of free tools, datasets, models, and resources for Hungarian Natural Language Processing.

#computational-linguistics#hungarian#information-retrieval
Stars278
Forks19
Last commit1 month ago
flaxmodels
flaxmodelsPython

A collection of pretrained deep learning models (StyleGAN2, GPT2, VGG, ResNet) for the Jax/Flax ecosystem.

#jax#resnet#vgg
Stars265
Forks28
Last commit1 year ago
NewsQA
NewsQAPython

Tools for compiling and using the Maluuba NewsQA dataset, a machine reading comprehension dataset based on CNN articles.

#question-answering#python#reading-comprehension
Stars257
Forks56
Last commit3 years ago
Backprop
BackpropPython

A Python library that simplifies using, finetuning, and deploying state-of-the-art machine learning models for various AI tasks.

#transfer-learning#api#python-library
Stars241
Forks11
Last commit5 years ago
scicloj.ml
scicloj.mlClojure

An idiomatic Clojure machine learning library providing a unified interface for classification, regression, and unsupervised models.

#metamorph#tech-ml-dataset#hyperparameter-optimization
Stars238
Forks16
Last commit7 months ago
Cadmium
CadmiumJust

A comprehensive Natural Language Processing (NLP) library for the Crystal programming language.

#readability#nlp-library#modular-architecture
Stars211
Forks14
Last commit5 months ago
Stik
StikTypeScript

A macOS app for instantly capturing thoughts with a global shortcut, saving notes as plain markdown files.

#ai#productivity#keyboard-first
Stars209
Forks14
Last commit1 month ago
Awesome Community-Curated NLP List
Awesome Community-Curated NLP List

A community-curated list of NLP tools, libraries, datasets, and resources across speech processing, text analysis, and machine translation.

#community-driven#text-analysis#nlp-tools
Stars202
Forks33
Last commit3 years ago
shainet
shainetCrystal

A pure Crystal machine learning library for building and training neural networks with CPU/GPU support and PyTorch compatibility.

#transformer#cuda#neural-network
Stars195
Forks19
Last commit5 months ago
go-porterstemmer
go-porterstemmerGo

A native Go implementation of the Porter Stemming algorithm for NLP and machine learning tasks.

#stemming#natural-language-processing#golang-library
Stars193
Forks45
Last commit5 years ago
ChatGPT-Python-Applications
ChatGPT-Python-ApplicationsJupyter Notebook

A collection of Python applications demonstrating various use cases of ChatGPT, including chatbots, automation, and voice assistants.

#chatgpt-api#python-applications#fine-tuning
Stars188
Forks37
Last commit
(Node.js)
(Node.js)JavaScript

A sample Node.js app demonstrating Dialogflow features like custom entities, contexts, and deep links for Google Assistant actions.

#dialogflow#firebase-functions#serverless
Stars187
Forks125
Last commit
Norwegian NLP resources
Norwegian NLP resources

A curated collection of open-source libraries, models, datasets, and tools for Natural Language Processing (NLP) in Norwegian.

#spacy#nlp-resources#bokmal
Stars182
Forks15
Last commit5 years ago
R-Net-in-Keras
R-Net-in-KerasPython

A Keras implementation of Microsoft's R-NET neural network for question answering on the SQuAD dataset.

#squad#machine-reading-comprehension#deep-learning
Stars177
Forks88
Last commit8 years ago
getlang
getlangGo

A pure Go library for fast, offline natural language detection supporting 29 languages.

#iso-639#text-analysis#natural-language
Stars175
Forks23
Last commit5 years ago
wit-go
wit-goGo

A Go client library for interacting with the Wit.ai natural language processing HTTP API.

#intent-recognition#go-client#entity-extraction
Stars170
Forks36
Last commit9 months ago
EDS_NLP
EDS_NLPPython

A modular NLP framework for extracting information from French clinical notes, compatible with spaCy and PyTorch.

#medical-text#spacy#fast
Stars165
Forks42
Last commit3 days ago
words_counted
words_countedRuby

A Ruby natural language processor for tokenizing and analyzing text with flexible filtering and custom regex support.

#nlp-library#word-counter#text-analysis
Stars164
Forks28
Last commit4 years ago
open-solution-toxic-comments
open-solution-toxic-commentsPython

An open-source starter solution for the Kaggle Toxic Comment Classification Challenge, providing ready-to-use machine learning pipelines for detecting online harassment.

#ensemble-learning#text-classification#data-science
Stars155
Forks55
Last commit
topicwizard
topicwizardPython

Interactive topic model visualization and interpretation library for Python, compatible with sklearn, Gensim, BERTopic, and Turftopic.

#bertopic#mantine#python-library
Stars148
Forks17
Last commit1 year ago
Paasaa
PaasaaElixir

An Elixir library for natural language and script detection using statistical analysis without AI.

#statistical-analysis#elixir#language-identification
Stars143
Forks14
Last commit5 months ago
wordnet
wordnetRuby

A Ruby interface to the WordNet lexical database, enabling natural language processing and linguistic analysis.

#semantic-analysis#lexical-database#ruby-gem
Stars140
Forks25
Last commit3 years ago
PreviousPage 4 of 5Next

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
7 months ago
3 years ago
10 days ago
1 month ago
6 years ago
4 years ago
#Machine Learning90
#Natural Language Processing89
#Deep Learning46
#Python40
#Text Analysis33
#Python Library25
#Neural Networks25
#Named Entity Recognition21
#Tensorflow20
#Data Science20
#Computer Vision18
#Ai16