Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Natural Language Processing

Natural Language Processing

268 projects

Showing 36 of 268 projects

WordGPT
WordGPTTypeScript

A Microsoft Word add-in that integrates OpenAI's ChatGPT to enhance writing with AI-powered text generation.

#ai-writing#productivity#office
Stars322
Forks60
Last commit3 years ago
stringi <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
stringi <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">C++

Fast and portable character string processing in R using the Unicode ICU library.

#unicode#regex#stringi
Stars317
Forks48
Last commit
awesome-nlp-polish
awesome-nlp-polish

A curated list of resources for Natural Language Processing (NLP) in Polish, including datasets, models, and tools.

#nlp-tools#nlp-datasets#natural-language-processing
Stars307
Forks34
Last commit4 years ago
Tensorflow FastText
Tensorflow FastTextPython

A TensorFlow implementation of fastText for embedding-based text classification with support for character ngrams and distributed training.

#distributed-training#language-identification#text-classification
Stars303
Forks90
Last commit
Conversational AI
Conversational AI

A curated collection of resources for building conversational AI applications like chatbots and voice assistants.

#ai#nlp-resources#chatbot
Stars296
Forks17
Last commit4 years ago
Indonesian NLP
Indonesian NLP

A curated collection of datasets, corpora, and resources for Indonesian natural language processing tasks.

#nlp-resources#indonesian-language#text-classification
Stars290
Forks46
Last commit4 years ago
wit-ruby
wit-rubyRuby

Official Ruby SDK for Wit.ai, providing natural language processing and conversational AI capabilities.

#intent-recognition#chatbots#entity-extraction
Stars283
Forks67
Last commit4 years ago
awesome-hungarian-nlp
awesome-hungarian-nlp

A curated list of free tools, datasets, models, and resources for Hungarian Natural Language Processing.

#computational-linguistics#hungarian#information-retrieval
Stars278
Forks19
Last commit1 month ago
Chalk
ChalkScala

A Scala library for natural language processing with functional and actor-based pipelines.

#nlp-library#functional-programming#pipeline-architecture
Stars260
Forks48
Last commit9 years ago
Neural machine translation between the writings of Shakespeare and modern English using TensorFlow
Neural machine translation between the writings of Shakespeare and modern English using TensorFlowPython

Neural machine translation between Shakespearean and modern English using TensorFlow.

#sequence-to-sequence#nlp-research#deep-learning
Stars248
Forks57
Last commit
Anafora
AnaforaJavaScript

A lightweight, web-based raw text annotation tool for collaborative annotation projects with support for complex schemas.

#open-source-annotation#data-labeling#natural-language-processing
Stars242
Forks56
Last commit3 years ago
Backprop
BackpropPython

A Python library that simplifies using, finetuning, and deploying state-of-the-art machine learning models for various AI tasks.

#transfer-learning#api#python-library
Stars241
Forks11
Last commit5 years ago
cl-nlp
cl-nlpCommon Lisp

A comprehensive and extensible natural language processing toolkit for Common Lisp, supporting custom pipelines and experimentation.

#pos-tagging#natural-language-processing#text-processing
Stars236
Forks28
Last commit6 years ago
genius
geniusPython

An open-source Chinese text segmentation library using CRF (Conditional Random Field) algorithm with support for pinyin segmentation and part-of-speech tagging.

#part-of-speech-tagging#search-indexing#python-library
Stars234
Forks63
Last commit7 years ago
Topic Models Resources
Topic Models ResourcesR

A curated collection of learning resources, R packages, and practical examples for understanding and applying topic modeling techniques.

#document-analysis#text-analysis#data-science
Stars232
Forks54
Last commit
TextRank
TextRankGo

A Go implementation of the TextRank algorithm for automatic text summarization, phrase extraction, and keyword ranking with multithreading support.

#automatic-summarization#textrank#graph-algorithms
Stars224
Forks23
Last commit11 months ago
OpenCCG
OpenCCGJava

A Java library for parsing and generating text using combinatory categorial grammar and hybrid logic dependency semantics.

#computational-linguistics#java-library#grammar-parsing
Stars219
Forks45
Last commit5 years ago
Cadmium
CadmiumJust

A comprehensive Natural Language Processing (NLP) library for the Crystal programming language.

#readability#nlp-library#modular-architecture
Stars211
Forks14
Last commit5 months ago
react-native-dialogflow
react-native-dialogflowJavaScript

A React Native bridge for integrating Google Dialogflow (API.AI) SDK to build conversational interfaces in mobile apps.

#dialogflow#speech-to-function#speak
Stars205
Forks61
Last commit3 years ago
Awesome Community-Curated NLP List
Awesome Community-Curated NLP List

A community-curated list of NLP tools, libraries, datasets, and resources across speech processing, text analysis, and machine translation.

#community-driven#text-analysis#nlp-tools
Stars202
Forks33
Last commit3 years ago
awesome-danish
awesome-danish

A curated list of awesome resources for Danish language technology, including datasets, models, and tools.

#corpora#nlp-tools#natural-language-processing
Stars195
Forks20
Last commit1 year ago
MolT5
MolT5Python

A T5-based model for bidirectional translation between molecular structures (SMILES) and natural language descriptions.

#transformer#cheminformatics#natural-language-processing
Stars194
Forks22
Last commit2 years ago
go-porterstemmer
go-porterstemmerGo

A native Go implementation of the Porter Stemming algorithm for NLP and machine learning tasks.

#stemming#natural-language-processing#golang-library
Stars193
Forks45
Last commit5 years ago
CORD-19
CORD-19

A corpus of academic papers about COVID-19 and related coronavirus research for text mining and NLP.

#document-embeddings#semantic-scholar#natural-language-processing
Stars186
Forks23
Last commit1 year ago
Norwegian NLP resources
Norwegian NLP resources

A curated collection of open-source libraries, models, datasets, and tools for Natural Language Processing (NLP) in Norwegian.

#spacy#nlp-resources#bokmal
Stars182
Forks15
Last commit5 years ago
getlang
getlangGo

A pure Go library for fast, offline natural language detection supporting 29 languages.

#iso-639#text-analysis#natural-language
Stars175
Forks23
Last commit5 years ago
wit-go
wit-goGo

A Go client library for interacting with the Wit.ai natural language processing HTTP API.

#intent-recognition#go-client#entity-extraction
Stars170
Forks36
Last commit9 months ago
EDS_NLP
EDS_NLPPython

A modular NLP framework for extracting information from French clinical notes, compatible with spaCy and PyTorch.

#medical-text#spacy#fast
Stars165
Forks42
Last commit3 days ago
words_counted
words_countedRuby

A Ruby natural language processor for tokenizing and analyzing text with flexible filtering and custom regex support.

#nlp-library#word-counter#text-analysis
Stars164
Forks28
Last commit4 years ago
postagga
postaggaClojure

A Clojure/ClojureScript library for building self-contained natural language parsers using part-of-speech tagging and semantic rules.

#part-of-speech-tagging#bots#language-understanding
Stars162
Forks16
Last commit5 years ago
Qtypes
QtypesJavaScript

A rule-based question classification system for Node.js that categorizes questions by type and answer format.

#qa-systems#nlp-library#text-analysis
Stars161
Forks27
Last commit9 years ago
shield
shieldGo

Bayesian text classifier for Go with flexible tokenizers and storage backends.

#redis#text-classification#bayesian-classifier
Stars160
Forks31
Last commit6 years ago
dbmdz BERT models
dbmdz BERT models

A collection of pre-trained BERT, DistilBERT, ELECTRA, GPT-2, and ConvBERT models for multiple languages, including German, Italian, Turkish, and historic texts.

#italian-nlp#transformer-models#natural-language-processing
Stars158
Forks12
Last commit3 years ago
natural-language-classifier-nodejs
natural-language-classifier-nodejsJavaScript

A deprecated Node.js sample application demonstrating IBM Watson Natural Language Classifier service features.

#text-classification#natural-language-processing#cloud-foundry
Stars157
Forks202
Last commit
open-solution-toxic-comments
open-solution-toxic-commentsPython

An open-source starter solution for the Kaggle Toxic Comment Classification Challenge, providing ready-to-use machine learning pipelines for detecting online harassment.

#ensemble-learning#text-classification#data-science
Stars155
Forks55
Last commit
NLP4J
NLP4JJava

A natural language processing framework for JVM languages with comprehensive linguistic analysis tools.

#coreference-resolution#java-nlp#semantic-role-labeling
Stars155
Forks32
Last commit5 years ago
PreviousPage 6 of 8

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
5 days ago
8 years ago
3 years ago
10 years ago
4 years ago
4 years ago
Next
#Machine Learning128
#Nlp89
#Text Analysis63
#Deep Learning61
#Python43
#Computer Vision33
#Text Processing32
#Named Entity Recognition31
#Python Library29
#Text Classification24
#Tensorflow23
#Ruby Gem22