Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Natural Language Processing

Natural Language Processing

268 projects

Showing 36 of 268 projects

BlueBERT
BlueBERTPython

A BERT model pre-trained on PubMed abstracts and clinical notes for biomedical natural language processing tasks.

#relation-extraction#biomedical-nlp#transfer-learning
Stars593
Forks81
Last commit3 years ago
Awesome CoreML
Awesome CoreML

A curated collection of open-source machine learning models compatible with Apple's Core ML framework.

#ai#coremltools#ios
Stars585
Forks63
Last commit6 years ago
R-Net
R-NetPython

TensorFlow implementation of R-Net for machine reading comprehension on the SQuAD dataset.

#squad#deep-learning#neural-networks
Stars577
Forks209
Last commit7 years ago
LDAvis
LDAvisJavaScript

An R package for creating interactive web-based visualizations of Latent Dirichlet Allocation (LDA) topic models.

#statistical-visualization#r-package#text-analysis
Stars570
Forks130
Last commit2 years ago
unity-sdk
unity-sdkC#

A Unity SDK for integrating IBM Watson AI services like speech, language, and vision into games and applications.

#unity3d#hacktoberfest#csharp
Stars565
Forks205
Last commit1 year ago
FACTORIE
FACTORIEScala

A Scala toolkit for deployable probabilistic modeling using imperatively-defined factor graphs.

#probabilistic-modeling#factor-graphs#scala-library
Stars552
Forks143
Last commit8 years ago
German NLP resources
German NLP resources

A curated list of open-access resources and tools for Natural Language Processing (NLP) focused on the German language.

#german-language#computational-linguistics#language-resources
Stars524
Forks66
Last commit1 year ago
NarrativeQA
NarrativeQAShell

A reading comprehension dataset with Wikipedia summaries, full stories, and question-answer pairs for narrative understanding.

#narrative-understanding#text-analysis#deep-learning
Stars514
Forks69
Last commit6 years ago
word-embedding
word-embeddingGo

A Go library implementing word embedding models (Word2Vec, GloVe, LexVec) from scratch with CLI and SDK.

#word2vec#go-library#natural-language-processing
Stars506
Forks45
Last commit3 years ago
RNNLG
RNNLGPython

An open-source benchmark toolkit for Natural Language Generation in spoken dialogue systems, featuring multiple RNN-based models and datasets.

#nltk#nlp-research#deep-learning
Stars491
Forks126
Last commit7 years ago
Torch code for Visual Question Answering using a CNN+LSTM model
Torch code for Visual Question Answering using a CNN+LSTM modelLua

A Torch implementation of a VIS+LSTM model for answering questions about images using deep learning.

#deep-learning#natural-language-processing#research-implementation
Stars485
Forks90
Last commit
Shell Genie
Shell GeniePython

A command-line tool that translates plain English requests into terminal commands using AI.

#productivity#ai-assistant#shell-scripting
Stars482
Forks40
Last commit2 years ago
CogCompNLP
CogCompNLPJava

A comprehensive suite of Java NLP libraries and tools for text annotation, feature extraction, and language processing tasks.

#part-of-speech-tagging#cogcomp#java-library
Stars479
Forks143
Last commit2 years ago
gingerice
gingericeRuby

A Ruby wrapper for Ginger Proofreader that corrects spelling and grammar mistakes using contextual sentence analysis.

#proofreading#grammar-checker#ruby-gem
Stars477
Forks21
Last commit6 years ago
nlp
nlpGo

A Go library implementing selected machine learning algorithms for natural language processing and semantic analysis.

#semantic-analysis#tf-idf#text-analysis
Stars474
Forks46
Last commit5 years ago
sentences
sentencesGo

A multilingual command-line sentence tokenizer written in Go, ported from NLTK's Punkt system.

#nlp-library#sentences#command-line-tool
Stars470
Forks42
Last commit2 years ago
Sentimental
SentimentalRuby

A Ruby gem for simple sentiment analysis that classifies text as positive, negative, or neutral based on configurable thresholds.

#text-classification#text-analysis#ruby-gem
Stars464
Forks72
Last commit7 years ago
Linguistics
Linguistics

A curated list of resources, tools, datasets, and communities for linguistics and natural language processing.

#computational-linguistics#nlp-resources#natural-language-processing
Stars450
Forks34
Last commit3 months ago
Biomedical Information Extraction
Biomedical Information Extraction

A curated list of resources for Biomedical Information Extraction (BioIE), including datasets, tools, libraries, and research.

#biomedical-language#biomedical-nlp#biomedical-data
Stars445
Forks39
Last commit13 days ago
ChatARKit
ChatARKitC

An iOS app that uses ChatGPT to generate ARKit code from spoken prompts, placing and manipulating 3D objects in augmented reality.

#ios#arkit#natural-language-processing
Stars441
Forks35
Last commit3 years ago
medaCy
medaCyPython

A medical text mining and information extraction framework built on spaCy for rapid prototyping and training of predictive NLP models.

#spacy#clinical-text#metamap
Stars438
Forks92
Last commit3 years ago
stanford-core-nlp
stanford-core-nlpRuby

Ruby bindings for the Stanford CoreNLP natural language processing toolkit, supporting English, French, and German.

#ruby-bindings#text-analysis#language-processing
Stars436
Forks69
Last commit1 year ago
Graphify
GraphifyJava

A Neo4j extension for document and text classification using graph-based hierarchical pattern recognition.

#semantic-analysis#text-classification#neo4j-extension
Stars433
Forks100
Last commit6 years ago
FakeNewsCorpus
FakeNewsCorpus

A dataset of millions of news articles labeled by credibility type for training fake news detection algorithms.

#database#data-scraping#text-corpus
Stars413
Forks98
Last commit6 years ago
DrQA
DrQAPython

A PyTorch implementation of the DrQA model for reading comprehension and open-domain question answering.

#research-tool#squad#spacy
Stars401
Forks109
Last commit4 years ago
simple_bayes
simple_bayesElixir

A Naive Bayes machine learning implementation in Elixir with multiple models and storage options.

#probabilistic-models#naive-bayes#text-classification
Stars396
Forks24
Last commit8 years ago
Text Analysis
Text AnalysisJulia

A Julia package providing standard tools and models for text analysis and natural language processing.

#nlp-library#julia#text-classification
Stars383
Forks92
Last commit2 months ago
scriptum
scriptumJavaScript

A functional programming library for JavaScript/Node.js focused on string processing, regular expressions, and linear algebra.

#functional-programming#transducers#folding
Stars381
Forks20
Last commit10 months ago
Implementation of various topic models in Python
Implementation of various topic models in PythonJupyter Notebook

Python implementations of various topic modeling algorithms including LDA, collaborative topic models, and hierarchical Dirichlet processes.

#research-tool#probabilistic-modeling#bayesian-statistics
Stars374
Forks169
Last commit
chronic_duration
chronic_durationRuby

A Ruby natural language parser for elapsed time that converts human-readable durations to seconds and vice versa.

#time-conversion#datetime-utilities#time-parsing
Stars355
Forks69
Last commit1 year ago
NNDIAL
NNDIALPython

An open-source toolkit for building end-to-end trainable task-oriented dialogue models with neural networks.

#academic-toolkit#dialogue-agents#dialogues
Stars353
Forks102
Last commit9 years ago
Spanish
Spanish

A curated collection of linguistic resources, tools, and datasets for Natural Language Processing and Computational Linguistics on Spanish.

#computational-linguistics#pos-tagging#machine-translation
Stars350
Forks42
Last commit2 years ago
awesome-spanish-nlp
awesome-spanish-nlp

A curated collection of linguistic resources, datasets, and tools for Natural Language Processing and Computational Linguistics on Spanish.

#computational-linguistics#text-analysis#nlp-datasets
Stars350
Forks42
Last commit2 years ago
PySS3
PySS3Python

A Python library for interpretable text classification using the SS3 model, with built-in visualization tools for explainable AI.

#hyperparameter-optimization#explainable-artificial-intelligence#python-library
Stars348
Forks44
Last commit
Cybertron
CybertronGo

A pure Go package for running inference with pre-trained Transformer models from Hugging Face, enabling NLP tasks without external languages.

#text-classification#transformer-models#machine-translation
Stars329
Forks28
Last commit2 years ago
ELI5
ELI5Python

Scripts and tools to recreate the ELI5 dataset for long-form question answering research.

#dataset-creation#transformer-models#research-tools
Stars324
Forks42
Last commit4 years ago
PreviousPage 5 of 8

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
10 years ago
5 years ago
7 months ago
Next
#Machine Learning128
#Nlp89
#Text Analysis63
#Deep Learning61
#Python43
#Computer Vision33
#Text Processing32
#Named Entity Recognition31
#Python Library29
#Text Classification24
#Tensorflow23
#Ruby Gem22