Open-Awesome
© 2026 Open-Awesome. Curated for the developer elite.

Natural Language Processing

92 projects

Showing 20 of 92 projects

java-string-similarity (Java)

A Java library implementing various string similarity and distance algorithms like Levenshtein, Jaro-Winkler, and n-gram methods.

#algorithm-implementation #string-similarity #distance-measure
Stars: 2.7k
Forks: 415
Last commit
gojieba (Go)

A high-performance Golang port of the Jieba Chinese text segmentation library.

#part-of-speech-tagging #search-engine-tokenization #cgo
Stars: 2.6k
Forks: 303
Last commit: 1 month ago
Data Science Projects (Jupyter Notebook)

A curated collection of hands-on data science project ideas and resources for learning machine learning and AI concepts.

#data-science #kaggle #deep-learning
Stars: 2.6k
Forks: 623
Last commit: 2 years ago
Betty (Ruby)

A friendly English-like interface for your command line that translates natural language phrases into executable commands.

#developer-tools #productivity #shell-scripting
Stars: 2.6k
Forks: 210
Last commit: 4 years ago
EasyLM (Python)

A JAX/Flax-based framework for easy and scalable pre-training, fine-tuning, evaluation, and serving of large language models.

#transformer #distributed-training #jax
Stars: 2.5k
Forks: 261
Last commit: 1 year ago
retext (JavaScript)

A natural language processor powered by plugins that transforms and analyzes text using syntax trees.

#open-source #retext #text-analysis
Stars: 2.4k
Forks: 92
Last commit: 1 year ago
Texar (Python)

A modular toolkit for machine learning, natural language processing, and text generation with TensorFlow and PyTorch versions.

#model-training #research-toolkit #transformer-models
Stars: 2.4k
Forks: 368
Last commit: 4 years ago
textacy (Python)

A Python NLP library built on spaCy for text preprocessing, feature extraction, and analysis tasks.

#nlp-library #computational-linguistics #spacy
Stars: 2.2k
Forks: 249
Last commit: 2 years ago
Curated list of R tutorials for Data Science, NLP and Machine Learning (R)

A curated collection of R tutorials, packages, and resources for Data Science, NLP, and Machine Learning.

#data-science #statistics #r-programming
Stars: 2.1k
Forks: 876
Last commit
ScispaCy (Python)

A spaCy pipeline and models specifically designed for processing scientific and biomedical documents.

#biomedical-nlp #scientific-text #spacy
Stars: 1.9k
Forks: 253
Last commit: 4 months ago
spaGO (Go)

A self-contained machine learning and natural language processing library written in pure Go with a dynamic computational graph.

#neural-network #deep-learning #nlp-framework
Stars: 1.8k
Forks: 89
Last commit: 1 year ago
aitextgen (Python)

A Python package for fine-tuning and generating text with GPT-2 and GPT Neo models using PyTorch and Hugging Face Transformers.

#text-generation #fine-tuning #natural-language-processing
Stars: 1.8k
Forks: 215
Last commit: 2 years ago
TensorFlow Fold (Python)

A library for creating TensorFlow models that handle structured data with dynamic computation graphs using dynamic batching.

#deep-learning #neural-networks #natural-language-processing
Stars: 1.8k
Forks: 263
Last commit
go-pinyin (Go)

A Go library and CLI tool for converting Chinese characters to Hanyu Pinyin with tone support.

#pinyin #internationalization #go-library
Stars: 1.8k
Forks: 205
Last commit: 1 month ago
Memory Networks Implementations - Facebook (Lua)

Implementations of memory-augmented neural networks for language modeling, dialogue systems, and question answering tasks.

#babi-dataset #neural-networks #question-answering
Stars: 1.8k
Forks: 370
Last commit
PyTorch Lightning Bolts (Python)

A collection of models, callbacks, and datasets to extend PyTorch Lightning for applied AI/ML research and production.

#ai #callbacks #model-training
Stars: 1.8k
Forks: 316
Last commit: 3 months ago
NeuroNER (Python)

An easy-to-use, state-of-the-art named-entity recognition (NER) tool based on neural networks.

#text-analysis #deep-learning #neural-networks
Stars: 1.7k
Forks: 472
Last commit: 3 years ago
SciBERT (Python)

A BERT language model pre-trained on a large corpus of scientific papers for natural language processing tasks in scientific domains.

#relation-extraction #scientific-text #text-classification
Stars: 1.7k
Forks: 232
Last commit: 4 years ago
BiDAF (Python)

A deep learning model for machine comprehension that uses bi-directional attention flow to answer questions about text passages.

#squad #neural-network #deep-learning
Stars: 1.5k
Forks: 670
Last commit: 2 years ago
Awesome Document Understanding

A curated list of resources for Document Understanding (DU), covering research, datasets, tools, and applications in Intelligent Document Processing.

#key-information-extraction #document-understanding #document-analysis
Stars: 1.5k
Forks: 170
Last commit: 2 years ago
Related Tags

#Machine Learning (56)
#Deep Learning (39)
#Nlp (32)
#Python (25)
#Computer Vision (21)
#Python Library (19)
#Pytorch (15)
#Neural Networks (14)
#Text Analysis (13)
#Tensorflow (13)
#Named Entity Recognition (12)
#Data Science (11)