Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Natural Language Processing

Natural Language Processing

268 projects

Showing 36 of 268 projects

java-string-similarity
java-string-similarityJava

A Java library implementing various string similarity and distance algorithms like Levenshtein, Jaro-Winkler, and n-gram methods.

#algorithm-implementation#string-similarity#distance-measure
Stars2.7k
Forks416
Last commit
Data Science Projects
Data Science ProjectsJupyter Notebook

A curated collection of hands-on data science project ideas and resources for learning machine learning and AI concepts.

#data-science#kaggle#deep-learning
Stars2.7k
Forks630
Last commit2 years ago
gojieba
gojiebaGo

A high-performance Golang port of the Jieba Chinese text segmentation library.

#part-of-speech-tagging#search-engine-tokenization#cgo
Stars2.6k
Forks304
Last commit1 month ago
Betty
BettyRuby

A friendly English-like interface for your command line that translates natural language phrases into executable commands.

#developer-tools#productivity#shell-scripting
Stars2.6k
Forks210
Last commit5 years ago
EasyLM
EasyLMPython

A JAX/Flax-based framework for easy and scalable pre-training, fine-tuning, evaluation, and serving of large language models.

#transformer#distributed-training#jax
Stars2.5k
Forks260
Last commit1 year ago
retext
retextJavaScript

A natural language processor powered by plugins that transforms and analyzes text using syntax trees.

#open-source#retext#text-analysis
Stars2.4k
Forks92
Last commit1 year ago
Texar
TexarPython

A modular toolkit for machine learning, natural language processing, and text generation with TensorFlow and PyTorch versions.

#model-training#research-toolkit#transformer-models
Stars2.4k
Forks368
Last commit4 years ago
textacy
textacyPython

A Python NLP library built on spaCy for text preprocessing, feature extraction, and analysis tasks.

#nlp-library#computational-linguistics#spacy
Stars2.2k
Forks247
Last commit2 years ago
Curated list of R tutorials for Data Science, NLP and Machine Learning
Curated list of R tutorials for Data Science, NLP and Machine LearningR

A curated collection of R tutorials, packages, and resources for Data Science, NLP, and Machine Learning.

#data-science#statistics#r-programming
Stars2.1k
Forks879
Last commit
ScispaCy
ScispaCyPython

A spaCy pipeline and models specifically designed for processing scientific and biomedical documents.

#biomedical-nlp#scientific-text#spacy
Stars2.0k
Forks254
Last commit6 months ago
spaGO
spaGOGo

A self-contained machine learning and natural language processing library written in pure Go with a dynamic computational graph.

#neural-network#deep-learning#nlp-framework
Stars1.9k
Forks89
Last commit1 year ago
aitextgen
aitextgenPython

A Python package for fine-tuning and generating text with GPT-2 and GPT Neo models using PyTorch and Hugging Face Transformers.

#text-generation#fine-tuning#natural-language-processing
Stars1.8k
Forks215
Last commit2 years ago
TensorFlow Fold
TensorFlow FoldPython

A library for creating TensorFlow models that handle structured data with dynamic computation graphs using dynamic batching.

#deep-learning#neural-networks#natural-language-processing
Stars1.8k
Forks263
Last commit
go-pinyin
go-pinyinGo

A Go library and CLI tool for converting Chinese characters to Hanyu Pinyin with tone support.

#pinyin#internationalization#go-library
Stars1.8k
Forks206
Last commit3 months ago
Memory Networks Implementations - Facebook
Memory Networks Implementations - FacebookLua

Implementations of memory-augmented neural networks for language modeling, dialogue systems, and question answering tasks.

#babi-dataset#neural-networks#question-answering
Stars1.8k
Forks370
Last commit
PyTorch Lightning Bolts
PyTorch Lightning BoltsPython

A collection of models, callbacks, and datasets to extend PyTorch Lightning for applied AI/ML research and production.

#ai#callbacks#model-training
Stars1.8k
Forks315
Last commit4 months ago
NeuroNER
NeuroNERPython

An easy-to-use, state-of-the-art named-entity recognition (NER) tool based on neural networks.

#text-analysis#deep-learning#neural-networks
Stars1.7k
Forks472
Last commit3 years ago
SciBERT
SciBERTPython

A BERT language model pre-trained on a large corpus of scientific papers for natural language processing tasks in scientific domains.

#relation-extraction#scientific-text#text-classification
Stars1.7k
Forks231
Last commit4 years ago
BiDAF
BiDAFPython

A deep learning model for machine comprehension that uses bi-directional attention flow to answer questions about text passages.

#squad#neural-network#deep-learning
Stars1.5k
Forks669
Last commit3 years ago
Awesome Document Understanding
Awesome Document Understanding

A curated list of resources for Document Understanding (DU), covering research, datasets, tools, and applications in Intelligent Document Processing.

#key-information-extraction#document-understanding#document-analysis
Stars1.5k
Forks170
Last commit3 years ago
node-sdk
node-sdkTypeScript

Node.js client library for accessing IBM Watson AI services like Assistant, Speech-to-Text, and Natural Language Understanding.

#hacktoberfest#ai-api#language-translation
Stars1.5k
Forks650
Last commit3 days ago
when
whenGo

A natural language date/time parser with pluggable rules and merge strategies for Go applications.

#datetime#natural-language#time-parsing
Stars1.5k
Forks94
Last commit1 year ago
python-sdk
python-sdkPython

A Python client library for interacting with IBM Watson AI services, available via pip as ibm-watson.

#hacktoberfest#cloud-services#authentication
Stars1.4k
Forks821
Last commit4 months ago
NLG-eval
NLG-evalPython

A Python library for evaluating natural language generation models using multiple unsupervised automated metrics.

#cider#nlp-evaluation#unsupervised-metrics
Stars1.4k
Forks226
Last commit1 year ago
Jieba-PHP
Jieba-PHPPHP

A PHP Chinese text segmentation module offering precise, full, and search engine modes with support for Traditional Chinese and CJK languages.

#chinese-text-segmentation#open-source#pos-tagging
Stars1.4k
Forks258
Last commit5 months ago
treat
treatRuby

A comprehensive natural language processing framework for Ruby with support for text extraction, parsing, and machine learning.

#text-extraction#computational-linguistics#text-analysis
Stars1.4k
Forks125
Last commit1 year ago
lingua-go
lingua-goGo

The most accurate natural language detection library for Go, excelling with short text and mixed-language content.

#statistical-analysis#language-identification#language-recognition
Stars1.3k
Forks79
Last commit1 year ago
duckling
ducklingClojure

A Clojure library that parses natural language text into structured data like dates, times, and durations.

#time-parsing#entity-extraction#clojure-library
Stars1.3k
Forks209
Last commit7 years ago
Awesome Deep Learning for Natural Language Processing
Awesome Deep Learning for Natural Language Processing

A curated collection of resources for deep learning applications in natural language processing.

#ai-frameworks#research-papers#deep-learning
Stars1.3k
Forks256
Last commit4 months ago
DeepMind QA Corpus
DeepMind QA CorpusPython

Script to generate question/answer pairs from CNN and Daily Mail articles for machine reading comprehension research.

#deep-learning#question-answering#natural-language-processing
Stars1.3k
Forks240
Last commit9 years ago
Ruby NLP
Ruby NLP

A curated collection of Ruby libraries, tools, and resources for Natural Language Processing (NLP).

#ruby-ecosystem#nlp-resources#ruby-gems
Stars1.3k
Forks105
Last commit3 years ago
VLM Architectures
VLM ArchitecturesMarkdown

A curated repository of famous Vision-Language Models (VLMs) detailing their architectures, training procedures, and datasets.

#llava#open-source#internlm
Stars1.3k
Forks54
Last commit4 months ago
semantic-grep
semantic-grepGo

A command-line tool that performs semantic searches on text using word embeddings to find words with similar meaning to the query.

#multi-language#semantic-search#text-analysis
Stars1.2k
Forks27
Last commit1 year ago
awesome Information Retrieval
awesome Information Retrieval

A curated list of awesome resources for information retrieval and web search, including books, courses, datasets, and software.

#research-datasets#information-retrieval#natural-language-processing
Stars1.2k
Forks142
Last commit3 years ago
Awesome Vision + Language
Awesome Vision + Language

A curated list of recent research papers and resources on Vision and Language Pre-trained Models (VL-PTMs).

#vision-language-pretraining#vl-ptms#vision-and-language
Stars1.2k
Forks104
Last commit3 years ago
PPLM
PPLMPython

A method to steer topic and attributes of GPT-2 language models without fine-tuning, enabling controlled text generation.

#text-steering#research-tool#deep-learning
Stars1.2k
Forks204
Last commit2 years ago
PreviousPage 3 of 8

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
4 years ago
3 years ago
5 years ago
5 years ago
Next
#Machine Learning128
#Nlp89
#Text Analysis63
#Deep Learning61
#Python43
#Computer Vision33
#Text Processing32
#Named Entity Recognition31
#Python Library29
#Text Classification24
#Tensorflow23
#Ruby Gem22