Natural Language Processing

#hacktoberfest#cloud-ai#natural-language-processing

dotnet-standard-sdkC#

A .NET Standard library for accessing IBM Watson cognitive services like Assistant, Discovery, and Speech-to-Text.

Stars148

Forks114

Image Caption GeneratorJupyter Notebook

A TensorFlow-based neural network model for generating descriptive captions from images using Flickr30K and MSCOCO datasets.

#neural-network#deep-learning#captioning-images

Stars145

Forks55

Deep Belief Nets for Topic ModelingPython

A Python toolbox using deep belief networks for topic modeling on document data, producing latent representations for content-based recommendation.

#deep-belief-networks#research-tool#document-analysis

A Ruby interface to the WordNet lexical database, enabling natural language processing and linguistic analysis.

#semantic-analysis#lexical-database#ruby-gem

Stars140

Forks25

natural-language-understanding-nodejsJavaScript

A Node.js sample application demonstrating IBM Watson Natural Language Understanding service features.

#sample-app#natural-language-understanding#text-analysis

A fast implementation of the Porter stemming algorithm for English word normalization in natural language processing.

#stemmer#stemming#text-analysis

Stars138

Forks8

#text-analysis#ruby-wrapper#ruby-gem

lda-rubyRuby

A Ruby wrapper for Latent Dirichlet Allocation (LDA) that clusters documents into topics with native, Rust, and pure Ruby backends.

Stars134

Forks30

Last commit2 months ago

ClearTKJava

A Java framework for developing statistical natural language processing (NLP) components on Apache UIMA.

#statistical-nlp#text-analysis#language-processing

Stars133

Forks58

#c-plus-plus-library#computational-linguistics#pattern-modeling

colibri-coreC++

A C++ and Python library for efficient extraction and analysis of n-grams, skipgrams, and flexgrams from large corpora.

Stars131

Forks20

Last commit5 months ago

RAKE.goGo

A Go implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm for extracting keywords from text.

#rake-algorithm#information-retrieval#text-analysis

Stars124

Forks19

#research-tool#academic-software#diagram-generator

rsyntaxtreeRuby

A graphical syntax tree generator for linguistic research that creates publication-quality tree diagrams from bracket notation.

Stars122

Forks18

Last commit6 days ago

nickelRuby

A Ruby gem that extracts structured date, time, and message information from naturally worded text.

#datetime#reminders#time-parsing

Stars118

Forks17

Last commit8 years ago

lemmatizerRuby

A Ruby gem for lemmatizing English text, converting inflected words to their base dictionary forms.

#text-analysis#nlp-tools#lemmatization

Stars112

Forks15

Last commit4 years ago

textblob-dePython

A Python library providing German language support for TextBlob, enabling NLP tasks like tokenization, POS tagging, and sentiment analysis.

#german-language#textblob-extension#python-library

Stars103

Forks12

#commonjs#information-retrieval#natural-language-processing

porter-stemmerJavaScript

A Node.js implementation of Martin Porter's stemming algorithm for removing morphological endings from English words.

Stars102

Forks12

Last commit5 years ago

UralicNLPPython

A natural language processing library for Uralic and other languages, offering morphological analysis, generation, lemmatization, and lexical information.

#sami#nlp-library#computational-linguistics

Stars100

Forks8

Last commit4 months ago

Word TokenizersJulia

A Julia package providing high-performance, configurable tokenizers and sentence splitters for natural language processing.

#julia#computational-linguistics#sentence-splitting

Stars99

Forks25

Last commit4 years ago

ai-cmdShell

A Zsh plugin that converts natural language descriptions into shell commands using AI, with ghost text preview.

#developer-tools#productivity#ai-assistant

Stars98

Forks19

Last commit15 days ago

GraphQuestionsReScript

A characteristic-rich dataset for factoid question answering with explicit question specifications to enable fine-grained QA system evaluation.

#nlp-research#question-answering#natural-language-processing

Stars94

Forks14

#nlp-library#text-analysis#multilingual

pragmatic_tokenizerRuby

A multilingual Ruby gem for splitting strings into tokens with extensive language support and configurable options.

Stars93

Forks11

#text-analysis#data-science#natural-language-processing

topikPython

A high-level Python toolbox for topic modeling with easy-to-use functions and command-line interface.

Stars93

Forks23

Last commit10 years ago

ruby-nlpRuby

Ruby bindings for Stanford NLP tools providing part-of-speech tagging and named entity recognition capabilities.

#part-of-speech-tagging#nlp-tools#natural-language-processing

Stars92

Forks14

Last commit12 years ago

MonkeyLearnR

Archived R package for accessing the Monkeylearn API for text classification and extraction.

#text-extraction#peer reviewed#text-classification

Stars92

Forks16

Last commit4 years ago

TensorFlow Lite Examples - AndroidKotlin

A collection of refactored, high-quality Android examples demonstrating TensorFlow Lite for on-device machine learning tasks.

#android#model-deployment#minst

Ruby bindings to the OpenNLP Java toolkit for natural language processing tasks like tokenization, POS tagging, and named entity recognition.

#java bindings#jruby#pos-tagging

Stars91

Forks11

#nlp-library#sentence-boundaries#nltk

punkt-segmenterRuby

A Ruby port of the NLTK Punkt algorithm for unsupervised, language-independent sentence boundary detection.

Stars91

Forks9

Last commit8 years ago

segmentGo

A Go library for Unicode text segmentation at word boundaries as defined by Unicode Standard Annex #29.

#unicode#word-boundaries#ragel

Stars89

Forks15

Language Understanding (LUIS) SamplesC#

A collection of code samples demonstrating how to use Azure's Language Understanding (LUIS) service for natural language processing.

#language-understanding#chatbots#azure

Stars87

Forks135

Hierarchical Attention NetworksPython

TensorFlow implementation of hierarchical attention networks for document classification using GRU cells and attention mechanisms.

#hierarchical-networks#text-classification#text-analysis

Stars87

Forks25