Nlp

225 projects

Showing 36 of 225 projects

getlangGo

A pure Go library for fast, offline natural language detection supporting 29 languages.

#iso-639#text-analysis#natural-language

Stars175

Forks23

Last commit5 years ago

wit-goGo

A Go client library for interacting with the Wit.ai natural language processing HTTP API.

#intent-recognition#go-client#entity-extraction

Stars171

Forks38

Last commit10 months ago

EDS_NLPPython

A modular NLP framework for extracting information from French clinical notes, compatible with spaCy and PyTorch.

#medical-text#spacy#fast

Stars164

Forks44

Last commit14 days ago

words_countedRuby

A Ruby natural language processor for tokenizing and analyzing text with flexible filtering and custom regex support.

#nlp-library#word-counter#text-analysis

Stars164

Forks28

Last commit4 years ago

open-solution-toxic-commentsPython

An open-source starter solution for the Kaggle Toxic Comment Classification Challenge, providing ready-to-use machine learning pipelines for detecting online harassment.

#ensemble-learning#text-classification#data-science

Interactive topic model visualization and interpretation library for Python, compatible with sklearn, Gensim, BERTopic, and Turftopic.

#bertopic#mantine#python-library

Stars148

Forks17

Last commit1 year ago

jProcessingOpenEdge ABL

Japanese Natural Langauge Processing Libraries

#word-sense-disambiguation#wsd#japanese

Stars147

Forks30

Last commit5 years ago

PaasaaElixir

An Elixir library for natural language and script detection using statistical analysis without AI.

#statistical-analysis#elixir#language-identification

Stars143

Forks14

Last commit22 days ago

Jupyter Notebooks for Digital Humanities

A curated collection of Jupyter notebooks for digital humanities research and teaching, covering text analysis, data visualization, and more.

#text-analysis#educational-resources#multilingual

Stars141

Forks19

Last commit3 years ago

wordnetRuby

A Ruby interface to the WordNet lexical database, enabling natural language processing and linguistic analysis.

#semantic-analysis#lexical-database#ruby-gem

Stars140

Forks25

Last commit3 years ago

steppyPython

A lightweight Python library for building reproducible machine learning pipelines with minimal interface constraints.

#experimentation#python-library#data-science

Stars136

Forks32

Last commit7 years ago

colibri-coreC++

A C++ and Python library for efficient extraction and analysis of n-grams, skipgrams, and flexgrams from large corpora.

#c-plus-plus-library#computational-linguistics#pattern-modeling

Stars131

Forks20

Last commit5 months ago

Introduction to Deep Learning Using Python (GitHub)Python

A hands-on workshop introducing deep learning concepts with practical examples using neural networks, CNNs, RNNs, and autoencoders.

#autoencoders#educational#deep-learning

A Go implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm for extracting keywords from text.

#rake-algorithm#information-retrieval#text-analysis

Stars124

Forks19

Last commit1 year ago

lemmatizerRuby

A Ruby gem for lemmatizing English text, converting inflected words to their base dictionary forms.

#text-analysis#nlp-tools#lemmatization

Stars112

Forks15

Last commit4 years ago

triple_accelRust

Rust edit distance library accelerated with SIMD for fast Hamming, Levenshtein, and Damerau-Levenshtein calculations.

#string-similarity#simd#string-matching

Stars110

Forks15

Last commit3 years ago

openwhisk-darkvisionappJavaScript

An application that uses IBM Watson AI services and Cloud Functions to analyze videos, extracting visual and audio insights for search and categorization.

#ibm-cloud#watson-ai#watson-visual-recognition

A Python library providing German language support for TextBlob, enabling NLP tasks like tokenization, POS tagging, and sentiment analysis.

#german-language#textblob-extension#python-library

Stars103

Forks12

Last commit1 year ago

Word TokenizersJulia

A Julia package providing high-performance, configurable tokenizers and sentence splitters for natural language processing.

#julia#computational-linguistics#sentence-splitting

Stars99

Forks25

Last commit4 years ago

NLIWOD's Question answering datasetsJava

A collection of tools, datasets, and approaches for building natural language interfaces to query the Web of Data.

#web-of-data#question-answering#natural-language-interfaces

Archived R package for accessing the Monkeylearn API for text classification and extraction.

#text-extraction#peer reviewed#text-classification

Stars92

Forks16

Last commit4 years ago

open-nlpRuby

Ruby bindings to the OpenNLP Java toolkit for natural language processing tasks like tokenization, POS tagging, and named entity recognition.

#java bindings#jruby#pos-tagging

Stars91

Forks11

Last commit1 year ago

Language Understanding (LUIS) SamplesC#

A collection of code samples demonstrating how to use Azure's Language Understanding (LUIS) service for natural language processing.

#language-understanding#chatbots#azure

A curated collection of books covering Artificial Intelligence, Machine Learning, Deep Learning, and Transformers for students and professionals.

#ai#python-ml#ai-agent

Stars83

Forks8

Last commit11 months ago

frogC++

A tagger, lemmatizer, morphological analyzer, and dependency parser for Dutch using memory-based NLP modules.

#c-plus-plus-library#computational-linguistics#memory-based-learning

Stars82

Forks12

Last commit1 month ago

tactful_tokenizerRuby

Accurate Bayesian sentence tokenizer in Ruby.

#rubynlp#ruby#nlp

Stars80

Forks13

Last commit12 years ago

uctoC++

A rule-based Unicode tokenizer that separates words from punctuation and splits sentences for NLP preprocessing.

#nlp-library#computational-linguistics#rule-based

Stars72

Forks14

Last commit1 month ago

chronicityCommon Lisp

A natural language date and time parser for Common Lisp, inspired by Ruby's Chronic.

#datetime#natural-language-processing#date-parsing

Stars69

Forks14

Last commit7 years ago

InsNet - A neural network library for building instance-dependent NLP models with padding-free dynamic batchingC++

InsNet Runs Instance-dependent Neural Networks with Padding-free Dynamic Batching.

#dynamic-batching#nlp#deep-learning-library

A Ruby wrapper for the spaCy NLP library via PyCall, enabling tokenization, POS tagging, NER, and OpenAI integration.

#parsing#nlp-library#spacy

Stars68

Forks6

Last commit4 days ago

gibranElixir

An Elixir natural language processor for tokenization, counting, and string similarity analysis.

Stars65

Forks3

Last commit9 years ago

SaulScala

Saul is a declarative domain-specific language in Scala for designing flexible machine learning models with relational feature extraction.

#declarative-programming#learning-models#ai-systems

Stars65

Forks18

Last commit6 years ago

HarmonyPython

A Python library using NLP and AI to help psychologists and social scientists harmonize questionnaire items across different languages and formats.

#depression#social-sciences#harmony

Stars64

Forks59

Last commit1 month ago

tensorlmPython

A TensorFlow wrapper library for character-level and word-level text generation using recurrent neural networks.

#char-rnn#python-library#deep-learning

Stars60

Forks28

Last commit4 years ago

Tensorflow-lite-kotlin-samplesKotlin

Kotlin implementations of TensorFlow Lite example Android apps for on-device machine learning.

#hacktoberfest#tensorflow-examples#on-device-ai

Stars59

Forks11

Last commit

max-text-sentiment-classifierPython

A pre-trained BERT-based model for detecting positive or negative sentiment in short text fragments.

#ibm#natural-language-understanding#api

Stars57

Forks32

Last commit10 months ago

PreviousPage 5 of 7Next

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub