Text Analysis

116 projects


German NLP resources

A curated list of open-access resources and tools for Natural Language Processing (NLP) focused on the German language.

#german-language #computational-linguistics #language-resources

Stars 528

Forks 67

Last commit 1 year ago

NarrativeQA (Shell)

A reading comprehension dataset with Wikipedia summaries, full stories, and question-answer pairs for narrative understanding.

#narrative-understanding #text-analysis #deep-learning

Stars 518

Forks 70

Last commit 6 years ago

woke (Go)

A text file analysis tool that detects non-inclusive language in source code and suggests inclusive alternatives.

#developer-tools #inclusive-language #pre-commit

Stars 516

Forks 62

Last commit 2 years ago

nlp (Go)

A Go library implementing selected machine learning algorithms for natural language processing and semantic analysis.

#semantic-analysis #tf-idf #text-analysis

Stars 475

Forks 46

Last commit 5 years ago

Sentimental (Ruby)

A Ruby gem for simple sentiment analysis that classifies text as positive, negative, or neutral based on configurable thresholds.

#text-classification #text-analysis #ruby-gem

Stars 465

Forks 72

Last commit 7 years ago

tone-analyzer-nodejs (CSS)

A Node.js sample application demonstrating the IBM Watson Tone Analyzer service for detecting emotional and language tones in text.

#sample-app #text-analysis #kubernetes

Stars 452

Forks 269

Last commit 4 years ago

stanford-core-nlp (Ruby)

Ruby bindings for the Stanford CoreNLP natural language processing toolkit, supporting English, French, and German.

#ruby-bindings #text-analysis #language-processing

Stars 436

Forks 69

Last commit 1 year ago

wordcloud2 (JavaScript)

An R package for creating interactive and customizable word cloud visualizations using wordcloud2.js.

#javascript-integration #r-package #text-analysis

Stars 413

Forks 107

Last commit 4 years ago

Digital Humanities (SCSS)

A curated list of tools, resources, and services for humanities scholars using quantitative or computational methods.

#open-source #academic-software #dh-tools

A Julia package providing standard tools and models for text analysis and natural language processing.

#nlp-library #julia #text-classification

Stars 384

Forks 92

Last commit 3 months ago

Implementation of various topic models in Python (Jupyter Notebook)

Python implementations of various topic modeling algorithms including LDA, collaborative topic models, and hierarchical Dirichlet processes.

#research-tool #probabilistic-modeling #bayesian-statistics

A curated collection of linguistic resources, datasets, and tools for Natural Language Processing and Computational Linguistics on Spanish.

#computational-linguistics #text-analysis #nlp-datasets

Stars 351

Forks 42

Last commit 2 years ago

stringi (C++)

Fast and portable character string processing in R using the Unicode ICU library.

#unicode #regex #stringi

Stars 318

Forks 48

Last commit

Topic Models Resources (R)

A curated collection of learning resources, R packages, and practical examples for understanding and applying topic modeling techniques.

#document-analysis #text-analysis #data-science

A Go implementation of the TextRank algorithm for automatic text summarization, phrase extraction, and keyword ranking with multithreading support.

#automatic-summarization #textrank #graph-algorithms

Stars 225

Forks 24

Last commit 1 year ago

Stylo (R)

An R package with a GUI for computational stylistics and authorship attribution through statistical text analysis.

#statistical-analysis #stylometry #r-package

Stars 222

Forks 51

Last commit 1 month ago

Cadmium (Just)

A comprehensive Natural Language Processing (NLP) library for the Crystal programming language.

#readability #nlp-library #modular-architecture

Stars 211

Forks 14

Last commit 6 months ago

Awesome Community-Curated NLP List

A community-curated list of NLP tools, libraries, datasets, and resources across speech processing, text analysis, and machine translation.

#community-driven #text-analysis #nlp-tools

Stars 202

Forks 32

Last commit 4 years ago

lmgrep (Clojure)

A grep-like CLI utility that searches text files using Lucene query syntax, compiled to a native binary for fast startup.

#text-analysis #lucene #graalvm-native-image

Stars 200

Forks 5

Last commit 1 year ago

Rousseau (JavaScript)

A lightweight JavaScript proofreader that checks writing style, readability, and common errors in text.

#readability #proofreading #open-source

Stars 188

Forks 21

Last commit 4 years ago

getlang (Go)

A pure Go library for fast, offline natural language detection supporting 29 languages.

#iso-639 #text-analysis #natural-language

Stars 175

Forks 23

Last commit 5 years ago

words_counted (Ruby)

A Ruby natural language processor for tokenizing and analyzing text with flexible filtering and custom regex support.

#nlp-library #word-counter #text-analysis

Stars 164

Forks 28

Last commit 4 years ago

Qtypes (JavaScript)

A rule-based question classification system for Node.js that categorizes questions by type and answer format.

#qa-systems #nlp-library #text-analysis

Stars 160

Forks 27

Last commit 9 years ago

topicwizard (Python)

An interactive topic model visualization and interpretation library for Python, compatible with sklearn, Gensim, BERTopic, and Turftopic.

#bertopic #mantine #python-library

Stars 148

Forks 17

Last commit 1 year ago

Jupyter Notebooks for Digital Humanities

A curated collection of Jupyter notebooks for digital humanities research and teaching, covering text analysis, data visualization, and more.

#text-analysis #educational-resources #multilingual

Stars 141

Forks 19

Last commit 3 years ago

natural-language-understanding-nodejs (JavaScript)

A Node.js sample application demonstrating IBM Watson Natural Language Understanding service features.

#sample-app #natural-language-understanding #text-analysis

A fast implementation of the Porter stemming algorithm for English word normalization in natural language processing.

#stemmer #stemming #text-analysis

Stars 138

Forks 8

Last commit 3 years ago

lda-ruby (Ruby)

A Ruby wrapper for Latent Dirichlet Allocation (LDA) that clusters documents into topics with native, Rust, and pure Ruby backends.

#text-analysis #ruby-wrapper #ruby-gem

Stars 134

Forks 30

Last commit 2 months ago

ClearTK (Java)

A Java framework for developing statistical natural language processing (NLP) components on Apache UIMA.

#statistical-nlp #text-analysis #language-processing

Stars 133

Forks 58

Last commit 3 years ago

colibri-core (C++)

A C++ and Python library for efficient extraction and analysis of n-grams, skipgrams, and flexgrams from large corpora.

#c-plus-plus-library #computational-linguistics #pattern-modeling

Stars 131

Forks 20

Last commit 5 months ago

RAKE.go (Go)

A Go implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm for extracting keywords from text.

#rake-algorithm #information-retrieval #text-analysis

Stars 124

Forks 19

Last commit 1 year ago

lemmatizer (Ruby)

A Ruby gem for lemmatizing English text, converting inflected words to their base dictionary forms.

#text-analysis #nlp-tools #lemmatization

Stars 112

Forks 15

Last commit 4 years ago

topik (Python)

A high-level Python toolbox for topic modeling with easy-to-use functions and a command-line interface.

#text-analysis #data-science #natural-language-processing

Stars 93

Forks 23

Last commit 10 years ago

pragmatic_tokenizer (Ruby)

A multilingual Ruby gem for splitting strings into tokens with extensive language support and configurable options.

#nlp-library #text-analysis #multilingual

Stars 93

Forks 11

Last commit 1 year ago

MonkeyLearn (R)

An archived R package for accessing the MonkeyLearn API for text classification and extraction.

#text-extraction #peer reviewed #text-classification

Stars 92

Forks 16

Last commit 4 years ago

Hierarchical Attention Networks (Python)

A TensorFlow implementation of hierarchical attention networks for document classification using GRU cells and attention mechanisms.

#hierarchical-networks #text-classification #text-analysis

Stars 87

Forks 25

Last commit
