Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

Text Processing

267 projects

Showing 15 of 267 projects

cl-interpol (Common Lisp)

A Common Lisp library that modifies the reader to enable Perl/Shell-style string interpolation and character insertion.

Tags: #string interpolation, #dsl, #reader-macros
Stars: 71
Forks: 12
Last commit: 3 years ago

bytelines (Rust)

A Rust library for reading input lines as byte slices to avoid allocations and improve performance.

Tags: #line-processing, #memory-efficiency, #async
Stars: 71
Forks: 8
Last commit: 2 years ago

backscanner (Go)

A Go scanner that reads lines in reverse order from a given position, similar to bufio.Scanner but backward.

Tags: #efficient-search, #reverse, #log-processing
Stars: 69
Forks: 11
Last commit: 1 year ago

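To make the idea of backward line scanning concrete, here is a minimal Python sketch. It is not backscanner's Go API: the function name `reverse_lines` and the `chunk_size` parameter are hypothetical, and the sketch assumes a seekable binary stream with `\n` line endings. It reads fixed-size chunks from the end of the stream and carries any partial line over to the next chunk.

```python
import io
from typing import BinaryIO, Iterator


def reverse_lines(f: BinaryIO, chunk_size: int = 4096) -> Iterator[bytes]:
    """Yield the lines of a seekable binary stream from last to first.

    Hypothetical helper for illustration only; assumes '\n' line endings.
    """
    f.seek(0, io.SEEK_END)
    pos = f.tell()
    if pos == 0:
        return  # empty input: no lines at all

    tail = b""          # partial line carried over from the chunk read before
    first_chunk = True
    while pos > 0:
        step = min(chunk_size, pos)
        pos -= step
        f.seek(pos)
        chunk = f.read(step) + tail
        if first_chunk:
            first_chunk = False
            # A single trailing newline terminates the last line;
            # it does not start a new, empty one.
            if chunk.endswith(b"\n"):
                chunk = chunk[:-1]
        lines = chunk.split(b"\n")
        # The first piece may continue into the preceding chunk, so hold it back.
        tail = lines[0]
        for line in reversed(lines[1:]):
            yield line
    yield tail  # whatever remains is the first line of the stream


if __name__ == "__main__":
    log = io.BytesIO(b"first\nsecond\nthird\n")
    for line in reverse_lines(log, chunk_size=4):
        print(line.decode())
```

Reading in chunks from the end means the most recent entries of a large log can be inspected without loading the whole file, which is the use case the tags above point at.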
goreadability (Go)

A Go library for extracting readable content from webpages using Open Graph metadata and arc90's readability algorithm.

Tags: #readability, #content-extraction, #go-library
Stars: 69
Forks: 8
Last commit: 7 years ago

php-speller (PHP)

A PHP library for spell checking that supports multiple backends like Aspell, Hunspell, and Ispell.

Tags: #spell-checking, #aspell, #php-library
Stars: 68
Forks: 24
Last commit: 1 year ago

grappa (Java)

A Java library for writing parsers at runtime without preprocessing or a separate DSL.

Tags: #no-preprocessing, #dsl-free, #java
Stars: 67
Forks: 11
Last commit: 10 years ago

QtVerbalExpressions (C++)

A Qt library for building regular expressions using a fluent, human-readable API.

Tags: #qt, #developer-tools, #pattern-matching
Stars: 67
Forks: 12
Last commit: 11 years ago

SmarkDown (Swift)

A pure Swift implementation of the Markdown markup language, consistent with Gruber's 1.0.1 specification.

Tags: #library, #markdown-parser, #swift-package-manager
Stars: 67
Forks: 6
Last commit: 10 years ago

Pandex (Elixir)

Lightweight Elixir wrapper for Pandoc to convert between Markdown, HTML, LaTeX, and dozens of other document formats.

Tags: #elixir, #wrapper-library, #latex
Stars: 65
Forks: 11
Last commit: 7 years ago

flag d (HTML)

A Stage 4 ECMAScript proposal to add start and end indices for regex capture groups via the 'd' flag.

Tags: #tc39, #regex, #syntax-highlighting
Stars: 65
Forks: 9
Last commit: 4 years ago

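For readers unfamiliar with capture-group indices, the following Python sketch shows the kind of information the proposal describes: a start and end offset for the whole match and for each group. It uses Python's standard `re` module (`Match.span`), not the JavaScript 'd' flag itself, and the helper name `group_spans` is hypothetical.

```python
import re
from typing import List, Optional, Tuple


def group_spans(pattern: str, text: str) -> Optional[List[Tuple[int, int]]]:
    """Return (start, end) offsets for the whole match and every capture group.

    Hypothetical helper: a Python analogue of per-group match indices,
    not the ECMAScript API.
    """
    m = re.search(pattern, text)
    if m is None:
        return None
    # Index 0 is the whole match; 1..N are the capture groups.
    # Python reports (-1, -1) for a group that did not take part in the match.
    return [m.span(i) for i in range(m.re.groups + 1)]


if __name__ == "__main__":
    spans = group_spans(r"(\d{4})-(\d{2})", "on 2024-05")
    print(spans)  # [(3, 10), (3, 7), (8, 10)]
```

Offsets like these let a tool such as a syntax highlighter mark each captured piece in the original text without searching for it a second time.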
emoji-data-js (CoffeeScript)

Node.js library providing low-level operations for handling Unicode emoji encoding, variants, and parsing.

Tags: #unicode, #v8-optimized, #text-processing
Stars: 64
Forks: 25
Last commit: 10 years ago

alexa (Common Lisp)

A Common Lisp lexical analyzer generator that defines a domain-specific language within your Lisp program, eliminating the need for external tools.

Tags: #language-engineering, #lexical-analysis, #regex
Stars: 64
Forks: 6
Last commit: 1 year ago

DeepL (Python)

A Python library and CLI application for translating text using the DeepL neural machine translation service.

Tags: #application, #library, #language-processing
Stars: 63
Forks: 7
Last commit: 7 years ago

simetric (Elixir)

An Elixir library providing fast, comprehensive string similarity metrics for approximate matching.

Tags: #data-cleaning, #elixir, #distance
Stars: 63
Forks: 7
Last commit: 5 years ago

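As an illustration of what a string similarity metric for approximate matching computes, here is a minimal Python sketch of Levenshtein edit distance, one widely used metric of this kind. It does not reproduce simetric's Elixir API; the names `levenshtein` and `similarity` are hypothetical, and the normalization to a 0..1 score is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        cur = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1,          # delete ca
                           cur[j - 1] + 1,       # insert cb
                           prev[j - 1] + cost))  # substitute or match
        prev = cur
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Illustrative normalization: 1.0 for identical strings, 0.0 for no overlap."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))  # 3
    print(similarity("flaw", "lawn"))        # 0.5
```

A score like this can rank near-duplicate records during data cleaning, where exact equality would miss typos.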
caps (Go)

A Unicode-aware case conversion library for Go supporting camel, snake, kebab, and other naming conventions.

Tags: #unicode, #unicode-case-folding, #naming-conventions
Stars: 61
Forks: 0
Last commit: 2 years ago

Page 8 of 8

Related Tags

#Unicode (58)
#Markdown (33)
#Natural Language Processing (32)
#Go Library (31)
#Regex (30)
#Cli Tool (30)
#Go (28)
#Markdown Parser (27)
#Developer Tools (27)
#Golang (26)
#String Manipulation (25)
#Nodejs (23)