Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Unicode

Unicode

118 projects

Showing 36 of 118 projects

Big List of Naughty Strings
Big List of Naughty StringsPython

A curated list of strings likely to cause issues when used as user-input data, for automated and manual QA testing.

#software-testing#unicode#input-validation
Stars47.7k
Forks2.2k
Last commit2 years ago
{fmt}
{fmt}C++

A fast, safe, and modern C++ formatting library providing a type-safe alternative to printf and iostreams.

#formatting#ranges#unicode
Stars23.6k
Forks2.9k
Last commit3 days ago
xterm.js
xterm.jsTypeScript

A frontend component that enables fully-featured terminal emulation in web browsers.

#terminal-emulator#developer-tools#unicode
Stars20.7k
Forks1.9k
Last commit2 days ago
twemoji
twemojiHTML

A library providing standard Unicode emoji support across all platforms with consistent vector and raster graphics.

#web-assets#emoji#svg-assets
Stars17.7k
Forks1.9k
Last commit5 months ago
RapidJSON
RapidJSONC++

A fast, header-only JSON parser and generator for C++ with SAX/DOM APIs, optimized for performance and memory efficiency.

#unicode#high-performance#json-generator
Stars15.1k
Forks3.6k
Last commit1 year ago
RapidJSON
RapidJSONC++

A fast, header-only JSON parser and generator for C++ with SAX/DOM APIs, emphasizing performance and minimal dependencies.

#unicode#high-performance#json-generator
Stars15.1k
Forks3.6k
Last commit1 year ago
RapidJSON
RapidJSONC++

A fast, header-only JSON parser and generator for C++ with SAX and DOM APIs.

#unicode#high-performance#json-generator
Stars15.1k
Forks3.6k
Last commit1 year ago
Syntax
SyntaxC++

A fast, safe, thread-friendly regular expression library with linear-time matching guarantees, designed for production use.

#linear-time-matching#unicode#c-plus-plus
Stars9.7k
Forks1.2k
Last commit4 months ago
grex
grexRust

A command-line tool and library that generates regular expressions from user-provided test cases.

#developer-tools#webassembly#unicode
Stars8.1k
Forks193
Last commit3 months ago
HarfBuzz
HarfBuzzC++

A robust, high-performance text shaping engine and font platform supporting OpenType and Apple Advanced Typography.

#c-library#text-shaping#graphics
Stars5.7k
Forks750
Last commit1 day ago
gemoji
gemojiRuby

A Ruby library providing emoji character information, names, and Unicode mappings for emoji processing.

#rails-helper#emoji#unicode
Stars4.5k
Forks801
Last commit6 months ago
ftfy
ftfyPython

A Python library that fixes mojibake and other Unicode text glitches by detecting and correcting encoding mix-ups.

#data-cleaning#encoding-fixes#unicode
Stars4.0k
Forks126
Last commit1 year ago
python-ftfy
python-ftfyPython

A Python library that fixes mojibake and other Unicode text glitches by detecting and correcting encoding mix-ups.

#data-cleaning#unicode#python-library
Stars4.0k
Forks126
Last commit1 year ago
regex
regexRust

A Rust regular expression library with guaranteed linear time matching using finite automata.

#parsing#dfa#automata
Stars4.0k
Forks509
Last commit1 day ago
CTRE
CTREC++

A C++ library for compile-time regular expression matching, searching, and capturing with PCRE-like syntax.

#parsing#unicode#constexpr
Stars3.8k
Forks204
Last commit1 month ago
mimic
mimicPython

A command-line tool that replaces ASCII characters with visually similar Unicode homoglyphs to create confusing and frustrating code.

#programming-humor#unicode#code-obfuscation
Stars3.8k
Forks103
Last commit2 years ago
he
heJavaScript

A robust JavaScript library for encoding and decoding HTML entities with full spec compliance.

#encoder#text-encoding#unicode
Stars3.6k
Forks261
Last commit4 years ago
ICU
ICUC++

A mature, widely-used set of C/C++ and Java libraries providing Unicode and globalization support for software applications.

#c-library#java-library#unicode
Stars3.5k
Forks892
Last commit3 days ago
StringZilla
StringZillaC

A high-performance string library leveraging SIMD and SWAR to accelerate search, hashing, sorting, and edit distances across C, C++, Python, Rust, and more.

#memory-mapping#substring#information-retrieval
Stars3.5k
Forks125
Last commit1 day ago
iconv-lite
iconv-liteJavaScript

A pure JavaScript library for converting character encodings with no native dependencies.

#unicode#text-processing#character-encoding
Stars3.2k
Forks295
Last commit1 month ago
Twitter Text Obj
Twitter Text ObjHTML

A collection of libraries for parsing Tweet text, including character counting and entity linking.

#multi-language#emoji#social-media
Stars3.1k
Forks530
Last commit2 years ago
Twitter Text Java
Twitter Text JavaHTML

A collection of libraries and conformance tests for parsing Tweet text, including character counting and entity linking.

#multi-language#emoji#social-media
Stars3.1k
Forks530
Last commit2 years ago
chardet
chardetPython

A Python library that automatically detects the character encoding of text files and byte streams with high accuracy and speed.

#encoding-detection#unicode#python-library
Stars2.6k
Forks300
Last commit1 month ago
plog
plogC++

A portable, simple, and extensible header-only C++ logging library in about 1000 lines of code.

#embedded-systems#library#unicode
Stars2.5k
Forks408
Last commit4 months ago
Oniguruma
OnigurumaC

A modern, flexible regular expression library supporting multiple character encodings and syntaxes.

#c-library#gnu-regex#multi-encoding
Stars2.5k
Forks350
Last commit1 year ago
RE
REC

A modern, flexible regular expression library supporting multiple character encodings and syntaxes.

#c-library#gnu-regex#multi-encoding
Stars2.5k
Forks350
Last commit1 year ago
base64Captcha
base64CaptchaGo

A flexible Go package for generating and verifying captchas as base64-encoded image or audio strings.

#bot-protection#web-security#unicode
Stars2.4k
Forks309
Last commit8 months ago
lettre
lettreRust

A Rust library for sending emails with multiple transport methods, SMTP security, and async support.

#hacktoberfest#unicode#authentication
Stars2.2k
Forks224
Last commit11 days ago
utf8.h
utf8.hC++

A single-header library providing UTF-8 string functions for C and C++, mirroring the standard string.h API.

#library#unicode#c
Stars2.0k
Forks139
Last commit7 days ago
ansiweather
ansiweatherShell

A shell script that displays current weather conditions in the terminal with ANSI colors and Unicode symbols.

#unicode#openweathermap#unix
Stars1.9k
Forks147
Last commit5 months ago
emoji-regex
emoji-regexJavaScript

A regular expression to match all emoji symbols and sequences as per the Unicode Standard.

#emoji#unicode#regex
Stars1.9k
Forks174
Last commit7 months ago
simdutf
simdutfC++

A high-performance C++ library for Unicode validation and transcoding (UTF-8/16/32, Latin1, Base64) using SIMD instructions.

#utf16#transcoding#sse2
Stars1.8k
Forks136
Last commit7 days ago
emojilib
emojilibJavaScript

A JavaScript library providing a keyword-to-emoji mapping for making emoji searchable.

#search#emoji#unicode
Stars1.8k
Forks302
Last commit1 month ago
ada
adaC++

A fast, spec-compliant URL parser written in modern C++, used by Node.js, Cloudflare Workers, and other major systems.

#sse2#cloudflare-workers#unicode
Stars1.7k
Forks126
Last commit1 day ago
JPHP
JPHPJava

JPHP is a PHP compiler that runs on the Java Virtual Machine, enabling use of Java libraries and building desktop/Android apps.

#java-interop#gradle#compiler
Stars1.7k
Forks165
Last commit4 years ago
python-slugify
python-slugifyPython

A Python library that converts Unicode strings into URL-friendly slugs with extensive customization options.

#unicode#slug-generation#url-slugs
Stars1.6k
Forks122
Last commit1 month ago
Page 1 of 4Next

Related Tags

#Text Processing56#Emoji22#Character Encoding17#Internationalization16#Cross Platform15#String Manipulation14#Performance14#Nodejs13#Utf 813#Developer Tools13#C Plus Plus13#Typography10
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub