An unsupervised text tokenizer and detokenizer based on subword units, designed for neural network-based text generation systems.
A minimalistic, single-header JSON tokenizer/parser in C for resource-limited and embedded systems.
A fast, feature-rich parser-building toolkit for JavaScript, supporting LL(k) and LL(*) grammars.
A high-performance, browser-grade HTML5 parser written in Rust, developed as part of the Servo project.