An open-source OCR engine that converts images to text, supporting over 100 languages and multiple output formats.
A command-line tool that adds an OCR text layer to scanned PDF files, making them searchable and copy-pasteable.
A polyglot document intelligence framework with a Rust core for extracting text, metadata, and structured data from 91+ file formats.
An open-source RPA tool that automates repetitive tasks on websites, desktop apps, and the command line using a simple language.
A Go package for Optical Character Recognition (OCR) using the Tesseract C++ library.
A Java JNA wrapper for the Tesseract OCR API, enabling OCR functionality in Java applications.