Showing 9 of 9 projects
An open-source OCR engine that converts images to text, supporting over 100 languages and multiple output formats.
A command-line tool that adds an OCR text layer to scanned PDF files, making them searchable and copy-pasteable.
A polyglot document intelligence framework with a Rust core for extracting text, metadata, and structured data from 91+ file formats.
An open-source RPA tool that automates repetitive tasks on websites, desktop apps, and the command line using a simple language.
A Go package for Optical Character Recognition (OCR) using the Tesseract C++ library.
A Java JNA wrapper for Tesseract OCR API, enabling OCR functionality in Java applications.
A lightweight Linux desktop application that extracts text from images using OCR with drag-and-drop simplicity.
A simple OCR API server that's easy to deploy with Docker or on Heroku.
A Ruby wrapper library that provides Ruby bindings and a Ruby-esque interface to the Tesseract OCR API.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.