Showing 27 of 27 projects
An open-source PDF platform for editing, converting, signing, and automating PDFs with a self-hostable API and desktop app.
An open-source OCR engine that converts images to text, supporting over 100 languages and multiple output formats.
A Python library for parsing diverse document formats into structured data, optimized for integration with generative AI applications.
A command-line tool that adds an OCR text layer to scanned PDF files, making them searchable and copy-pasteable.
A ready-to-use OCR Python library supporting 80+ languages and popular writing scripts like Latin, Chinese, Arabic, and Cyrillic.
A concise and elegant macOS dictionary and translation app with OCR, supporting 20+ services including Apple Dictionary, OpenAI, and DeepL.
A self-hosted document management system that scans, indexes, and archives paper documents with OCR and encryption.
A polyglot document intelligence framework with a Rust core for extracting text, metadata, and structured data from 91+ file formats.
A ready-to-go translation and OCR tool developed with WPF.
An open-source RPA tool that automates repetitive tasks on websites, desktop apps, and the command line using a simple language.
A high-performance deep learning library written entirely in Swift, optimized for Apple hardware.
Fast and simple OCR library for iOS/macOS using neural networks, optimized for short alphanumeric codes.
A fast and simple OCR library written in Swift, optimized for recognizing short alphanumeric codes like gift cards.
A local-first, ML-powered desktop application for translating manga, built in Rust with automated text detection, OCR, inpainting, and LLM translation.
A self-learning vector database with graph intelligence, local AI, and PostgreSQL integration, built for real-time adaptation.
A deep learning framework for training image classification models to solve complex captcha and OCR tasks.
A Go package for Optical Character Recognition (OCR) using the Tesseract C++ library.
Open source robotic process automation software for automating repetitive tasks across desktop and web applications.
A TensorFlow-based CNN solution for recognizing character-based CAPTCHAs, providing training, validation, and API modules.
A deep learning project using Keras to build convolutional and recurrent neural networks for high-accuracy captcha recognition.
Deep neural network to extract structured information from invoice documents with a customizable UI and training tools.
A CLI swiss army knife for image processing, offering wallpaper recoloring, OCR, AI upscaling, compression, and more.
A C library for efficient image processing and analysis, widely used in OCR and computer vision applications.
A lightweight desktop translator that translates and speaks text using multiple online translation APIs.
A macOS menu bar app that uses OCR to copy any text visible on your screen directly to your clipboard.
A Java JNA wrapper for Tesseract OCR API, enabling OCR functionality in Java applications.
A curated list of resources for Document Understanding (DU), covering research, datasets, tools, and applications in Intelligent Document Processing.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.