Ocr

52 projects

Showing 36 of 52 projects

An open-source PDF platform for editing, converting, signing, and automating PDFs with a self-hostable API and desktop app.

#pdf-editing#pdf-ocr#pdf-conversion

Stars87.5k

Forks7.7k

Last commit2 days ago

tesseractC++

An open-source OCR engine that converts images to text, supporting over 100 languages and multiple output formats.

#c-plus-plus-library#hacktoberfest#open-source

Stars75.4k

Forks10.7k

Last commit2 days ago

doclingPython

A Python library for parsing diverse document formats into structured data, optimized for integration with generative AI applications.

#ai#tables#documents

Stars63.5k

Forks4.5k

Last commit4 days ago

SiYuanTypeScript

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

#obsidian#notes-app#note-taking

A community-supported supercharged document management system: scan, index and archive all your documents

#dms#archiving#document management system

A command-line tool that adds an OCR text layer to scanned PDF files, making them searchable and copy-pasteable.

#text-extraction#pdf-ocr#pdf-a

Stars34.2k

Forks2.4k

Last commit4 days ago

EasyOCRPython

A ready-to-use OCR Python library supporting 80+ languages and popular writing scripts like Latin, Chinese, Arabic, and Cyrillic.

#multi-language#python-library#deep-learning

Stars29.8k

Forks3.6k

Last commit7 months ago

EasydictSwift

A concise and elegant macOS dictionary and translation app with OCR, supporting 20+ services including Apple Dictionary, OpenAI, and DeepL.

#youdao#productivity#baidu

Stars13.9k

Forks701

Last commit3 days ago

KreuzbergRust

A polyglot document intelligence framework with a Rust core for extracting text, metadata, and structured data from 91+ file formats.

#text-extraction#document-intelligence#batch-processing

A self-hosted document management system that scans, indexes, and archives paper documents with OCR and encryption.

#scanning#search#documents

Stars7.9k

Forks499

Last commit5 years ago

STranslateC#

A ready-to-go translation and OCR tool developed with WPF.

#desktop-application#multilingual#plugin

Stars7.5k

Forks385

Last commit1 day ago

TagUI (.2k)JavaScript

An open-source RPA tool that automates repetitive tasks on websites, desktop apps, and the command line using a simple language.

#ai#opencv#workflow-automation

Stars6.3k

Forks644

Last commit23 days ago

Swift-AISwift

A high-performance deep learning library written entirely in Swift, optimized for Apple hardware.

#scientific-computing#apple-platforms#ios

Stars6.0k

Forks553

Last commit9 years ago

koharuRust

A local-first, ML-powered desktop application for translating manga, built in Rust with automated text detection, OCR, inpainting, and LLM translation.

#manga#desktop-app#inpainting

Stars4.9k

Forks304

Last commit2 days ago

SwiftOCRSwift

Fast and simple OCR library for iOS/macOS using neural networks, optimized for short alphanumeric codes.

#ocr-library#ios#neural-network

Stars4.6k

Forks477

Last commit5 years ago

SwiftOCRSwift

A fast and simple OCR library written in Swift, optimized for recognizing short alphanumeric codes like gift cards.

#ocr-library#ios#neural-network

Stars4.6k

Forks477

Last commit5 years ago

ruvectorRust

A self-learning vector database with graph intelligence, local AI, and PostgreSQL integration, built for real-time adaptation.

#ai#agentic-ai#self-learning

Stars4.4k

Forks577

Last commit2 days ago

captcha_trainerPython

A deep learning framework for training image classification models to solve complex captcha and OCR tasks.

#deep-learning#captcha-recognition#cnn

Stars3.2k

Forks823

Last commit8 months ago

gosseractGo

A Go package for Optical Character Recognition (OCR) using the Tesseract C++ library.

#text-extraction#tesseract-ocr#go-library

Stars3.1k

Forks307

Last commit6 months ago

OpenRPA (.9k)C#

Open source robotic process automation software for automating repetitive tasks across desktop and web applications.

#workflow#windows-automation#workflow-automation

Stars3.0k

Forks653

Last commit3 months ago

cnn_captchaPython

A TensorFlow-based CNN solution for recognizing character-based CAPTCHAs, providing training, validation, and API modules.

#flask#captcha-recognition#python

Stars2.9k

Forks785

Last commit3 years ago

captcha_breakJupyter Notebook

A deep learning project using Keras to build convolutional and recurrent neural networks for high-accuracy captcha recognition.

#deep-learning#pytorch-tutorial#keras

Stars2.8k

Forks668

Last commit4 years ago

InvoiceNetPython

Deep neural network to extract structured information from invoice documents with a customizable UI and training tools.

#document-intelligence#invoice-processing#deep-learning

Stars2.7k

Forks413

Last commit2 years ago

macshotSwift

A native macOS screenshot and screen recording tool with annotation, auto-redact PII, scroll capture, GIF recording, OCR, and uploads.

#annotation#productivity#appkit

Stars2.6k

Forks146

Last commit10 days ago

gowallGo

A CLI swiss army knife for image processing, offering wallpaper recoloring, OCR, AI upscaling, compression, and more.

#theme-converter#pixel-art#theme-switcher

Stars2.3k

Forks37

Last commit1 month ago

DocspellElm

Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.

#dms#personal document system#stanford-corenlp

Stars2.3k

Forks179

Last commit2 months ago

LeptonicaC

A C library for efficient image processing and analysis, widely used in OCR and computer vision applications.

#c-library#image-analysis#document-analysis

Stars2.1k

Forks433

Last commit9 days ago

Crow TranslateC++

A lightweight desktop translator that translates and speaks text using multiple online translation APIs.

#desktop-application#qt#open-source

Stars2.0k

Forks176

Last commit2 years ago

TRexSwift

A macOS menu bar app that uses OCR to copy any text visible on your screen directly to your clipboard.

#text-extraction#textrecognition#qr-code-reader

Stars1.9k

Forks62

Last commit9 days ago

Tess4JJava

A Java JNA wrapper for Tesseract OCR API, enabling OCR functionality in Java applications.

#text-extraction#pdf-ocr#java

Stars1.8k

Forks381

Last commit1 month ago

Awesome Document Understanding

A curated list of resources for Document Understanding (DU), covering research, datasets, tools, and applications in Intelligent Document Processing.

#key-information-extraction#document-understanding#document-analysis

Stars1.5k

Forks178

Last commit3 years ago

TextSnatcherVala

A lightweight Linux desktop application that extracts text from images using OCR with drag-and-drop simplicity.

#text-extraction#libhandy#tesseract-ocr

Stars1.4k

Forks54

Last commit2 years ago

CTPNJupyter Notebook

Scene text detection using Connectionist Text Proposal Network (CTPN) for detecting text lines in natural images.

#ocr-preprocessing#deep-learning#scene-text-detection

Stars1.3k

Forks529

Last commit4 years ago

sist2C

A fast, multi-threaded file system indexer and search tool with a web interface, supporting text/metadata extraction, thumbnails, OCR, and incremental scanning.

#file-indexing#metadata-extraction#self-hosted-search

Stars1.3k

Forks78

Last commit1 year ago

ocrserverGo

A simple OCR API server that's easy to deploy with Docker or on Heroku.

#text-extraction#api#api-server

Stars767

Forks147

Last commit5 years ago