Showing 14 of 14 projects
A collection of example skills for Claude that demonstrate how to create reusable instruction sets for specialized AI tasks.
An opinionated RAG framework for integrating generative AI into applications, supporting any LLM, vector store, and file type.
A command-line tool that adds an OCR text layer to scanned PDF files, making them searchable and copy-pasteable.
A pure-Python PDF library for splitting, merging, cropping, transforming, and extracting data from PDF files.
A pure-C HTML5 parsing library implementing the HTML5 parsing algorithm.
A Slack bot that reads and summarizes webpages, documents, and videos using ChatGPT, with voice chat capabilities.
A Python library and CLI tool for automatic text summarization using extractive methods like LexRank, LSA, Luhn, and Edmundson.
Give ChatGPT long-term memory by uploading custom knowledge base files (PDF, txt, epub) and asking questions via a React frontend.
A curated list of awesome open-source OCR software, libraries, datasets, and literature.
A Go package for Optical Character Recognition (OCR) using the Tesseract C++ library.
A Python library for reading, writing, repairing, and transforming PDFs, powered by the qpdf C++ library.
A Rust library for creating, merging, modifying, and decrypting PDF documents with support for modern object streams.
A high-performance .NET library for creating, manipulating, inspecting, and maintaining PDF documents.
A Java JNA wrapper for Tesseract OCR API, enabling OCR functionality in Java applications.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.