A Python utility for converting PDFs, Office documents, images, audio, and more into structured Markdown for LLM consumption.
A pure Go library for creating, editing, and processing Microsoft Office Open XML documents (DOCX, XLSX, PPTX).
A Ruby library for extracting text and metadata from various document formats using Apache Tika.
A .NET framework for extracting and exporting text and data from a wide variety of document formats.