Showing 5 of 5 projects
A Python library for parsing diverse document formats into structured data, optimized for integration with generative AI applications.
An open-source framework for building LLM-powered applications with data ingestion, indexing, and retrieval capabilities.
A cross-platform XML/HTML parser for Swift with XPath and CSS selector support.
A Ruby library for extracting text and metadata from various document formats using Apache Tika.
A .NET framework for extracting and exporting text and data from a wide variety of document formats.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.