Showing 6 of 6 projects
A Python library for parsing diverse document formats into structured data, optimized for integration with generative AI applications.
A customizable AI chatbot agent that ingests PDF documents, stores embeddings in a vector database, and answers user queries using LangChain and LangGraph.
A comprehensive PDF processing library and CLI written in Go, supporting encryption, validation, and batch operations.
A Python library for extracting and analyzing text, images, and metadata from PDF documents.
A high-performance GraphRAG framework in Rust that transforms documents into knowledge graphs for superior retrieval and generation.
A curated list of resources for Document Understanding (DU), covering research, datasets, tools, and applications in Intelligent Document Processing.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.