Showing 18 of 18 projects
A line-oriented search tool that extends ripgrep to search inside PDFs, Office documents, archives, and many other file types.
A Go library for writing shell-like scripts with a pipeline API for file reading, subprocess execution, string matching, and more.
A pure Go library for creating, editing, and processing Microsoft Office Open XML documents (DOCX, XLSX, PPTX).
A Ruby library for reading and parsing spreadsheet files (Excel, OpenOffice, CSV) with a unified interface.
A suite of extremely fast and reliable parsers for Java with a consistent interface for multiple file formats.
A deprecated collection of utility functions for building gulp plugins and tasks.
A simple, lightweight batch processing framework for Java designed for ETL jobs.
A Ruby library for extracting text and metadata from various document formats using Apache Tika.
A Gulp plugin for performing string and regex replacements in build pipelines.
A serverless reference architecture for real-time parallel file processing using AWS Lambda, S3, SQS, and Amazon Comprehend.
A pure streaming tar parser and generator for Node.js that operates without hitting the file system.
Generate minimal-boilerplate web UIs from Python functions using type hints.
A simple Node.js transform stream for reading text streams line-by-line with Unicode support.
A Java library for sorting very large files using external-memory algorithms and multiple cores.
Merge multiple Node.js streams into one interleaved stream for parallel processing in gulp and other stream-based workflows.
A GitHub Action that creates or updates issues using content from a file, designed for automated workflows.
A Node.js library that determines if a filename or buffer is text or binary using smarter detection than other solutions.
A Rust library for efficiently reading and navigating forward, backward, or randomly through lines in huge files (tested with 300+ GB).
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.