Showing 9 of 81 projects
A suite of command-line tools for manipulating SAM, BAM, and CRAM files in next-generation sequencing data analysis.
A high-performance Python package for fast, multi-threaded manipulation of large tabular datasets, inspired by R's data.table.
A curated list of awesome Apache Spark packages, libraries, and resources for data engineers and scientists.
A Ruby framework for writing reliable, concise, and maintainable ETL (Extract-Transform-Load) data processing jobs.
A C++ library for reading, writing, creating, and modifying Microsoft Excel .xlsx files.
A fast, lightweight JSON Query Language CLI tool built with Rust for querying and transforming JSON data.
A computational parallel flow library for Elixir built on top of GenStage for parallel processing of collections.
A collection of small, chainable command-line utilities for advanced password cracking operations.
A Python library for agile data preparation workflows that works with Pandas, Dask, cuDF, Dask-cuDF, Vaex, and PySpark.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.