Showing 17 of 17 projects
An organized reading list of patterns, case studies, and articles on building scalable, reliable, and performant large-scale systems.
A flow control component for microservices enabling reliability, resilience, and real-time monitoring.
A durable execution platform for building scalable, reliable applications with automatic failure handling.
A curated list of awesome Site Reliability Engineering (SRE) and Production Engineering resources.
A curated list of Chaos Engineering resources, including tools, books, papers, and educational materials.
A fast, friendly, functional programming language designed for building reliable software.
Static code analysis tool for Kubernetes YAML and Helm charts that provides recommendations to improve reliability and security.
A Ruby job queue that uses PostgreSQL's advisory locks for high performance and ACID reliability.
A CLI-driven toolkit for writing and running Chaos Engineering experiments across any platform via extensions.
A Docker container that automatically monitors and restarts unhealthy containers using Docker's HEALTHCHECK feature.
A Go HTTP middleware library for injecting faults like latency, errors, and request rejection into services for resilience testing.
A Ruby gem providing a simple and reliable beanstalkd-powered job queue for background processing.
A Ruby-based process monitoring tool that automatically restarts failed processes and manages daemons.
A Sidekiq extension that makes long-running jobs interruptible and resumable with checkpointing.
A background job processing library for Elixir using RabbitMQ for reliable messaging and job queuing.
A JVMTI agent that forcibly terminates the JVM when it cannot allocate memory or create threads.
A lightweight Go library that handles RabbitMQ auto-reconnect, publishing retry, and channel pooling to simplify robust messaging.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.