Open-source data pipelines for cloud asset inventory, CSPM, FinOps, and vulnerability management across AWS, Azure, GCP, and 70+ sources.
CloudQuery is an open-source data pipeline platform that extracts, normalizes, and syncs cloud infrastructure metadata from AWS, Azure, GCP, and 70+ SaaS sources into data warehouses. It solves the problem of fragmented cloud asset data by providing a unified, queryable inventory for security, compliance, and cost management.
Platform teams, cloud engineers, and DevOps professionals responsible for cloud asset management, security posture, cost optimization, and automation across multi-cloud environments.
Developers choose CloudQuery for its extensive plugin coverage, self-hosted privacy, SQL-based querying, and high-performance syncs, enabling them to build custom cloud asset inventories, CSPM, and FinOps solutions without vendor lock-in.
Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Supports over 70 cloud and SaaS sources including AWS, Azure, GCP, Wiz, and GitHub, enabling unified asset management across diverse environments as highlighted in the README.
Eliminates the need for custom API scripts by making normalized cloud asset data accessible via SQL, allowing direct analysis and automation without vendor-specific code.
Handles large data volumes efficiently with fine-grained control, powered by Apache Arrow for fast data movement, as noted in the key features.
Runs entirely on user infrastructure, ensuring full data privacy and compliance without touching CloudQuery's servers, ideal for regulated environments.
Requires configuration of plugins, destinations, and orchestrators, which can be time-consuming compared to managed services, despite the flexible plugin system.
Primarily designed for periodic data extraction and synchronization, lacking native support for real-time, event-driven updates, which may limit use cases needing instant alerts.
Does not include an internal database; users must provision and manage their own data warehouse or storage solution, adding to infrastructure overhead.