A command-line tool for running SQL queries against JSON, CSV, Excel, Parquet, and other structured data files.
dsq is a command-line tool that allows users to run SQL queries directly against structured data files like JSON, CSV, Excel, and Parquet. It solves the problem of needing to import data into a database before querying, enabling quick data exploration and transformation from the terminal.
Data analysts, developers, and system administrators who work with multiple data formats and need to quickly query or transform files without setting up a full database.
Developers choose dsq for its simplicity, broad file format support, and the ability to use familiar SQL syntax for ad-hoc data analysis, making it a versatile alternative to format-specific tools or heavier database solutions.
Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Supports querying JSON, CSV, Excel, Parquet, Avro, YAML, and log files directly, as listed in the Supported Data Types table, eliminating the need for format-specific tools.
Enables SQL joins across multiple files of different formats in a single command, demonstrated in the 'Multiple files and joins' section with examples joining CSV and JSON data.
Optional disk caching speeds up repeated queries on unchanged datasets by storing imported data with SHA1-based invalidation, detailed in the Caching section for large file exploration.
Built-in SQL REPL allows iterative querying without reloading data, shown in the Interactive REPL section for ad-hoc data exploration.
The README explicitly states the project is not under active development and recommends alternatives like DuckDB for stronger analytics foundations, risking long-term support.
Relies on SQLite, which lacks column-oriented storage and vectorized execution, making it slower for large-scale analytics compared to DuckDB or ClickHouse, as noted in benchmarks.
Cannot specify paths through arrays (only objects) or retrieve whole objects, restricting flexibility with complex JSON structures, admitted in the 'Limitation: nested arrays' and 'Limitation: whole object retrieval' sections.