A fast duplicate file finder that works as both a Rust library and a CLI tool, using file hashing to identify duplicates.
DDH (Directory Differential hTool) is a fast duplicate file finder that identifies duplicate files by hashing their contents. It traverses specified directories and subdirectories, comparing file hashes to report duplicates, helping users reclaim disk space and organize files.
Developers and system administrators who need to identify and manage duplicate files across directories, especially those working in Rust environments or preferring command-line tools.
DDH offers a simple, dual-usage design as both a Rust library and CLI tool, with flexible output formats and configurable filtering, making it efficient for both manual use and integration into automated workflows.
A fast duplicate file finder
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Functions as both a Rust library for programmatic integration and a standalone CLI tool, making it versatile for developers building custom workflows.
Supports human-readable standard output and machine-parsable JSON output, enabling easy integration with tools like ddh-move for automated processing.
Allows setting a minimum file size and ignoring specific directories, providing fine-grained control over which files are considered for duplicate detection.
Uses efficient file hashing and traversal algorithms, with optimizations discussed in the project's linked article for handling large directories.
Performance is heavily tied to disk speeds, as hashing large files can be slow on mechanical drives, a limitation admitted in the README.
Requires Rust and Cargo for installation, which may be cumbersome for users not already in the Rust ecosystem or on systems without Rust set up.
Focuses on exact hash matching without support for fuzzy hashing, symbolic link handling, or file type filtering, which might be needed for complex duplicate scenarios.