Content Disarm and Reconstruction software that sanitizes Office files by removing malicious dynamic content.
DocBleach is a Content Disarm and Reconstruction (CDR) software that sanitizes Office documents by removing malicious dynamic content. It addresses the security gap where traditional antivirus solutions may fail to detect threats in files from untrusted sources. The software works by stripping potentially dangerous elements from documents while preserving their usable content.
Organizations and individuals who regularly handle Office files from external or untrusted sources, such as HR departments processing resumes, legal firms receiving documents from clients, or security-conscious businesses.
DocBleach provides proactive protection by sanitizing documents before they're opened, offering an additional security layer beyond detection-based antivirus solutions. Its open-source nature allows for transparency and community-driven improvement of threat detection capabilities.
:shower: Sanitising your documents, one threat at a time. — Content Disarm & Reconstruction Software
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
DocBleach removes malicious dynamic content from Office files before they're opened, offering protection beyond detection-based antivirus solutions, as highlighted in its philosophy of proactive sanitization.
Provides both a web interface and command-line interface, enabling flexible usage for manual uploads or batch processing, as demonstrated in the README with CLI examples.
Accepts files from local paths, HTTP URLs, or stdin, and outputs to local paths or stdout, making it adaptable to various workflows, such as piping data in scripts.
The project is open-source with Maven build system and JUnit tests, allowing community inspection and contribution, which is encouraged in the README for improvements.
Primarily focused on Office files like Word and Excel, with limited coverage for other formats such as PDFs or images, reducing its utility in diverse document environments.
The README states 'Don't expect the code base to change everyday,' indicating slower adaptation to new threats, which is a significant drawback for security software.
Requires Java to run unless using the Windows executable, adding setup complexity and potential compatibility issues for non-Windows or Java-free environments.