WarcDB is an SQLite-based file format that makes web crawl data easier to share and query.
A Rails engine for exploring web archives in WARC and ARC formats, with faceted search and advanced discovery options.
A Python script that converts offline web resources into a single WARC file for archiving.
A Go library for reading and parsing WARC and ARC web archive formats with specialized utilities for web archiving workflows.