Showing 3 of 3 projects
A high-fidelity, user-scriptable archival web crawler using Chrome/Chromium to preserve JavaScript-rendered content.
An open-source toolkit for analyzing web archives at scale using Apache Spark.
A dockerized, queued web archiver using Chrome headless to create high-fidelity WARC files from URLs.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.