Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules, digitized from MIT Museum scans.
Open-source self-hosted web archiving tool that saves websites in multiple durable formats like HTML, PDF, and WARC.
An Adobe Flash Player emulator written in Rust that runs on desktop and web via WebAssembly.
Offline full-text search and archiving tool for Chromium-based browsers that saves and indexes every page you visit.
An open-source, extensible, web-scale, archival-quality web crawler from the Internet Archive.
A virtual machine that lets you play classic point-and-click adventure games and RPGs on modern devices using original game data files.
A curated list of resources, tools, and services for web archiving, from acquisition and replay to analysis and community.
A privacy-focused web archiving tool with an IM-style interface that captures pages to multiple archival services.
An open-source web application for hosting archives of fanworks including fanfic, fanart, and fan vids.
A preconfigured web crawler for backing up websites, producing WARC files with a live dashboard and dynamic ignore patterns.
A browser extension and desktop app for interactive, high-fidelity web archiving directly in the browser.
A standalone Docker container for high-fidelity, browser-based web archiving crawls using Puppeteer and Brave.
A set of Python tools for downloading and preserving wikis, including MediaWiki wikis and Wikimedia projects.
A distributed and persistent web archive replay system that uses IPFS to store and serve WARC files.
Legacy web archive replay engine for accessing historical web content from WARC files.
A graphical desktop application that simplifies web archiving by providing a one-click interface to preserve and replay web pages using Heritrix and OpenWayback.
Open-source quality control tool for analyzing digitized video files through audiovisual analytics and filtering.
A web-compatible Shockwave Player emulator written in Rust for playing old browser games on modern browsers.
A Latin-English dictionary with inflectional morphology support, originally created by William Whitaker and maintained for digital preservation.
A high-fidelity, browser-based web archiving library and CLI for capturing single web pages with provenance.
A wrapper for ffmpeg that checksums internal media streams, enabling stable verification across metadata and container changes.
A high-fidelity, user-scriptable archival web crawler using Chrome/Chromium to preserve JavaScript-rendered content.
Python command-line tools and libraries for handling, validating, and converting WARC and ARC web archive files.
A web application for searching, browsing, and analyzing archived web content (ARC/WARC files) with a Solr backend.
A toolkit for indexing and exploring web archive content from ARC and WARC files using OpenSearch/Elasticsearch.
A curated list of software, literature, and resources for the Memento protocol (RFC 7089), which enables time-based access to archived web content.
A Node.js library for parsing and creating Web ARChive (WARC) files with support for Chrome, Puppeteer, and Electron.
A portable concurrent Memento aggregator CLI and server for retrieving archived web pages from multiple sources.
A command-line tool and Python library for archiving Facebook data via the Graph API, supporting recursive retrieval of nodes and edges.