A set of Python tools for downloading and preserving wikis, including MediaWiki wikis and Wikimedia projects.
WikiTeam is a collection of Python tools for archiving wikis, primarily those running on MediaWiki. It solves the problem of wikis disappearing from the internet by enabling users to download complete backups, including page histories and images, ensuring long-term preservation of free-licensed content.
Archivists, librarians, researchers, and wiki administrators who need to preserve wiki content for historical, research, or backup purposes.
Developers choose WikiTeam because it offers specialized, open-source tools for wiki preservation that are more focused and comprehensive than generic web archiving solutions, with a proven track record of archiving hundreds of thousands of wikis.
Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2026, WikiTeam has preserved more than 600,000 wikis.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Downloads complete XML page histories, images, and metadata, as shown in the command examples with --xml and --images options for complete backups.
Supports resuming interrupted downloads with the --resume flag, essential for handling large wikis without restarting from scratch.
Offers tutorials, a mailing list for support, and detailed wiki pages, evidenced by the quick guide and community acknowledgments in the README.
Has archived over 600,000 wikis and 34 TB of Wikimedia Commons images as of 2026, demonstrating reliability in large-scale preservation efforts.
Requires Python 2.7, which is end-of-life and no longer receiving security updates, as stated in the requirements, limiting modern compatibility.
Primarily supports MediaWiki wikis, with expansion to other engines still a goal rather than a current feature, restricting use for non-MediaWiki sites.
Lacks a graphical interface, requiring users to navigate command-line options and configurations, which can be daunting for non-technical archivists.
Hosted on GitHub but criticizes it for proprietary software, reflecting political stances that may complicate project maintenance and adoption.