ArchiveWeb.page
Active
Overview
ArchiveWeb.page is a browser extension and standalone desktop application for creating high-fidelity web archives. It captures network traffic, HTML, images, videos, stylesheets, scripts, and other page resources directly in the browser or app. Captured data is organized into pages and collections, stored locally in IndexedDB, and can be exported to the standard WARC and WACZ formats for viewing in ReplayWeb.page or other tools. It is aimed at users who need control over archiving dynamic web content and social media, and it stands out for capturing interactively during browsing without relying on remote services like the Wayback Machine. [1][2][3][4]
Key Features
- Interactive Capture - Records network traffic and page interactions as users click links, load assets, and navigate sites.
- High-Fidelity Archiving - Captures HTML, images, videos, stylesheets, scripts, and data files for complete page preservation.
- Local Storage - Stores archives in the browser's IndexedDB for viewing, management, and deletion without external servers.
- WARC/WACZ Export - Exports archives in standard WARC and WACZ formats compatible with ReplayWeb.page viewer.
- Autopilot Mode - Automatically scrolls pages to capture content without manual intervention.
- Behaviors for Social Media - Automates scrolling feeds, expanding comments, and loading posts on platforms like Instagram.
- Video/Image Auto-Load - Detects and loads videos, images, and styles in the background for complete capture.
- Collection Management - Organizes pages into collections with options to view, delete, and share individual captures.
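To make the WACZ export more concrete: a WACZ file is a ZIP package, so an exported collection can be inspected with standard tools. The sketch below is illustrative only and assumes the layout described in the WACZ specification (a `datapackage.json` at the root, WARC data under `archive/`, and a page list in `pages/pages.jsonl`). The function names `summarize_wacz` and `build_demo_wacz` are hypothetical helpers, not part of ArchiveWeb.page, and the demo archive is synthetic.

```python
import io
import json
import zipfile


def summarize_wacz(source):
    """Group the entries of a WACZ (ZIP) file by top-level folder and
    collect page URLs from pages/pages.jsonl when that file is present."""
    summary = {"entries": {}, "pages": []}
    with zipfile.ZipFile(source) as zf:
        names = zf.namelist()
        for name in names:
            top = name.split("/", 1)[0] if "/" in name else "(root)"
            summary["entries"].setdefault(top, []).append(name)
        if "pages/pages.jsonl" in names:
            with zf.open("pages/pages.jsonl") as fh:
                for raw in fh:
                    line = raw.decode("utf-8").strip()
                    if not line:
                        continue
                    record = json.loads(line)
                    # Assumption: the first line is a header record with no "url" key.
                    if "url" in record:
                        summary["pages"].append(record["url"])
    return summary


def build_demo_wacz():
    """Build a tiny synthetic WACZ-like ZIP in memory (hypothetical test data)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("datapackage.json", json.dumps({"profile": "data-package"}))
        zf.writestr("archive/data.warc.gz", b"")
        zf.writestr(
            "pages/pages.jsonl",
            '{"format": "json-pages-1.0", "id": "pages", "title": "All Pages"}\n'
            '{"id": "1", "url": "https://example.com/", "title": "Example"}\n',
        )
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(summarize_wacz(build_demo_wacz()))
```

For a real export, `summarize_wacz` can be given the path of the downloaded `.wacz` file instead of the in-memory demo.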
Pricing
| Plan | Price | Includes |
|---|---|---|
| Free | $0 | Unlimited archiving, local storage, WARC/WACZ export, all features. |
Platforms & Requirements
Available as a Chrome/Chromium extension, which captures tabs via the browser's debugging protocol, and as a standalone Electron app for Windows (.exe), macOS (.dmg), and Linux (.zip). No specific minimum requirements are listed beyond standard browser or Electron support. The extension may conflict with other extensions or fail on certain content, such as YouTube videos or PDFs; the desktop app is recommended in those cases. [2][6][8]
Integrations & Ecosystem
- WARC export
- WACZ export
- ReplayWeb.page viewer
- Chrome debugging protocol
- wabac.js service worker
- IndexedDB storage
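As a rough illustration of what a WARC export contains: a WARC file is a sequence of records, each with a version line, named header fields, a blank line, and a content block whose size is given by `Content-Length`. The sketch below reads only the headers of an uncompressed WARC stream. It is a minimal example under those assumptions, not how ArchiveWeb.page or ReplayWeb.page process archives; `iter_warc_headers` and `build_demo_warc` are hypothetical names and the demo records are synthetic. Exports are often gzip-compressed, so for real files a dedicated library such as warcio is likely the more robust choice.

```python
import io


def iter_warc_headers(stream):
    """Yield the header fields of each record in an uncompressed WARC stream."""
    while True:
        version = stream.readline()
        if not version:
            return  # end of stream
        if not version.strip():
            continue  # blank separator lines between records
        headers = {"version": version.strip().decode("utf-8")}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the header section
            key, _, value = line.decode("utf-8").partition(":")
            headers[key.strip()] = value.strip()
        # Skip the content block so the next read starts at the next record.
        stream.read(int(headers.get("Content-Length", "0")))
        yield headers


def build_demo_warc():
    """Build two identical synthetic WARC records in memory (hypothetical data)."""
    body = b"hello"
    record = (
        b"WARC/1.1\r\n"
        b"WARC-Type: resource\r\n"
        b"WARC-Target-URI: https://example.com/\r\n"
        b"Content-Length: " + str(len(body)).encode("ascii") + b"\r\n"
        b"\r\n" + body + b"\r\n\r\n"
    )
    return io.BytesIO(record * 2)


if __name__ == "__main__":
    for headers in iter_warc_headers(build_demo_warc()):
        print(headers["WARC-Type"], headers["WARC-Target-URI"])
```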
Alternatives
| App | Difference |
|---|---|
| Wayback Machine | Remote, centralized archiving service that offers less control over captures than a local interactive tool. |
| SingleFile | Saves pages as single HTML files but lacks high-fidelity network capture and video handling. |
| HTTrack | Desktop website copier focused on mirroring sites, less suited for interactive dynamic content. |
| WebCite | Online archiving service requiring submission, no local storage or browser-based capture. |
Reputation
ArchiveWeb.page is regarded as a user-friendly, effective tool for preserving dynamic web content and social media locally, and is praised for high-fidelity captures and a degree of control absent from services like the Wayback Machine. Users and guides highlight its utility for cultural heritage and research archiving. Criticisms include extension conflicts with other browser plugins, issues with specific content such as YouTube videos or PDFs, and requests for enhanced automation such as extended autoscroll. [7][8][10]
Sources (10)
1. https://chromewebstore.google.com/detail/webrecorder-archivewebpag/fpeoodllldobpkbkabpblcfaogecpndd
2. https://github.com/webrecorder/archiveweb.page
3. https://archiveweb.page
4. https://archiveweb.page/en/usage/
5. https://archiveweb.page/guide
6. https://www.sucho.org/archivewebpage-app-instructions
7. https://libguides.lakeheadu.ca/c.php?g=734257&p=5282156
8. https://archiveweb.page/en/troubleshooting/errors/
9. https://archiveweb.page/en/features/behaviors/
10. https://forum.webrecorder.net/t/archivewebpage-app-and-extension-functions-request/826