Archiveteam Project Collects Lost Web 2.0 Content
Many users keep their emails with webmail services, wedding pictures in photo communities and reading habits with social bookmarking services. What happens, though, when data is lost or websites fold? Archiveteam wants to help in those circumstances.
The Archiveteam wiki provides various assistance so that your personal photo album and other files don't end up in the ether. Assistance includes instructions and documentation about file formats and storage media, much of which are in early phases of development. In a more progressed state is the team's Deathwatch page with a continually updated list of websites that have gone kaput or are about to go that way. Among them, Yahoo's Geocities site and the already closed Furl and Tripod.

Under the rubric Software, the project collects tools, tips and tricks. Included is the GNU wget command that, with some appropriate parameters, secures a complete Wordpress blog on a local hard drive. Some site-specific pages relate to Google, Livejournal and Twitter.
One of the Archiveteam founders is Jason Scott, whose textfiles.com site has been archiving text data off the network from the 1980s and 90s. The young Archiveteam is looking for fellow archivers to write articles and manuals, set up mirror servers and bittorents and form a download task force.
Debian developer Joey Hess has already had thoughts (in a blog) about a GUI program for rescuing Web 2.0 data. Ideally the user would simply enter a list of URLs or a bookmark file and the program would take care of the rest: plugins appropriate to the service or website would handle the work, including a generic one for sites with RSS feeds. Hess is collecting "thoughts, comments, prior art [and] cute program idea names." Some have come his way already.
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News
-
Go-Based Botnet Attacking IoT Devices
Using an SSH credential brute-force attack, the Go-based PumaBot is exploiting IoT devices everywhere.
-
Plasma 6.5 Promises Better Memory Optimization
With the stable Plasma 6.4 on the horizon, KDE has a few new tricks up its sleeve for Plasma 6.5.
-
KaOS 2025.05 Officially Qt5 Free
If you're a fan of independent Linux distributions, the team behind KaOS is proud to announce the latest iteration that includes kernel 6.14 and KDE's Plasma 6.3.5.
-
Linux Kernel 6.15 Now Available
The latest Linux kernel is now available with several new features/improvements and the usual bug fixes.
-
Microsoft Makes Surprising WSL Announcement
In a move that might surprise some users, Microsoft has made Windows Subsystem for Linux open source.
-
Red Hat Releases RHEL 10 Early
Red Hat quietly rolled out the official release of RHEL 10.0 a bit early.
-
openSUSE Joins End of 10
openSUSE has decided to not only join the End of 10 movement but it also will no longer support the Deepin Desktop Environment.
-
New Version of Flatpak Released
Flatpak 1.16.1 is now available as the latest, stable version with various improvements.
-
IBM Announces Powerhouse Linux Server
IBM has unleashed a seriously powerful Linux server with the LinuxONE Emperor 5.
-
Plasma Ends LTS Releases
The KDE Plasma development team is doing away with the LTS releases for a good reason.