Archiveteam Project Collects Lost Web 2.0 Content
Many users keep their emails with webmail services, wedding pictures in photo communities and reading habits with social bookmarking services. What happens, though, when data is lost or websites fold? Archiveteam wants to help in those circumstances.
The Archiveteam wiki offers help so that your personal photo album and other files don't end up in the ether. It includes instructions and documentation about file formats and storage media, much of which is still in the early phases of development. Further along is the team's Deathwatch page, a continually updated list of websites that have shut down or are about to. Among them are Yahoo's Geocities site and the already closed Furl and Tripod.

Under the Software heading, the project collects tools, tips and tricks. Among them is the GNU wget command, which, given appropriate parameters, saves a complete WordPress blog to a local hard drive. Some site-specific pages cover Google, LiveJournal and Twitter.
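The article does not say which wget parameters the wiki recommends, so the following Python sketch is only one plausible combination of standard wget options for mirroring a site. The function names, the blog address and the target directory are assumptions made for illustration.

```python
import shlex
import subprocess


def build_wget_command(blog_url, target_dir):
    """Return a wget argument list that mirrors blog_url into target_dir.

    The option set is an assumption, not the Archiveteam wiki's recipe.
    """
    return [
        "wget",
        "--mirror",            # recursive download with timestamping
        "--convert-links",     # rewrite links so the copy can be browsed offline
        "--page-requisites",   # also fetch images, stylesheets and scripts
        "--adjust-extension",  # add an .html suffix to pages that lack one
        "--no-parent",         # never climb above the starting URL
        "--wait=1",            # pause one second between requests
        f"--directory-prefix={target_dir}",
        blog_url,
    ]


def run_backup(blog_url, target_dir):
    """Run wget; raises CalledProcessError if wget exits with an error."""
    subprocess.run(build_wget_command(blog_url, target_dir), check=True)


# Hypothetical blog address and directory; this only prints the command.
print(shlex.join(build_wget_command("https://blog.example.org/", "blog-backup")))
```

Calling `run_backup` with the same two arguments would start the actual download, provided wget is installed.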
One of the Archiveteam founders is Jason Scott, whose textfiles.com site archives text files from the networks of the 1980s and '90s. The young Archiveteam is looking for fellow archivers to write articles and manuals, set up mirror servers and torrents, and form a download task force.
Debian developer Joey Hess has already shared thoughts in a blog post about a GUI program for rescuing Web 2.0 data. Ideally the user would simply enter a list of URLs or a bookmark file and the program would take care of the rest: plugins appropriate to the service or website would handle the work, including a generic one for sites with RSS feeds. Hess is collecting "thoughts, comments, prior art [and] cute program idea names." Some have come his way already.
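The plugin idea can be sketched in a few lines of Python. This is not Hess's design, only a minimal illustration of dispatching each URL to a service-specific plugin with a generic fallback. All class and function names are hypothetical, and the `rescue` methods return a description instead of downloading anything.

```python
from urllib.parse import urlparse


class ServicePlugin:
    """Base class: a plugin claims URLs by host name."""

    hosts = ()  # host names this plugin knows how to handle

    def matches(self, url):
        host = urlparse(url).hostname or ""
        return any(host == h or host.endswith("." + h) for h in self.hosts)

    def rescue(self, url):
        raise NotImplementedError


class LiveJournalPlugin(ServicePlugin):
    """Hypothetical plugin for one specific service."""

    hosts = ("livejournal.com",)

    def rescue(self, url):
        return f"livejournal export of {url}"


class GenericFeedPlugin(ServicePlugin):
    """Fallback for any site; a real version would look for an RSS feed."""

    def matches(self, url):
        return True

    def rescue(self, url):
        return f"generic feed download of {url}"


# Order matters: specific plugins first, the generic fallback last.
PLUGINS = [LiveJournalPlugin(), GenericFeedPlugin()]


def choose_plugin(url):
    """Return the first plugin that claims the URL."""
    return next(p for p in PLUGINS if p.matches(url))


def rescue_all(urls):
    """Hand each URL to its plugin and collect the results."""
    return [choose_plugin(u).rescue(u) for u in urls]
```

Keeping the generic plugin last in the list means a new service can be supported by adding one class ahead of it, without touching the dispatch code.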