Collecting Data from Web Pages with OutWit

Productivity Sauce
Web scraping is a clever idea, but extracting data from a Web page manually can be a real chore. The new OutWit extension provides a solution to this problem. Better yet, it allows you to save and export the scraped data, which makes it a great research tool. Although the extension is still at a very early stage of development, it has the potential to turn your favorite browser into a powerful tool for extracting and organizing data. The current version already boasts an impressive list of features, including data structure recognition, page and image link extraction, e-mail extraction, table and list extraction, and more.
Although OutWit is a rather advanced tool, using it for simple Web scraping is not particularly difficult. Let's say you want to extract data from the Population of the 5 largest cities in the EU table and export the data for use in a Calc spreadsheet. Press the OutWit button in the Firefox toolbar to open the OutWit Hub window. The left pane contains a tree of data types supported by the OutWit Hub. Navigate to page -> data -> tables, and you should see the data from the tables on the Wikipedia page. Locate and select the rows containing the city data (see screenshot below) and drag them onto the Catch pane at the bottom.
To save the selected data, choose the File -> Save Catch as command. To export the data for use in a spreadsheet, select all the rows in the Catch pane, right-click on the selection, and choose the Export Selection as command. OutWit can export the data in the Excel format only, but since OpenOffice.org Calc can read .xls files, that's not a big issue. In a similar manner, you can collect other types of data, including lists, email addresses, RSS feeds, images, and much more.
OutWit is actually more than just a mere Firefox extension. It is a platform that allows you to create your own Web data collection solutions called outfits. In fact, the OutWit Hub is an outfit built upon the OutWit kernel. Besides catching all sorts of data from a Web page, you can use OutWit Hub to create your own scrapers, and the following post on the OutWit blog shows you how to do that.
Comments
comments powered by DisqusSubscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News
-
Linux Kernel 6.17 is Available
Linus Torvalds has announced that the latest kernel has been released with plenty of core improvements and even more hardware support.
-
Kali Linux 2025.3 Released with New Hacking Tools
If you're a Kali Linux fan, you'll be glad to know that the third release of this famous pen-testing distribution is now available with updates for key components.
-
Zorin OS 18 Beta Available for Testing
The latest release from the team behind Zorin OS is ready for public testing, and it includes plenty of improvements to make it more powerful, user-friendly, and productive.
-
Fedora Linux 43 Beta Now Available for Testing
Fedora Linux 43 Beta ships with Gnome 49 and KDE Plasma 6.4 (and other goodies).
-
USB4 Maintainer Leaves Intel
Michael Jamet, one of the primary maintainers of USB4 and Thunderbolt drivers, has left Intel, leaving a gaping hole for the Linux community to deal with.
-
Budgie 10.9.3 Now Available
The latest version of this elegant and configurable Linux desktop aligns with changes in Gnome 49.
-
KDE Linux Alpha Available for Daring Users
It's official, KDE Linux has arrived, but it's not quite ready for prime time.
-
AMD Initiates Graphics Driver Updates for Linux Kernel 6.18
This new AMD update focuses on power management, display handling, and hardware support for Radeon GPUs.
-
AerynOS Alpha Release Available
With a choice of several desktop environments, AerynOS 2025.08 is almost ready to be your next operating system.
-
AUR Repository Still Under DDoS Attack
Arch User Repository continues to be under a DDoS attack that has been going on for more than two weeks.
World coins
INCREDIBLE STUFF!
Very cool FF3 version
Great Add-On! Thanks for the tip.
Outwit
I'll stick to good old scrapbook for now...