Screen scraping with Colly in Go

Programming Snapshot – Colly

Lead Image © Hannu Viitanen, 123RF.com

Article from Issue 223/2019

Author(s): Mike Schilli

The Colly scraper helps developers who work with the Go programming language to collect data off the web. Mike Schilli illustrates the capabilities of this powerful tool with a few practical examples.

As long as there are websites to view for the masses of browser customers on the web, there will also be individuals on the consumer side who want the data in a different format and write scraper scripts to automatically extract the data to fit their needs.

Many sites do not like the idea of users scraping their data. Check the website's terms of service for more information, and be aware of the copyright laws for your jurisdiction. In general, as long as the scrapers do not republish or commercially exploit the data, or bombard the website too overtly with their requests, nobody is likely to get too upset about it.

Different languages offer different tools for this. Perl aficionados will probably appreciate the qualities of WWW::Mechanize as a scraping tool, while Python fans might prefer the selenium package [1]. In Go, there are several projects dedicated to scraping that attempt to woo developers.

[...]

Use Express-Checkout link below to read the full article (PDF).

Buy this article as PDF

Express-Checkout as PDF

Price $2.95
(incl. VAT)

Buy Linux Magazine

SINGLE ISSUES

Print Issues

Digital Issues

SUBSCRIPTIONS

Print Subs

Digisubs

TABLET & SMARTPHONE APPS

US / Canada

UK / Australia

Support Our Work

Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News

KDE Unleashes Plasma 6.5

Flatpak , KDE , Plasma

The Plasma 6.5 desktop environment is now available with new features, improvements, and the usual bug fixes.
Xubuntu Site Possibly Hacked

Linux , Security , Xubuntu

It appears that the Xubuntu site was hacked and briefly served up a malicious ZIP file from its download page.
LMDE 7 Now Available

Cinnamon , DEBIAN , Linux mint

Linux Mint Debian Edition, version 7, has been officially released and is based on upstream Debian.
Linux Kernel 6.16 Reaches EOL

Kernel , Linux

Linux kernel 6.16 has reached its end of life, which means you'll need to upgrade to the next stable release, Linux kernel 6.17.
Amazon Ditches Android for a Linux-Based OS

Linux , Operating Systems , Tools

Amazon has migrated from Android to the Linux-based Vega OS for its Fire TV.
Cairo Dock 3.6 Now Available for More Compositors

Desktop , graphics , Linux

If you're a fan of third-party desktop docks, then the latest release of Cairo Dock with Wayland support is for you.
System76 Unleashes Pop!_OS 24.04 Beta

COSMIC , Operating Systems , Pop!_OS

System76's first beta of Pop!_OS 24.04 is an impressive feat.
Linux Kernel 6.17 is Available

Games , Kernel , Linux

Linus Torvalds has announced that the latest kernel has been released with plenty of core improvements and even more hardware support.
Kali Linux 2025.3 Released with New Hacking Tools

Kali Linux , Linux , Operating Systems

If you're a Kali Linux fan, you'll be glad to know that the third release of this famous pen-testing distribution is now available with updates for key components.
Zorin OS 18 Beta Available for Testing

Linux , Operating Systems , Zorin OS

The latest release from the team behind Zorin OS is ready for public testing, and it includes plenty of improvements to make it more powerful, user-friendly, and productive.

Screen scraping with Colly in Go

Programming Snapshot – Colly

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

Support Our Work

News

KDE Unleashes Plasma 6.5

Xubuntu Site Possibly Hacked

LMDE 7 Now Available

Linux Kernel 6.16 Reaches EOL

Amazon Ditches Android for a Linux-Based OS

Cairo Dock 3.6 Now Available for More Compositors

System76 Unleashes Pop!_OS 24.04 Beta

Linux Kernel 6.17 is Available

Kali Linux 2025.3 Released with New Hacking Tools

Zorin OS 18 Beta Available for Testing

Screen scraping with Colly in Go

Programming Snapshot – Colly

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters Find Linux and Open Source Jobs Subscribe to our ADMIN Newsletters

Support Our Work

News

Tag Cloud

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters