Data processor
Open Source Gem

© Lead Image © Mikhail Avlasenko, 123RF.com
A little-known, very powerful data processor for your scripts, datamash makes long, complex calculations simple.
GNU datamash [1] is a command-line program capable of analyzing, summarizing, or transforming in various ways tables of numbers, with or without text, stored inside plaintext files. For these kinds of tasks, datamash is often a faster, more productive alternative to tools like AWK, sed, or any scripting language.
Just like those other tools, datamash is a good team player, in the traditional Unix and Linux sense: You can use datamash interactively at the prompt, automatically in shell scripts, and even directly attach it to other programs (including itself!) via Unix pipes.
Besides, in almost all the cases I have seen or can imagine, datamash does what you need with less typing, possibly a lot less. Last but not least, datamash lets you easily perform basic quality checks on raw data. I'll show you how to do all this from scratch, starting with the basic options and ways of working with datamash and then moving to more complicated examples.
[...]
Buy this article as PDF
(incl. VAT)
Buy Linux Magazine
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News
-
AerynOS Alpha Release Available
With a choice of several desktop environments, AerynOS 2025.08 is almost ready to be your next operating system.
-
AUR Repository Still Under DDoS Attack
Arch User Repository continues to be under a DDoS attack that has been going on for more than two weeks.
-
RingReaper Malware Poses Danger to Linux Systems
A new kind of malware exploits modern Linux kernels for I/O operations.
-
Happy Birthday, Linux
On August 25, Linux officially turns 34.
-
VirtualBox 7.2 Has Arrived
With early support for Linux kernel 6.17 and other new additions, VirtualBox 7.2 is a must-update for users.
-
Linux Mint 22.2 Beta Available for Testing
Some interesting new additions and improvements are coming to Linux Mint. Check out the Linux Mint 22.2 Beta to give it a test run.
-
Debian 13.0 Officially Released
After two years of development, the latest iteration of Debian is now available with plenty of under-the-hood improvements.
-
Upcoming Changes for MXLinux
MXLinux 25 has plenty in store to please all types of users.
-
A New Linux AI Assistant in Town
Newelle, a Linux AI assistant, works with different LLMs and includes document parsing and profiles.
-
Linux Kernel 6.16 Released with Minor Fixes
The latest Linux kernel doesn't really include any big-ticket features, just a lot of lines of code.