Setting up a data analytics environment in Linux with Python

Setting Up the Data Science Libraries

The final step is to install the main libraries. Again, just like with JupyterLab, you have the option of installing them with pip:

# pip install numpy pandas matplotlib scikit-learn

or with the OS package manager (Table 2).
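On a Debian-based distribution, for instance, the equivalent looks something like this (package names vary between distributions):

# apt install python3-numpy python3-pandas python3-matplotlib python3-sklearn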

This setup leaves you with an environment that is ready both for exploratory data analytics, using JupyterLab, and for large batch processing, using the Python interpreter in script mode. Note that JupyterLab lets you export a notebook as a Python script. You can also distribute results and documentation as Jupyter notebooks, to report data analytics work to clients. There is one more step that some users might want to take, depending on the specific data analytics project: installing additional Python libraries. PyPI [2] lists all the libraries available through pip. It is good practice to explore the package index before a big project and assess the available field-specific libraries, as well as their maturity and fit with the project requirements.

Example

Suppose I want to understand the behavior of the traffic in a parking lot. I will obtain a profile that shows the hourly average occupancy of the parking lot, based on data collected in several measurement campaigns, on different days and at different points around the city. First, I need to retrieve the raw data. For this example, I will use the Birmingham Parking dataset [3], which was used in research work on Smart Cities [4]. Download the full dataset using wget:

wget https://archive.ics.uci.edu/ml/machine-learning-databases/00482/dataset.zip

You can enter this command in a terminal window, or you can prefix it with the special character ! within Jupyter to run it as a shell command from a notebook cell. Next, unzip the data with unzip.
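For example, retrieval and extraction can both run from a notebook cell:

!wget https://archive.ics.uci.edu/ml/machine-learning-databases/00482/dataset.zip
!unzip dataset.zip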

Given the great variety of formats, processes, and policies of data collection, dataset retrieval will look different each time; sometimes you need to download a ZIP file, sometimes you might just go to a database, or other times you might need to retrieve an SD card from an embedded system. That's the beauty of data science: Each project starts and develops in a different way.

For this example, I will include the Pandas data analytics library. Pandas completely changes the data workflow in Python, making it much more intuitive and easier to use. Internally, Pandas relies on the mechanisms provided by NumPy, thus inheriting its efficiency. A common scenario is to load the data into a Pandas object and perform the preliminary data analysis tasks on it (especially the selection, preprocessing, and transformation stages).

The first step is to read the contents of the file into a Pandas DataFrame using the read_csv() function (Figure 6), to which you pass the mandatory filename parameter and an optional parse_dates parameter to make it interpret one column as a date-time field. You can then visualize the contents loaded from the file with display().

Figure 6: Loading the dataset into a DataFrame.
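In code, the load step of Figure 6 looks roughly like this; the CSV filename is an assumption, so use whatever name dataset.zip extracts to. The display() function is available by default in Jupyter:

import pandas as pd

# Parse LastUpdated as a date-time field while reading;
# the filename 'dataset.csv' is assumed here
df = pd.read_csv('dataset.csv', parse_dates=['LastUpdated'])
display(df)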

As you can see in Figure 6, the data appears in columns. The first column is SystemCodeNumber, which is an identifier of the parking lot. The second column (Capacity) shows the total capacity of the lot, and the third one (Occupancy) shows the current number of occupied parking spaces. Finally, LastUpdated shows the time and date of the last sensor reading.

The next step is to apply a selection process that keeps only the samples from the NIA North parking lot. For this step, use the .loc property of the Pandas DataFrame object, which allows you to filter rows. The code shown in Figure 7 selects all the entries in df where the parking lot name is 'NIA North'.

Figure 7: Narrow the dataset to the parking lot of interest.
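A minimal sketch of that filter:

# Keep only the rows where the parking lot ID is 'NIA North'
df = df.loc[df['SystemCodeNumber'] == 'NIA North']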

The .loc property is very powerful, allowing filtering with a great variety of conditions. More information can be found in the Pandas documentation [5].

You now have the data of interest in df. Nevertheless, real-world data normally comes with errors and/or outliers, and this dataset is no exception, as you can see in the Matplotlib plot shown in Figure 8.

Figure 8: A visual representation of the data shows some inconsistencies.
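A plot like the one in Figure 8 can be produced with something along these lines (the exact plotting calls are an assumption):

import matplotlib.pyplot as plt

# Plot the raw occupancy readings against their timestamps
plt.plot(df['LastUpdated'], df['Occupancy'], '.')
plt.show()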

In Figure 8, the readings only come from isolated days on which measurements were taken. Also, some occupancy values are lower than 0 (which is impossible), so I need to remove these wrong values. The errors will be different in each project, so you normally have to spend some time in this phase thinking about possible errors and chasing them down. It takes some experience to do this quickly, and you might well miss some errors and only detect them further down the road. When that happens, you need to come back to this part of the study and add the appropriate mechanisms to detect them. Thanks to Jupyter's nonlinear workflow, you can do this easily by adding or editing cells in the appropriate places.

Again, the .loc property comes in handy. In this case, I will replace the wrong values with None; if I knew a way to correct them directly, I could have used that instead. Next, I will fill in the missing values with some generic value. Pandas offers the .fillna() method for filling in missing data: you can fill in a constant value (for instance, 0) or reuse the last known value. I will use the last known value in this case, because a good estimate for the occupancy of a parking lot is the occupancy it had previously. The code in Figure 9 shows the commands for the cleanup, and Figure 10 shows the corrected data.

Figure 9: Data cleaning step.
Figure 10: After cleaning, you do not see the inconsistencies.
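A sketch of the cleanup along the lines of Figure 9, assuming the bad readings only affect the Occupancy column; .ffill() is the Pandas shorthand for filling gaps with the last known value:

# Mark impossible (negative) readings as missing ...
df.loc[df['Occupancy'] < 0, 'Occupancy'] = None
# ... then fill each gap with the last known value
df['Occupancy'] = df['Occupancy'].ffill()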

Next is the transformation step. Start by thinking about what the modeling process (the next step) requires. Because you want to do an hourly average of the occupancy expressed as a proportion, you'll need two transformations. First, you need to extract the hour from the date-time field, as shown in Figure 11. With this, you can create a new column that only contains the hour. Next, you need to compute a new column that expresses the occupancy as a proportion, instead of an absolute value (Figure 12). Figure 12 also shows the dataset with the new columns.

Figure 11: Add a new column for the hour.
Figure 12: Adding another column with the percentage of occupancy.
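Both transformations are one-liners; the name of the ratio column here is an illustrative choice, not necessarily the one used in the figures:

# New column with just the hour of each reading
df['Hour'] = df['LastUpdated'].dt.hour

# New column with occupancy as a proportion of capacity
df['OccupancyRatio'] = df['Occupancy'] / df['Capacity']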

To build the model in the data mining step, you actually only need the last two columns. Start by taking all the samples for each hour, and then calculate the average of the occupancy. In other words, group by the Hour column and calculate the mean. Grouping is such a common task that Pandas offers the groupby shorthand (Figure 13).

Figure 13: Grouping and averaging are the focus of this data mining process.
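A sketch of the grouping step:

# Group by hour and average the remaining numerical columns;
# numeric_only skips non-numeric fields such as SystemCodeNumber
model = df.groupby('Hour').mean(numeric_only=True)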

groupby results in a new DataFrame, model, indexed by the unique values of Hour, which contains the average of all the other numerical fields for each hour.

In this simple example, the data mining process was intentionally trivial. In some cases, the grouping and averaging operation can even be considered part of the transformation step. Data mining can be very complex, involving ML/AI processes, different kinds of numerical methods, and other advanced techniques. But there is one secret that all data analysts learn sooner or later: Most of the hard work in the data analytics process happens before the data mining step. You can now use the model to plot a chart of the parking lot's occupancy, as a percentage, at different hours of the day (Figure 14). More complex projects might involve live charts or detailed reports that are sent automatically by email to interested parties.

Figure 14: Final product of the data analytics project.
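For instance, the chart can be drawn directly from model, reusing the OccupancyRatio column name assumed earlier:

import matplotlib.pyplot as plt

# Hourly occupancy profile, expressed as a percentage
(model['OccupancyRatio'] * 100).plot()
plt.xlabel('Hour of day')
plt.ylabel('Average occupancy (%)')
plt.show()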

Conclusions

This article has been a primer on data science. I described how to take the KDD model as the outline for a typical workflow in a data analytics project. You also learned about the main Python libraries used with data science projects. Finally, I reviewed how to get the environment up and running, and I presented a simple example showing how to use it. This brief introduction is just the beginning. I'll leave it to you to discover how to apply the rich Python data analytics ecosystem to the problems you encounter in your own field of expertise.

Infos

  1. Fayyad, U., G. Piatetsky-Shapiro, and P. Smyth, "The KDD process for extracting useful knowledge from volumes of data," Communications of the ACM, 39(11), 1996, pp. 27-34
  2. PyPI: https://pypi.org/
  3. UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Parking+Birmingham
  4. Stolfi, Daniel H., Enrique Alba, and Xin Yao, "Predicting Car Park Occupancy Rates in Smart Cities." In: Smart Cities: Second International Conference, Smart-CT 2017, Málaga, Spain, June 14-16, 2017, pp. 107-117
  5. Pandas DataFrame.loc property: https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.loc.html

The Author

Dr. Emil J. Khatib is a researcher at the University of Málaga in the field of cellular networks and industrial IoT. He also loves programming hardware and web and mobile apps. http://www.emilkhatib.com
