Setting up a data analytics environment in Linux with Python

Down in the Mine

Article from Issue 259/2022

Author(s): Emil J. Khatib

The Knowledge Discovery in Data Mining (KDD) method breaks the business of data analytics into easy-to-understand steps. We'll show you how to get started with KDD and Python.

Data analytics is a major force in the current zeitgeist. Analytics are the eyes and ears on a very wide variety of domains (society, climate, health, etc.) to perform an even wider variety of tasks (such as understanding commercial trends, the spread of COVID-19, and finding exoplanets). In this article, I will discuss some fundamentals of data analytics and show how to get started with analytics in Python. Finally, I will show the whole process at work on a simple data analytics problem.

A Primer on Data Analytics

Data analytics uses tools from statistics and computer science (CS), such as artificial intelligence (AI) and machine learning (ML), to extract information from collected data. The collected data is usually very complex and voluminous, and it cannot be interpreted easily (or at all) by humans. Therefore, the data on its own is useless. Information lies hidden within the data, and it takes many forms: repeating patterns, trends, classifications, or even predictive models. You can use this data to uncover insights and build knowledge of the problem you are studying. For example, suppose you wish to measure the traffic in a parking lot that is monitored by a network of IoT sensors covering the whole city. Reading a single occupancy sensor doesn't say anything about the traffic on its own. Neither do the readings of all the parking sensors of the city without any more context. But the timestamped percentage of occupied places within the monitored parking lot does tell us something, and we use this information to derive insights, such as the times of day with maximum traffic.

Learning the mathematical background and analytics tools is only half the journey. Field expertise (experience on the problem that is being studied) is equally important. Some data scientists come from a statistics background, others are computer scientists who pick up the statistics as they go, and many are people starting from a field of expertise who need to learn both the statistics and the computing tools.

[...]

Use Express-Checkout link below to read the full article (PDF).

Buy this article as PDF

Express-Checkout as PDF

Price $2.95
(incl. VAT)

Buy Linux Magazine

SINGLE ISSUES

Print Issues

Digital Issues

SUBSCRIPTIONS

Print Subs

Digisubs

TABLET & SMARTPHONE APPS

US / Canada

UK / Australia

Support Our Work

Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News

EU Sovereign Tech Fund Gains Traction

funding , open source , Security

OpenForum Europe recently released a report regarding a sovereign tech fund with backing from several significant entities.
FreeBSD Promises a Full Desktop Installer

Desktop , FreeBSD , open source

FreeBSD has lacked an option to include a full desktop environment during installation.
Linux Hits an Important Milestone

Linux , open source

If you pay attention to the news in the Linux-sphere, you've probably heard that the open source operating system recently crashed through a ceiling no one thought possible.
Plasma Bigscreen Returns

KDE , open source , Plasma

A developer discovered that the Plasma Bigscreen feature had been sitting untouched, so he decided to do something about it.
CachyOS Now Lets Users Choose Their Shell

CachyOS , shell , Wayland

Imagine getting the opportunity to select which shell you want during the installation of your favorite Linux distribution. That's now a thing.
Wayland 1.24 Released with Fixes and New Features

communication , Linux , Wayland

Wayland continues to move forward, while X11 slowly vanishes into the shadows, and the latest release includes plenty of improvements.
Bugs Found in sudo

Linux , Security

Two critical flaws allow users to gain access to root privileges.
Fedora Continues 32-Bit Support

Fedora , Games , Linux

In a move that should come as a relief to some portions of the Linux community, Fedora will continue supporting 32-bit architecture.
Linux Kernel 6.17 Drops bcachefs

Filesystem , Kernel , Linux

After a clash over some late fixes and disagreements between bcachefs's lead developer and Linus Torvalds, bachefs is out.
ONLYOFFICE v9 Embraces AI

Artificial Inte... , open source , OpenOffice

Like nearly all office suites on the market (except LibreOffice), ONLYOFFICE has decided to go the AI route.

Setting up a data analytics environment in Linux with Python

Down in the Mine

A Primer on Data Analytics

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

Support Our Work

News

EU Sovereign Tech Fund Gains Traction

FreeBSD Promises a Full Desktop Installer

Linux Hits an Important Milestone

Plasma Bigscreen Returns

CachyOS Now Lets Users Choose Their Shell

Wayland 1.24 Released with Fixes and New Features

Bugs Found in sudo

Fedora Continues 32-Bit Support

Linux Kernel 6.17 Drops bcachefs

ONLYOFFICE v9 Embraces AI

Setting up a data analytics environment in Linux with Python

Down in the Mine

A Primer on Data Analytics

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters Find Linux and Open Source Jobs Subscribe to our ADMIN Newsletters

Support Our Work

News

Tag Cloud

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters