Disaster tolerance with Apache Cassandra

Highly Available

Lead Image © Igor Zakharevich, 123RF.com

Article from Issue 233/2020

Author(s): Aleksandr Volochnev

The size and scope of today's Internet companies require more than your average SQL. Apache Cassandra is one of the NoSQL systems filling the need for high availability at scale.

Apache Cassandra is an open source NoSQL distributed database that stores and manages large volumes of data on standard servers. Cloud providers use Cassandra for configurations with many data centers spread across global networks.

The story of Apache Cassandra began in 2007, when Facebook engineers Prashant Malik and Avinash Lakshman developed a very early version for Facebook's inbox search. The challenge was to store huge datasets residing on hundreds of servers. A year later, Facebook released Cassandra on Google Code, making it an open source project. In 2009, it joined the Apache Incubator, paving the way for it to become a top-level Apache Software Foundation project. Since then, many well-known companies have implemented Cassandra or a commercial version (DataStax Enterprise), including Apple, Netflix, Twitter, Sony, eBay, Walmart, and FedEx.

Cassandra and other NoSQL alternatives are part of a new generation of data tools designed to fulfill the massive storage needs of the Internet era. A conventional relational (SQL) database is difficult to cluster, subdivide, or scale horizontally. Companies can either keep their data at a single location and let their customers contend with long wait times to access it remotely, or they can operate two instances of the database. Neither of these scenarios is viable for a modern international company that needs both global data availability and the ability to grow without disproportionate additional costs.

NoSQL systems are built to be extremely scalable. To increase performance, you can simply add nodes to the cluster on the fly. Because throughput grows roughly in proportion to the number of nodes, doubling the performance of the database means adding about as many nodes as the cluster already has.

Apache Cassandra is written in Java and has symmetrical nodes organized in clusters, rather than the master and named nodes used with SQL implementations. Cassandra is useful for real-time data storage for online applications that handle many transactions. You can also use Cassandra as a read-intensive database for business intelligence systems. If you're accustomed to SQL, you'll find that the Cassandra Query Language (CQL) is strongly reminiscent of SQL in terms of syntax and keywords.

Cassandra is designed for a distributed environment. To fully implement Cassandra's disaster tolerance capabilities on a massive scale, companies need to distribute the data across different regions or even different cloud providers. If one instance fails, some latency may occur, but the data remains available.
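To make the idea of surviving a regional outage more concrete, the following Python sketch models replicas spread over several data centers and checks whether a majority (quorum) of them is still reachable after one or more sites fail. It is a simplified illustration, not Cassandra code: the region names, the replica counts, and the helper functions are assumptions made for this example. The quorum formula (half of the replicas, rounded down, plus one) matches how Cassandra defines its QUORUM consistency level, but a real cluster also has to deal with individual node failures and per-request consistency settings.

```python
def quorum(replicas: int) -> int:
    """Majority of replicas: floor(replicas / 2) + 1."""
    return replicas // 2 + 1


def surviving_replicas(replication: dict[str, int], failed: set[str]) -> int:
    """Count the replicas held by data centers that are still reachable."""
    return sum(count for dc, count in replication.items() if dc not in failed)


def quorum_available(replication: dict[str, int], failed: set[str]) -> bool:
    """True if a cluster-wide majority of replicas survives the outage."""
    total = sum(replication.values())
    return surviving_replicas(replication, failed) >= quorum(total)


# Hypothetical layout: three replicas of every row in each of three regions.
layout = {"eu-west": 3, "us-east": 3, "ap-south": 3}

# Nine replicas in total, so a quorum needs five of them.
print(quorum(sum(layout.values())))

# One region down: six replicas remain, which still satisfies the quorum.
print(quorum_available(layout, {"eu-west"}))

# Two regions down: only three replicas remain, so the quorum is lost.
print(quorum_available(layout, {"eu-west", "us-east"}))
```

In this model, a three-region layout tolerates the loss of any single region, whereas a layout with only two equally sized regions does not, because losing one of them leaves exactly half of the replicas and therefore no majority.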

CAP Theorem

The CAP theorem is a principle of computer science that helps to explain why NoSQL systems like Cassandra differ from conventional data tools. The CAP theorem (or Brewer's theorem), which describes the relationship between consistency (C), availability (A), and partition tolerance (P), was first articulated by Eric Allen Brewer, Professor Emeritus of Computer Science at the University of California, Berkeley, and Vice President of Infrastructure at Google. CAP forms the basis for planning a distributed architecture. The basic parts of the CAP decision framework are:

[...]
