Network monitoring from the cloud

Dashboard Tour

Now that you've installed the agents on multiple computers and can access them from the Cloud dashboard, it's time to familiarize yourself with the dashboard.

Netdata collects monitoring data from dozens of hardware and software components, such as CPU, memory, disks, networking, filesystems, and more. What makes it even more useful though is that Netdata can also collect metrics from hundreds of popular services and applications.

Netdata deploys collectors to gather the metrics. It includes collectors for collating performance data from some of the popular services and apps, such as Apache, NGINX, Tomcat, MySQL, Postgres, MongoDB, Ceph, OpenLDAP, Tor, Docker, and more [5].

All the collected metrics are exposed via the Netdata dashboard as interactive charts. Netdata shows all its charts on a single scrollable page. You can also navigate between the various elements using the menu placed on the dashboard's right-hand side. Note, however, that if you run Netdata on multiple computers that run different operating systems or different versions, the menus might look a little different for each one.

Using the mouse, you can drag the charts to the left or right to move forward and backward through the different time intervals. Similarly, you can change the time markers by holding down the Shift key as you scroll within a chart. To reset a chart to its default view, simply double click inside it.

The good thing about Netdata's visualization is that when you change the view on one chart, it automatically replicates the same view on the other charts as well. Thanks to this feature, you'll always get a synchronized view of the metrics.

The charts themselves are self-explanatory. At the top, you get an overview of the computer's resources. This is followed by a summary of the computer's CPUs, including their utilization and information about the interrupts handled by each, in addition to other aspects. Similarly, you get real-time information about the system's memory utilization, and so on (Figure 4).

Figure 4: In addition to the built-in collectors, you can pull in additional ones via plugins.

Most of the charts have a brief description to explain the feature they display along with its importance. Unless you're well-versed with monitoring Linux/BSD systems, you should spend some time exploring the individual metrics and how they can be used to monitor your systems' health.

Get Alerts

In addition to the active performance monitoring, the Netdata Agent can also help you ensure your systems and applications are healthy by alerting you about possible issues. The Netdata Agent includes dozens of preconfigured alarms that trigger alerts when a monitoring component requires your attention.

As mentioned earlier, these alarms are preconfigured with sensible defaults. Just like Netdata itself, these alarms have been designed by the tool's system administrator community, which means the alarms will be activated automatically upon the agent's installation. That said, while you don't need to edit them, the alarms can be customized to meet your needs.

You can access Netdata's alarm notifications system by clicking the alarms button (the bell icon) at the top of the dashboard. This will bring up a screen that shows the currently raised alarms, along with tabs to view all running alarms, as well as the alarms log (Figure 5).

Figure 5: You can view details about the raised alarms, as well as go to the chart where the alarms were raised for further analysis.

To tune a default alarm, switch to the All tab. This page will list the various alarms along with their preconfigured settings. The source row in the tab points to the configuration file that controls the settings for a particular alarm (Figure 6). You'll need to edit the file and adjust the settings as per your requirements.

Figure 6: Below every alarm's name is a badge that updates automatically to show the chart's current value.

For instance, the /usr/lib/netdata/conf.d/health.d/ram.conf file controls the alarms related to a computer's physical RAM. By default, Netdata will warn you when the amount of used RAM crosses the 80 percent threshhold. You can change this behavior by editing the value in the warn line.

After you've saved the file, you can reload the health monitoring settings with:

$ sudo netdatacli reload-health

See the project's documentation on the health monitoring system [6] to understand the other lines in an alarm's configuration file.

Going Further

Once you've become accustomed to Netdata, it's time to explore the various settings and configure it to meet your requirements. The project has excellent documentation (as referenced throughout the article).

While I've covered most of Netadata's basic features, there's a lot more you can do with it. You can, for instance, export and import snapshots [7] of your dashboard's contents, which helps diagnose major errors and anomalies. You can also create custom dashboards that do a better job of visualizing the metrics in which you are interested.

Despite server monitoring being an already crowded space, Netdata has managed to create a wide berth for itself thanks to its ease of use and customizability. No wonder then that it is one of the most starred projects in the Cloud Native Computing Foundation landscape.

The Author

Mayank Sharma has been writing and reporting on open source software from all over the globe for almost two decades.

Buy this article as PDF

Express-Checkout as PDF
Price $2.95
(incl. VAT)

Buy Linux Magazine

Get it on Google Play

US / Canada

Get it on Google Play

UK / Australia

Related content

  • Netdata

    What cannot be measured cannot be improved. Netdata lets you measure almost anything – at least as long as it's about the performance and health of a Linux computer.

comments powered by Disqus