I am not sure if i got your issue right... How ever, network outages on one location can be avoided by redundancy of the network.
If you are monitoring multiple datacenters on different locations, it could become tricky to keep everything redundancy. We use openITCOCKPIT to resolve this issue. This provieds a Master/Slave system. The Master System keeps all monitoring configuration and provide the web interface. The Slave (Satellite Systems) is located at the remote datacenter and executing the checks. If the Master server lost connection to the slave system, the slave will cache all check results. As soon as the connection is back, the master system will post process the cached data from the slave. Master and slave are always connected using an SSH tunnel.
The master system will also generate all configuration files for Nagios for the master and all connected satellite systems. https://openitcockpit.io/beginners/8-setup-distributed-monitoring/
The best: You don't need to care about anything. Just install a Master and a Slave, exchange the SSH keys and thats it. SSH tunneling, caching, config file generation, web interface - all handled by the system itself.
Maybe I'm a bit late on this but you should give openITCOCKPIT a shot.
It is compatible with Nagios and Checkmk but also has an own monitoring agent and you can basicly get started in seconds:
https://openitcockpit.io/beginners/11-monitoring-through-openitcockpit-agent/
​
It's also available for the Raspberry Pi (arm64).
I'm part of the development Team of openITCOCKPIT so take everything you read with a grain of salt 😄
Basically openITCOCKPIT is Nagios on steroids. It's compatible to all Nagios/Naemon/Icinga plugins, has Checkmk build in and also Grafana. Our own monitoring agent provides simple and secure monitoring for Linux, Windows and macOS devices. All of this is free and open source.
Feel free to ask
If you are interested in free, open source and easy to manage monitoring tool i would recommend you to take a look at openITCOCKPIT. I'm part of the development team of openITCOCKPIT which is basicaly Nagios on steroids. Its compatible to all Nagios plugins, has Checkmk build in and also Grafana. Our own monitoring agent provides simple and secure monitoring for Linux, Windows and macOS devices.
You should take a look at openITCOCKPIT. It's maybe not as lightweight as you like it, but it cames with Naemon (fork of Nagios), Grafana and Checkmk by default. It also runs on a Raspberry Pi (which I use to monitor my systems at home)
To setup your monitoring in a few minutes go with the openITCOCKPIT Monitoring Agent: https://openitcockpit.io/beginners/11-monitoring-through-openitcockpit-agent/
Feel free to ask if you have any questions
I work with Nagios (Naemon) every day. I'm part of the development Team of openITCOCKPIT which is Nagios on steroids. Its compatible to all Nagios plugins, has Checkmk build in and also Grafana. Our own monitoring agent provides simple and secure monitoring for Linux, Windows and macOS devices. All of this is free and open source.
In my professional life I'm part of the open source project openITCOCKPIT. It will handle all the Nagios configuration (and much more) for you.
But there are also other solutions out there that can manage the configs for you such as Thruk or Adagios.
I’m one of the developers of openITCOCKIT. You should give it a shot. Monitoring of Windows, Linux and macOS can be easily done:
https://openitcockpit.io/beginners/11-monitoring-through-openitcockpit-agent/
Basically it’s Nagios on steroids
openITCOCKPIT makes it really easy to monitor Windows, Linux and macOS Systems through the own monitoring agent: https://openitcockpit.io/beginners/11-monitoring-through-openitcockpit-agent/
openITCOCKPIT (with Nagios/Naemon/Checkmk and Grafana with Graphite for data visualization) - very easy to configure and to use :)
Written in PHP, but open source and very easy to extend :)
https://docs.it-novum.com/display/ODE/Creating+a+new+openITCOCKPIT+Module
I'm not sure but openITCOCKPIT could be a solution for you. It is based on Naemon (fork of Nagios) for "classic" status checks and also has a Prometheus integration. So you get the classical status information(ok, warning, critical) for Prometheus monitored metrics/services. Both combined in one web interface.
See this blog for more information:
https://openitcockpit.io/2020/2020/10/20/openitcockpit-4-1-with-prometheus-integration/
>Reporting objects are contacts?
My english was bad on this.
"Are 'Reporting objects' Nagios Contacts in your context?" would be a better sentence i guess^^ In the Nagios universe Contacts are used to send notifications but i never had heard of 'Reporting objects' So i was a bit confused
Anyway
>But this answer is disappointing, since it's absolutely the sort of thing one should audit. "Do all of my machines have the right alerting?"
I work with Nagios Core for more than 10 years now but i never wrote the config files by hand. I always used this configuration tool (and I'm also part of the development team)
I never understood why admins would run Nagios without an config tool or an attached database
openITCOCKPIT is an open source based monitoring solution that combines different tools like Nagios/Naemon, Grafana, Checkmk, Prometheus and many more into one modern web interface (with own agent or agentless too)
Technical enabling Nagios to send mails isn't a big thing. Are you trying to setup nagios from scratch or is this a legacy system you need to live with?
Nagios it self is not sending the mail. Nagios will only call a defined command (a bash script or so) which will send the mail using linux commands. However, a lot of things can go wrong and it can be tricky to find the issue - especially if you are not so familiar with linux at all.
​
So if you are starting with Nagios right now I would recommend you to use a monitoring suite which comes with an easy to use GUI and documentation. There are a few out there, also open source like: https://openitcockpit.io/beginners/7-setup-email-notifications/
If you have to go with your current setup you need to go through all the required steps like checking the mailserver (maybe postfix or exim), executing the mail script manually, checking your notification command in nagios, checking your contact definition and timeperiod definition, make sure enable_notifications is set to 1 in nagios.cfg etc... etc...
You can give https://openitcockpit.io/ a shot. It is open source and based on Nagios or Naemon. You are able to create the entire monitoring configuration through the web interface or the JSON API. Also it comes with modules for reporting, distributed monitoring and much more
You can give https://openitcockpit.io/ a try. It's an open source web interface to create Nagios/Naemon configuration files. Providing an JSON API, Debian packages, and lots of other stuff.
If you are a little bit lucky, you can also import your current nagios configs: https://github.com/it-novum/openITCOCKPIT-configuration-import