Scheduled Downtime · Nagios Core Documentation

Need Help Configuring Nagios?

Our tech support team is happy to help you with any questions you might have. Contact us on our online support forum at https://support.nagios.com/forum/

Nagios XI Makes Monitoring Easier:

Nagios XI is the easy-to-use, enterprise version of Nagios that features:

Web-Based Configuration provides advanced configuration features
Monitoring Wizards make it easy to monitor new devices, applications, and services
Customizable Dashboards allow for per-user customization
Integrated Performance Graphs provide trending and capacity planning information
Advanced Reports provide data insight and exporting capabilities
Data Visualizations enable powerful analysis of patterns and problems
Nagios Core Import functionality makes it easy to migrate from Nagios Core
... and many other features

Download a free 30-day trial to give Nagios XI a spin.

Inquire today and let our Quickstart team help you get started with Nagios XI

Up To Up To: Contents
See Also: Notifications

Introduction

Nagios Core allows you to schedule periods of planned downtime for hosts and service that you're monitoring. This is useful in the event that you actually know you're going to be taking a server down for an upgrade, etc.

Scheduling Downtime

You can schedule downtime for hosts and service through the extinfo CGI (either when viewing host or service information). Click in the "Schedule downtime for this host/service" link to actually schedule the downtime.

Once you schedule downtime for a host or service, Nagios Core will add a comment to that host/service indicating that it is scheduled for downtime during the period of time you indicated. When that period of downtime passes, Nagios Core will automatically delete the comment that it added.

Fixed vs. Flexible Downtime

When you schedule downtime for a host or service through the web interface you'll be asked if the downtime is fixed or flexible. Here's an explanation of how "fixed" and "flexible" downtime differs:

Fixed downtime starts and stops at the exact start and end times that you specify when you schedule it.

Flexible downtime is intended for times when you know that a host or service is going to be down for X minutes (or hours), but you don't know exactly when that'll start. When you schedule flexible downtime, Nagios Core will start the scheduled downtime sometime between the start and end times you specified. The downtime will last for as long as the duration you specified when you scheduled the downtime. This assumes that the host or service for which you scheduled flexible downtime either goes down (or becomes unreachable) or goes into a non-OK state sometime between the start and end times you specified. The time at which a host or service transitions to a problem state determines the time at which Nagios Core actually starts the downtime. The downtime will then last for the duration you specified, even if the host or service recovers before the downtime expires. This is done for a very good reason. As we all know, you might think you've got a problem fixed, but then have to restart a server ten times before it actually works right.

Triggered Downtime

When scheduling host or service downtime you have the option of making it "triggered" downtime. What is triggered downtime? With triggered downtime the start of the downtime is triggered by the start of some other scheduled host or service downtime. This is extremely useful if you're scheduling downtime for a large number or hosts or services and the start time of the downtime period depends on the start time of another downtime entry. For instance, if you schedule flexible downtime for a particular host (because its going down for maintenance), you might want to schedule triggered downtime for all of that hosts's "children".

How Scheduled Downtime Affects Notifications

When a host or service is in a period of scheduled downtime, Nagios Core will not allow normal notifications to be sent out for the host or service. However, a "DOWNTIMESTART" notification will get sent out for the host or service, which will serve to put any admins on notice that they won't receive upcoming notifications.

When the scheduled downtime is over, Nagios Core will allow normal notifications to be sent out for the host or service again. A "DOWNTIMEEND" notification will get sent out notifying admins that the scheduled downtime is over, and they will start receiving notifications again.

If the scheduled downtime is canceled prematurely (before it expires), a "DOWNTIMECANCELLED" notification will get sent out to the appropriate admins.

Overlapping Scheduled Downtime

I like to refer to this as the "Oh crap, its not working" syndrome. You know what I'm talking about. You take a server down to perform a "routine" hardware upgrade, only to later realize that the OS drivers aren't working, the RAID array blew up, or the drive imaging failed and left your original disks useless to the world. Moral of the story is that any routine work on a server is quite likely to take three or four times as long as you had originally planned.

Let's take the following scenario:

You schedule downtime for host A from 7:30pm-9:30pm on a Monday
You bring the server down about 7:45pm Monday evening to start a hard drive upgrade
After wasting an hour and a half battling with SCSI errors and driver incompatabilities, you finally get the machine to boot up
At 9:15 you realize that one of your partitions is either hosed or doesn't seem to exist anywhere on the drive
Knowing you're in for a long night, you go back and schedule additional downtime for host A from 9:20pm Monday evening to 1:30am Tuesday Morning.

If you schedule overlapping periods of downtime for a host or service (in this case the periods were 7:40pm-9:30pm and 9:20pm-1:30am), Nagios will wait until the last period of scheduled downtime is over before it allows notifications to be sent out for that host or service. In this example notifications would be suppressed for host A until 1:30am Tuesday morning.