[Nagios-users] Scheduled checks falling far behind

Litwin, Matthew mlitwin at stubhub.com
Sat Oct 23 22:31:36 UTC 2010


On Oct 22, 2010, at 6:53 PM, Frost, Mark {PBC} wrote:

> Matthew,
> 
> You don't say, but my guess would be that you have high latencies.  That is for one of several reasons, Nagios is not able to run checks when it thinks it should.  You can see this information and other stats by looking at the Performance item near the bottom of the Nav pane in the Nagios web interface.

Hi Mark,

I have set up MRTG to track nagios performace and it is reporting that latency for host and service checks are next to nothing and service execution time is just under 400 ms, however, host checks are coming back at around 4 seconds. Based on the samples that are in the MRTG section of the tuning guide, my stats would indicate that service execution times are fine, but that host execution checks are high.

nagiostat says this, and if I am reading it right, it contradicts what MRTG is report:


Nagios Stats 3.2.1
Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org)
Last Modified: 03-09-2010
License: GPL

CURRENT STATUS DATA
------------------------------------------------------
Status File:                            /usr/local/nagios/var/status.dat
Status File Age:                        0d 0h 0m 21s
Status File Version:                    3.2.1

Program Running Time:                   0d 5h 40m 15s
Nagios PID:                             5828
Used/High/Total Command Buffers:        0 / 0 / 4096

Total Services:                         4987
Services Checked:                       4987
Services Scheduled:                     4970
Services Actively Checked:              4987
Services Passively Checked:             0
Total Service State Change:             0.000 / 5.860 / 0.001 %
Active Service Latency:                 0.236 / 477.406 / 458.101 sec
Active Service Execution Time:          0.013 / 12.393 / 0.377 sec
Active Service State Change:            0.000 / 5.860 / 0.001 %
Active Services Last 1/5/15/60 min:     195 / 1799 / 4970 / 4970
Passive Service Latency:                0.000 / 0.000 / 0.000 sec
Passive Service State Change:           0.000 / 0.000 / 0.000 %
Passive Services Last 1/5/15/60 min:    0 / 0 / 0 / 0
Services Ok/Warn/Unk/Crit:              4971 / 10 / 1 / 5
Services Flapping:                      0
Services In Downtime:                   0

Total Hosts:                            241
Hosts Checked:                          241
Hosts Scheduled:                        241
Hosts Actively Checked:                 241
Host Passively Checked:                 0
Total Host State Change:                0.000 / 0.000 / 0.000 %
Active Host Latency:                    444.598 / 469.511 / 455.625 sec
Active Host Execution Time:             0.152 / 4.045 / 3.779 sec
Active Host State Change:               0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min:        8 / 19 / 241 / 241
Passive Host Latency:                   0.000 / 0.000 / 0.000 sec
Passive Host State Change:              0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min:       0 / 0 / 0 / 0
Hosts Up/Down/Unreach:                  241 / 0 / 0
Hosts Flapping:                         0
Hosts In Downtime:                      0

Active Host Checks Last 1/5/15 min:     8 / 38 / 267
   Scheduled:                           8 / 33 / 249
   On-demand:                           0 / 5 / 18
   Parallel:                            8 / 33 / 249
   Serial:                              0 / 0 / 0
   Cached:                              0 / 5 / 18
Passive Host Checks Last 1/5/15 min:    0 / 0 / 0
Active Service Checks Last 1/5/15 min:  332 / 1949 / 5883
   Scheduled:                           332 / 1949 / 5883
   On-demand:                           0 / 0 / 0
   Cached:                              0 / 0 / 0
Passive Service Checks Last 1/5/15 min: 0 / 0 / 0

External Commands Last 1/5/15 min:      0 / 0 / 0






More information about the Nagios-users mailing list