sslug-teknik team mailing list archive
-
sslug-teknik team
-
Mailing list archive
-
Message #78540
Crash problem.
Hej
Jeg har en server som kører som en Nagios slave. Dvs at den står stand-by
og bare modtager service/host-check resultater fra en master server. Jeg
har dog det problem at på et eller andet tidspunkt så låser den helt og er
komplet umulig at komme i kontakt med.
I loggen har jeg følgende, som kan ses herunder
Jeg kan gætte mig at load'et stiger explosivt pga. maskinen skal til at
swappe helt extremt og så til sidst løber tør for swap / ram.
Jeg kan dog tænke mig til at det evt. har med at gøre at Nagios modtager
flere checks fra masteren end den når at få kørt igennem den nagios.cmd.
Jeg har nu ændret det så nagios checker dens cmd fil hvert sekund for nye
externe kommandoer; men jeg vil meget gerne undgå at dette sker igen.
Nogen der evt har oplevet noget lign. eller evt. har en løsning ?
Det skal sige at Master serveren send check-resultater fra Master til
Slave via send_ncsa kommandoen.
Mvh
Brian Møller
Dec 6 15:30:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 12
Dec 6 15:30:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 12
Dec 6 15:31:01 w04lx002 /usr/sbin/cron[27254]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:31:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 19
Dec 6 15:31:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 19
Dec 6 15:31:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 29
Dec 6 15:31:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 29
Dec 6 15:31:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 35
Dec 6 15:31:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 35
Dec 6 15:31:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 43
Dec 6 15:31:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 43
Dec 6 15:32:01 w04lx002 /usr/sbin/cron[27370]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:32:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 52
Dec 6 15:32:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 52
Dec 6 15:32:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 61
Dec 6 15:32:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 61
Dec 6 15:32:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 71
Dec 6 15:32:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 71
Dec 6 15:32:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 79
Dec 6 15:32:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 79
Dec 6 15:33:01 w04lx002 /usr/sbin/cron[27414]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:33:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 85
Dec 6 15:33:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 85
Dec 6 15:33:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 90
Dec 6 15:33:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 90
Dec 6 15:33:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 93
Dec 6 15:33:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 93
Dec 6 15:33:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 95
Dec 6 15:33:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 95
Dec 6 15:34:01 w04lx002 /usr/sbin/cron[27425]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:34:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 96
Dec 6 15:34:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 96
Dec 6 15:34:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:34:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:34:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:34:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:34:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:34:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:35:01 w04lx002 /usr/sbin/cron[27434]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:35:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:35:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:35:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:35:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:35:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:35:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:35:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:35:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:36:01 w04lx002 /usr/sbin/cron[27443]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:36:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:36:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:36:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:36:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:36:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:36:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:36:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:36:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:37:01 w04lx002 /usr/sbin/cron[27454]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:37:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:37:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:37:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:37:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:37:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:37:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:37:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:37:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:38:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:38:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:38:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:38:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:38:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:38:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:38:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:38:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:39:01 w04lx002 /usr/sbin/cron[27503]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:39:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:39:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:39:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:39:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:39:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:39:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:39:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:39:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:40:01 w04lx002 /usr/sbin/cron[27509]: (root) CMD (test -x
/usr/sbin/run-crons && /usr/sbin/run-crons )
Dec 6 15:40:01 w04lx002 /usr/sbin/cron[27510]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:40:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:40:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:40:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 98
Dec 6 15:40:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 98
Dec 6 15:40:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 99
Dec 6 15:40:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 99
Dec 6 15:40:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 100
Dec 6 15:40:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 100
Dec 6 15:41:01 w04lx002 /usr/sbin/cron[27519]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:41:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 100
Dec 6 15:41:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 100
Dec 6 16:47:14 w04lx002 oom-killer: gfp_mask=0xd2
Dec 6 16:47:14 w04lx002 DMA per-cpu:
Dec 6 16:47:14 w04lx002 cpu 0 hot: low 2, high 6, batch 1
Dec 6 16:47:14 w04lx002 cpu 0 cold: low 0, high 2, batch 1
Dec 6 16:47:14 w04lx002 Normal per-cpu:
Dec 6 16:47:14 w04lx002 cpu 0 hot: low 32, high 96, batch 16
Dec 6 16:47:14 w04lx002 cpu 0 cold: low 0, high 32, batch 16
Dec 6 16:47:14 w04lx002 HighMem per-cpu:
Dec 6 16:47:14 w04lx002 cpu 0 hot: low 32, high 96, batch 16
Dec 6 16:47:14 w04lx002 cpu 0 cold: low 0, high 32, batch 16
Dec 6 16:47:14 w04lx002
Dec 6 16:47:14 w04lx002 Free pages: 6080kB (1024kB HighMem)
Dec 6 16:47:14 w04lx002 Active:4085 inactive:358873 dirty:0 writeback:0
unstable:0 free:1520 slab:13909 mapped:366023 pagetables:5542
Dec 6 16:47:14 w04lx002 DMA free:2928kB min:16kB low:32kB high:48kB
active:8284kB inactive:1148kB present:16384kB
Dec 6 16:47:14 w04lx002 protections[]: 8 476 732
Dec 6 16:47:14 w04lx002 Normal free:2128kB min:936kB low:1872kB
high:2808kB active:4116kB inactive:786148kB present:901120kB
Dec 6 16:47:14 w04lx002 protections[]: 0 468 724
Dec 6 16:47:14 w04lx002 HighMem free:1024kB min:512kB low:1024kB
high:1536kB active:3940kB inactive:648196kB present:655232kB
Dec 6 16:47:14 w04lx002 protections[]: 0 0 256
Dec 6 16:47:14 w04lx002 DMA: 0*4kB 0*8kB 1*16kB 3*32kB 2*64kB 1*128kB
0*256kB 1*512kB 0*1024kB 1*2048kB 0*4096kB = 2928kB
Dec 6 16:47:14 w04lx002 Normal: 0*4kB 0*8kB 9*16kB 2*32kB 2*64kB 2*128kB
2*256kB 0*512kB 1*1024kB 0*2048kB 0*4096kB = 2128kB
Dec 6 16:47:14 w04lx002 HighMem: 0*4kB 0*8kB 0*16kB 0*32kB 0*64kB 0*128kB
0*256kB 0*512kB 1*1024kB 0*2048kB 0*4096kB = 1024kB
Dec 6 16:47:14 w04lx002 Swap cache: add 289859, delete 289859, find
58503/59217, race 0+0
Dec 6 16:47:14 w04lx002 Out of Memory: Killed process 29477 (nagios).
Dec 6 15:42:01 w04lx002 /usr/sbin/cron[27524]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:41:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 101
Dec 6 15:41:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 101
Dec 6 15:41:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 101
Dec 6 15:41:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 101
Dec 6 15:41:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 102
Dec 6 15:41:47 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 102
Dec 6 15:42:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 102
Dec 6 15:42:02 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 102
Dec 6 15:42:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 103
Dec 6 15:42:17 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 103
Dec 6 15:42:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MTA: load average: 103
Dec 6 15:42:32 w04lx002 sm-mta[5348]: rejecting connections on daemon
MSA: load average: 103
Dec 6 15:43:01 w04lx002 /usr/sbin/cron[27529]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:44:01 w04lx002 /usr/sbin/cron[27531]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 6 15:45:01 w04lx002 /usr/sbin/cron[27533]: (root) CMD
(/usr/nagios/libexec/check_master_nagios.sh)
Dec 7 08:38:52 w04lx002 syslog-ng[4941]: syslog-ng version 1.6.5 starting