Servery nms
(→Dostupnost sítovycg zarizeni) |
|||
Řádka 68: | Řádka 68: | ||
* High | * High | ||
* Disaster | * Disaster | ||
+ | |||
+ | = Default triggers = | ||
+ | == Linux servers == | ||
+ | | width="322" height="14" align="LEFT" | '''<font size="1">Name</font>''' | ||
+ | | width="267" align="LEFT" | '''<font size="1">Expression</font>''' | ||
+ | | width="67" align="LEFT" | '''<font size="1">Severity</font>''' | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">/etc/inetd.conf has been changed on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.file.cksum[/etc/inetd.conf].diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Warning</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">/etc/passwd has been changed on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.file.cksum[/etc/passwd].diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">/etc/services has been changed on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.file.cksum[/etc/services].diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">/usr/bin/ssh has been changed on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.file.cksum[/usr/bin/ssh].diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">/usr/sbin/sshd has been changed on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.file.cksum[/usr/sbin/sshd].diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">/vmlinuz has been changed on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.file.cksum[/vmlinuz].diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Warning</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Apache is not running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[httpd].last(0)}<1</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Configured max number of opened files is too low on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:kernel.maxfiles.last(0)}<512</font> | ||
+ | | align="LEFT" | <font size="1">Information</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Configured max number of processes is too low on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:kernel.maxproc.last(0)}<256</font> | ||
+ | | align="LEFT" | <font size="1">Information</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Email (SMTP) server is down on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:net.tcp.service[smtp].last(0)}=0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">FTP server is down on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:net.tcp.service[ftp].last(0)}=0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Host information was changed on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:system.uname.diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Information</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Hostname was changed on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:system.hostname.diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Information</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">IMAP server is down on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:net.tcp.service[imap].last(0)}=0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Inetd is not running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[inetd].last(0)}<1</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Lack of free memory on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vm.memory.size[free].last(0)}<10000</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Lack of free swap space on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:system.swap.size[,free].last(0)}<100000</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low free disk space on $server volume /</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.size[/,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low free disk space on $server volume /home</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.size[/home,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low free disk space on $server volume /opt</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.size[/opt,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low free disk space on $server volume /tmp</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.size[/tmp,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low free disk space on $server volume /usr</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.size[/usr,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low free disk space on $server volume /var</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.size[/var,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low number of free inodes on $server volume /</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.inode[/,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low number of free inodes on $server volume /home</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.inode[/home,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low number of free inodes on $server volume /opt</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.inode[/opt,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low number of free inodes on $server volume /tmp</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.inode[/tmp,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Low number of free inodes on $server volume /usr</font> | ||
+ | | align="LEFT" | <font size="1">{$server:vfs.fs.inode[/usr,pfree].last(0)}<10</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Mysql is not running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[mysqld].last(0)}<1</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">News (NNTP) server is down on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:net.tcp.service[nntp].last(0)}=0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">POP3 server is down on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:net.tcp.service[pop].last(0)}=0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Processor load is too high on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:system.cpu.load[,avg1].last(0)}>5</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Server $server is unreachable</font> | ||
+ | | align="LEFT" | <font size="1">{$server:status.last(0)}=2</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">SSH server is down on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:net.tcp.service[ssh].last(0)}=0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Sshd is not running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[sshd].last(0)}<1</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Syslogd is not running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[syslogd].last(0)}<1</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Too many processes on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[].last(0)}>300</font> | ||
+ | | align="LEFT" | <font size="1">High</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Too many processes running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[,,run].last(0)}>10</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Too may users connected on server $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:system.users.num.last(0)}>50</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Version of zabbix_agent(d) was changed on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:agent.version.diff(0)}>0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">WEB (HTTP) server is down on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:net.tcp.service[http].last(0)}=0</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Zabbix_agentd is not running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[zabbix_agentd].last(0)}<1</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">Zabbix_server is not running on $server</font> | ||
+ | | align="LEFT" | <font size="1">{$server:proc.num[zabbix_server].last(0)}<1</font> | ||
+ | | align="LEFT" | <font size="1">Average</font> | ||
+ | |- | ||
+ | | height="14" align="LEFT" | <font size="1">$server has just been restarted</font> | ||
+ | | align="LEFT" | <font size="1">{$server:system.uptime.last(0)}<600</font> | ||
+ | | align="LEFT" | <font size="1">Information</font> | ||
+ | | | ||
= Notifikace = | = Notifikace = |
Verze z 1. 3. 2009, 01:55
Obsah |
Obecné informace
NMS je Network monitoring system. Hlavním cílem tohoto serveru je, monitorování dostupnosti a performance veškerého provozu na Strahově.
Cíle
- performance a alarm monitoring serveru a sitovych prvku
Cílem je prozkoumat, zda je dany spravovaný server aktivní, pokud aktivní je, je vše v pořádku, v opačném případě zašle systém e-mail správci, zástupci a technickému manažerovi. Systém dále zašle SMS správci a jeho zástupci o nekativitě PC.
Monitorované platformy
Switche a routre
- Cisco Catalyst 6509
- Cisco 3750
- Cisco 2950
- Cisco 2960
- Cisco Air Ap 1242AG-E-K9
Servery
- HP Proliant 145 G2
- HP Proliant DL380 G5
- Různe x86a x64 servery
- Dell ???
Jina zarizeni
- KVM
- APC Symmetra LX 16kVA
Operacni systemy
- Debian GNU Linux Lenny
- Debian GNU Linux 4.0 Etch
- Debian GNU Linux 3.1 Sarge
- FreeBSD 6.3
- Microsoft Windows Server 2003 Standart x64
- Microsoft Windows Server 2008
Komunikační protokoly vyuzivany k monitoringu
- ICMP (ping)
- SNMP (161/UDP)
- SYSLOG (514/UDP)
- ZABBIX AGENT (10050/TCP)
- ZABBIX TRAP (10051/TCP)
Alarm monitoring
Dostupnost sítovych zarizeni
- Monitorovaní stavu portů u jednotlivých propojů na páteři a blocích pomocí syslog správ nebo SNMP trapů
- Monitorování dostupnosti síťových prvkú pomocí ICMP pingu
- Seznam síťových prvků a monitorovaných interfaců:[xls]
Dostupnost serverů
- Dostupnost serverů se ověřuje pomoci ICMP pingu.
- Seznam monitorovaných serverů: [xls]
Dostupnost služeb
- Monitorování dostupnosti TCP/UDP služeb je zabezpečen pomocí požadavků na daný TCP/UDP port a kontrolou bežíciho procesu
- Seznam ověřovaných služeb: [xls]
Performance monitoring
- Zátež CPU (load)
- Vyuziti pameti
- Mereni teplot
- Chybovost sitovych interfacu
- Stav baterie APC
Severity
- Information
- Warning
- Average
- High
- Disaster
Default triggers
Linux servers
| width="322" height="14" align="LEFT" | Name | width="267" align="LEFT" | Expression | width="67" align="LEFT" | Severity |- | height="14" align="LEFT" | /etc/inetd.conf has been changed on server $server | align="LEFT" | {$server:vfs.file.cksum[/etc/inetd.conf].diff(0)}>0 | align="LEFT" | Warning |- | height="14" align="LEFT" | /etc/passwd has been changed on server $server | align="LEFT" | {$server:vfs.file.cksum[/etc/passwd].diff(0)}>0 | align="LEFT" | Average |- | height="14" align="LEFT" | /etc/services has been changed on server $server | align="LEFT" | {$server:vfs.file.cksum[/etc/services].diff(0)}>0 | align="LEFT" | Average |- | height="14" align="LEFT" | /usr/bin/ssh has been changed on server $server | align="LEFT" | {$server:vfs.file.cksum[/usr/bin/ssh].diff(0)}>0 | align="LEFT" | Average |- | height="14" align="LEFT" | /usr/sbin/sshd has been changed on server $server | align="LEFT" | {$server:vfs.file.cksum[/usr/sbin/sshd].diff(0)}>0 | align="LEFT" | Average |- | height="14" align="LEFT" | /vmlinuz has been changed on server $server | align="LEFT" | {$server:vfs.file.cksum[/vmlinuz].diff(0)}>0 | align="LEFT" | Warning |- | height="14" align="LEFT" | Apache is not running on $server | align="LEFT" | {$server:proc.num[httpd].last(0)}<1 | align="LEFT" | Average |- | height="14" align="LEFT" | Configured max number of opened files is too low on $server | align="LEFT" | {$server:kernel.maxfiles.last(0)}<512 | align="LEFT" | Information |- | height="14" align="LEFT" | Configured max number of processes is too low on $server | align="LEFT" | {$server:kernel.maxproc.last(0)}<256 | align="LEFT" | Information |- | height="14" align="LEFT" | Email (SMTP) server is down on $server | align="LEFT" | {$server:net.tcp.service[smtp].last(0)}=0 | align="LEFT" | Average |- | height="14" align="LEFT" | FTP server is down on $server | align="LEFT" | {$server:net.tcp.service[ftp].last(0)}=0 | align="LEFT" | Average |- | height="14" align="LEFT" | Host information was changed on $server | align="LEFT" | {$server:system.uname.diff(0)}>0 | align="LEFT" | Information |- | height="14" align="LEFT" | Hostname was changed on $server | align="LEFT" | {$server:system.hostname.diff(0)}>0 | align="LEFT" | Information |- | height="14" align="LEFT" | IMAP server is down on $server | align="LEFT" | {$server:net.tcp.service[imap].last(0)}=0 | align="LEFT" | Average |- | height="14" align="LEFT" | Inetd is not running on $server | align="LEFT" | {$server:proc.num[inetd].last(0)}<1 | align="LEFT" | Average |- | height="14" align="LEFT" | Lack of free memory on server $server | align="LEFT" | {$server:vm.memory.size[free].last(0)}<10000 | align="LEFT" | Average |- | height="14" align="LEFT" | Lack of free swap space on $server | align="LEFT" | {$server:system.swap.size[,free].last(0)}<100000 | align="LEFT" | High |- | height="14" align="LEFT" | Low free disk space on $server volume / | align="LEFT" | {$server:vfs.fs.size[/,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low free disk space on $server volume /home | align="LEFT" | {$server:vfs.fs.size[/home,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low free disk space on $server volume /opt | align="LEFT" | {$server:vfs.fs.size[/opt,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low free disk space on $server volume /tmp | align="LEFT" | {$server:vfs.fs.size[/tmp,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low free disk space on $server volume /usr | align="LEFT" | {$server:vfs.fs.size[/usr,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low free disk space on $server volume /var | align="LEFT" | {$server:vfs.fs.size[/var,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low number of free inodes on $server volume / | align="LEFT" | {$server:vfs.fs.inode[/,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low number of free inodes on $server volume /home | align="LEFT" | {$server:vfs.fs.inode[/home,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low number of free inodes on $server volume /opt | align="LEFT" | {$server:vfs.fs.inode[/opt,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low number of free inodes on $server volume /tmp | align="LEFT" | {$server:vfs.fs.inode[/tmp,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Low number of free inodes on $server volume /usr | align="LEFT" | {$server:vfs.fs.inode[/usr,pfree].last(0)}<10 | align="LEFT" | High |- | height="14" align="LEFT" | Mysql is not running on $server | align="LEFT" | {$server:proc.num[mysqld].last(0)}<1 | align="LEFT" | Average |- | height="14" align="LEFT" | News (NNTP) server is down on $server | align="LEFT" | {$server:net.tcp.service[nntp].last(0)}=0 | align="LEFT" | Average |- | height="14" align="LEFT" | POP3 server is down on $server | align="LEFT" | {$server:net.tcp.service[pop].last(0)}=0 | align="LEFT" | Average |- | height="14" align="LEFT" | Processor load is too high on $server | align="LEFT" | {$server:system.cpu.load[,avg1].last(0)}>5 | align="LEFT" | Average |- | height="14" align="LEFT" | Server $server is unreachable | align="LEFT" | {$server:status.last(0)}=2 | align="LEFT" | High |- | height="14" align="LEFT" | SSH server is down on $server | align="LEFT" | {$server:net.tcp.service[ssh].last(0)}=0 | align="LEFT" | Average |- | height="14" align="LEFT" | Sshd is not running on $server | align="LEFT" | {$server:proc.num[sshd].last(0)}<1 | align="LEFT" | Average |- | height="14" align="LEFT" | Syslogd is not running on $server | align="LEFT" | {$server:proc.num[syslogd].last(0)}<1 | align="LEFT" | Average |- | height="14" align="LEFT" | Too many processes on $server | align="LEFT" | {$server:proc.num[].last(0)}>300 | align="LEFT" | High |- | height="14" align="LEFT" | Too many processes running on $server | align="LEFT" | {$server:proc.num[,,run].last(0)}>10 | align="LEFT" | Average |- | height="14" align="LEFT" | Too may users connected on server $server | align="LEFT" | {$server:system.users.num.last(0)}>50 | align="LEFT" | Average |- | height="14" align="LEFT" | Version of zabbix_agent(d) was changed on $server | align="LEFT" | {$server:agent.version.diff(0)}>0 | align="LEFT" | Average |- | height="14" align="LEFT" | WEB (HTTP) server is down on $server | align="LEFT" | {$server:net.tcp.service[http].last(0)}=0 | align="LEFT" | Average |- | height="14" align="LEFT" | Zabbix_agentd is not running on $server | align="LEFT" | {$server:proc.num[zabbix_agentd].last(0)}<1 | align="LEFT" | Average |- | height="14" align="LEFT" | Zabbix_server is not running on $server | align="LEFT" | {$server:proc.num[zabbix_server].last(0)}<1 | align="LEFT" | Average |- | height="14" align="LEFT" | $server has just been restarted | align="LEFT" | {$server:system.uptime.last(0)}<600 | align="LEFT" | Information |
Notifikace
Notifikace sou rozesílane emailem pres smtp server service.sh.cvut.cz na adresu $server@sh.cvut.cz a přez GSM modul formou SMS (zatim neimplementovano).
Format email notifikace
Head>To: $HOSTALIAS@sh.cvut.cz Head>From: nms@sh.cvut.cz Head>Subject: $HOSTALIAS - $ALARMTYPE Body>Notification Type: $NOTIFICATIONTYPE$ Body>Host: $HOSTALIAS$ Body>State: $HOSTSTATE$ for $HOSTDURATION$ Body>Alarmtype: $ALARMTYPE Body>Address: $HOSTADDRESS$ Body>Info: $HOSTOUTPUT$ Body>Date/Time: $LONGDATETIME$ Body>ACK by: $HOSTACKAUTHOR$ Body>Comment: $HOSTACKCOMMENT$ Body>Severity: $SEVERITY
Format SMS notifikace
- Maximálně 160 znaků.
Head>To: $ADMIN_NUMBER Head>From: $NMS_NUMBER Body>Host: $HOSTALIAS Body>State: $HOSTSTATE$ for $HOSTDURATION$ Body>Date/Time: $LONGDATETIME$ Body>Alarmtype: $ALARMTYPE Body>Severity: $SEVERITY
Informace o serveru nms.sh.cvut.cz
- Operační systém : Debian Linux Lenny
- Hardware : Dell Power Edge
- Správce : James
- Zabbixadmin : Netopier
Software
- zabbix 1.4.2 - activity and performance monitoring
- cacti 0.7.8c - monitoring sitoveho provozu na portech cisco routru
Hostované služby
- monitoring serverů
Administrátorský (root) přístup na server
Římek Jakub - správce serveru Leonov Alexander - zabbix admin
Nastaveni zabbix agenta
/etc/zabbix/zabbix_agent.conf
Server=147.32.127.234 Timeout=3
/etc/zabbix/zabbix_agentd.conf
Server=147.32.127.234 ServerPort=10051 Hostname=$hostname_serveru.sh.cvut.cz ListenPort=10050 ListenIP=$ipadresa_serveru StartAgents=5 Timeout=3
Nastaveni cisco routru
Global settings:
logging 147.32.127.234 snmp-server enable traps snmp-server enable authentication snmp-server enable linkdown linkup snmp-server enable coldstart warmstart snmp-server enable traps chassis snmp-server enable traps module snmp-server enable traps ospf state-change snmp-server enable traps ospf errors snmp-server enable traps config-copy snmp-server enable traps config snmp-server enable traps cpu threshold
Interface settings:
logging event link-status snmp trap link-status