
Monitoring system Icinga (a fork of Nagios): Q&A

Posted on 2014-07-28

Q

[1397706401] Error: Could not load module '/usr/local/icinga/lib/idomod.so' -> /tmp/icinganebmodWoV8IA: failed to map segment from shared object: Operation not permitted
[1397706401] Error: Failed to load module '/usr/local/icinga/lib/idomod.so'.

A

	
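My /tmp is mounted with noexec, so Icinga cannot map the module copy it writes there.
I moved the temporary files to another location by adding the following options at build time:
--with-temp-dir=/usr/local/icinga/var
--with-temp-file=/usr/local/icinga/var/icinga.tmp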
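
Before rebuilding, it can help to confirm that noexec really is the cause and that the new temp directory is not affected by it too. The following is a minimal sketch, not part of Icinga: the function names `mount_options` and `is_noexec` are hypothetical, and it assumes the mount table is supplied as text in the `/proc/mounts` format (device, mount point, filesystem type, options, ...). It does not resolve symlinks or decode escaped spaces in mount points.

```python
import os


def mount_options(path, mounts_text):
    """Return the mount options of the deepest mount point containing path.

    mounts_text is expected in /proc/mounts format:
    device mount_point fstype options dump pass
    """
    path = os.path.abspath(path)
    best_point, best_opts = "", []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        mount_point, opts = fields[1], fields[3].split(",")
        prefix = mount_point.rstrip("/") + "/"
        if path == mount_point or path.startswith(prefix):
            # Prefer the longest mount point; on ties the later entry wins,
            # because later mounts shadow earlier ones.
            if len(mount_point) >= len(best_point):
                best_point, best_opts = mount_point, opts
    return best_opts


def is_noexec(path, mounts_text):
    """True if the filesystem holding path is mounted with noexec."""
    return "noexec" in mount_options(path, mounts_text)
```

To check a live system, read `/proc/mounts` and pass its contents as `mounts_text`, once for `/tmp` and once for the directory given to `--with-temp-dir`.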

Q

	
If you add --enable-embedded-perl --with-perlcache to ./configure in icinga-core, the subsequent make fails with:
../include/epn_icinga.h:30:20: error: EXTERN.h: No such file or directory

A

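I have not looked into a fix yet. EXTERN.h is one of the Perl core headers, which are packaged separately from the interpreter on many distributions (for example perl-devel on RHEL/CentOS or libperl-dev on Debian/Ubuntu), so a missing development package is a likely cause.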

Q

The Icinga web interface shows "Warning: Status data OUTDATED! Last status data update"

A

Tune the MySQL performance settings in my.cnf,
for example by using my-huge.cnf in /usr/share/mysql as a reference.

Q

[1398763667] Warning: Breaking out of check result reaper: max reaper time (30) exceeded. Reaped 32 results, but more checkresults to process. Perhaps check core performance tuning tips?

A

With nsca + command, the large number of checks degrades performance. Switching to nsca + ocp daemon can improve it.

Q

What is the difference between the three values shown in the Icinga Classic UI: Unacknowledged / Acknowledged / Handled?

A

Quoted from the icinga-users mailing list:
https://www.mail-archive.com/icinga-users@lists.sourceforge.net/msg01576.html

Unacknowledged == THE SKY IS FALLING!
Acknowledged == It broke, but it's being attended to
Handled == We planned on breaking this host/service on purpose; nothing to
see here, move along. (in addition for services, the host was Acknowledged)

--> "Unacknowledged" means a check you were expecting to return UP/OK
returned for that particular state (Down, Critical, Warning etc) and it
hasn't been acknowledged... in other words these are "real" problems that
haven't gotten any attention yet, at least as far as Icinga knows.

--> "Acknowledged" means a check you were expecting to return
UP/OK returned for that particular state (Down, Critical, Warning etc), but
that someone has acknowledged it. Which means someone has told Icinga that
they know about it and it can (hopefully) be assumed that it is being dealt
with in some fashion.

--> "Handled" means a check you were expecting to probably *NOT* return
UP/OK returned for that particular state (Down, Critical, Warning etc).
Handled is determined when a host or service is in scheduled downtime or in
the case of services, also when its host has been acknowledged. In a way,
scheduled downtimes are kind of like a pre-acknowledgement of a purposely
made issue/outage which makes them a distinctly different type of event
than something that is a surprise and needs to be Acknowledged after the
fact.
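
The three categories described above can be summarized as a small decision function. This is only an illustrative sketch of the quoted explanation, not Icinga's actual code: the function name `classify_problem` and its parameters are hypothetical, and checking "Handled" before "Acknowledged" is an assumption, since the quoted text does not say which takes precedence when both apply.

```python
def classify_problem(acknowledged, in_scheduled_downtime,
                     is_service=False, host_acknowledged=False):
    """Classify a host or service that is in a problem state.

    Assumption: "Handled" is checked first; the quoted explanation does
    not define the precedence between Handled and Acknowledged.
    """
    # Scheduled downtime acts like a pre-acknowledgement of a planned outage.
    if in_scheduled_downtime:
        return "Handled"
    # For services only: an acknowledged host also marks the service handled.
    if is_service and host_acknowledged:
        return "Handled"
    # Someone has told Icinga they know about the problem.
    if acknowledged:
        return "Acknowledged"
    # A real problem that nobody has looked at yet.
    return "Unacknowledged"
```

For example, a service whose host has been acknowledged is reported as "Handled" even if the service problem itself was never acknowledged, whereas a host problem outside any downtime stays "Unacknowledged" until someone acknowledges it.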
