[op5-users] merlin troubleshooting
Andreas Ericsson
ae at op5.se
Wed Oct 14 16:33:56 CEST 2009
On 10/14/2009 02:36 PM, Russell Jennings wrote:
> well, by "not working" i mean, the noc isn't updating nagios correctly
> that a host is up, which it should know from the poller. in my nagios,
> it still shows my given test host as "down" even though it should
> update it as UP. When running latest on the poller, this is what
> happens. rolled back to 6.2b4 and it works.
>
> I AM running two different versions. But this is out of need, as the
> latest on both NOC and Poller doesn't work, and is where i get that
> error message. It seems downgrading the poller is the only way i can
> get data to flow again.
>
> as is, everything seems to be in harmony, at least as far as merlin
> goes. Neither the daemon logs nor NEB (on either poller or noc) are
> spitting out any heavy messages. But, this is only when the poller
> version is old. When i run a later version, doesn't even need to be
> THAT much later, things just stop working, and both logs on NOC and
> Poller seem to have more errors. not sure what's a big deal in the
> logs and whats not, i imagine level 6& 7 messages are all normal, and
> things like 4&3's are concerning errors?
>
That's correct.
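
As a rough illustration of that rule of thumb, here is a minimal sketch. It assumes Merlin's numeric log levels follow the standard syslog severities, which the numbers you mention suggest but which I'm stating as an assumption. The names `SYSLOG_LEVELS` and `is_concerning` are made up for this example and are not part of Merlin.

```python
# Standard syslog severity numbers. Assumption: Merlin's log levels map onto these.
SYSLOG_LEVELS = {
    0: "emerg", 1: "alert", 2: "crit", 3: "err",
    4: "warning", 5: "notice", 6: "info", 7: "debug",
}


def is_concerning(level: int) -> bool:
    """Return True for warnings (4) and anything more severe (lower numbers)."""
    if level not in SYSLOG_LEVELS:
        raise ValueError(f"unknown log level: {level}")
    return level <= 4


# Levels 6 and 7 are informational/debug chatter; 3 and 4 deserve a look.
for lvl in (3, 4, 6, 7):
    print(lvl, SYSLOG_LEVELS[lvl], "concerning" if is_concerning(lvl) else "normal")
```
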
> So what should I make of this? I have a poller who is fine on 6.2b4,
> but anything later (haven't pinpointed the exact point it breaks at
> with versions) does not work. Could this be an actual problem (in
> merlin), or is it more likely that though one way or the other, the
> fault is with that particular server's config?
>
It's a problem with Merlin. The problem is rather straightforward
actually. We disabled registering the module for host and service
check results and rely solely on host/service status update events
instead, but the receiving end of the module has no knowledge of
such events and therefore cannot handle the results it receives.
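
To illustrate the failure mode, here is a minimal sketch. It is not Merlin's actual code: the event names and the `make_receiver` helper are hypothetical and only model a receiver that silently ignores event types it was never taught about.

```python
# Hypothetical event type names, not Merlin's real identifiers.
HOST_CHECK = "host_check"
SERVICE_CHECK = "service_check"
HOST_STATUS = "host_status"
SERVICE_STATUS = "service_status"


def make_receiver(known_types):
    """Build a receiver that only applies events whose type it knows."""
    state = {}    # object name -> last applied status
    dropped = []  # events the receiver could not handle

    def receive(event_type, name, status):
        if event_type not in known_types:
            dropped.append((event_type, name, status))
            return False
        state[name] = status
        return True

    return receive, state, dropped


# A receiver that only understands check results.
receive, state, dropped = make_receiver({HOST_CHECK, SERVICE_CHECK})

receive(HOST_CHECK, "testhost", "DOWN")  # check result event: applied
receive(HOST_STATUS, "testhost", "UP")   # status update event: dropped

print(state)    # the host still shows as DOWN
print(dropped)  # the UP update never reached the state table
```

In this model the host stays DOWN on the receiving side even though the sender reported it UP, which matches the symptom you describe.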
> aside from just compiling different versions and seeing what works and
> what doesn't, is there anything else i can do? or should i just try
> running latest and post relevant errors from logs?
>
If what you're using now works for you, stick with that, and I'll let you
know in this thread when it is safe to upgrade Merlin again.
> I am just trying to get a grasp of what i can/should do here. I know
> being stuck on an older version is bad, but i am not sure what i can
> do to help correct this.
>
Actually, looking at the logs from back then shows me it isn't all that
bad. The major changes between those versions have to do with Merlin's
database support, so if the NOC is running the latest version you
should be just fine.
--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.