[op5-users] merlin troubleshooting
Russell Jennings
russ at geekwhiz.com
Wed Oct 14 16:55:33 CEST 2009
Andreas,
Thank you very much for your response. It's comforting knowing that
i'm not crazy (or a complete idiot). Aye, everything looks ok now
otherwise, so i'm in no rush to upgrade the poller. though if(when?)
you guys get merlin to the point of centralized configs that get auto-
synced to nodes... I will be scrambling to update everything at that
point ;)
Thanks,
Russell
On Oct 14, 2009, at 10:33 AM, Andreas Ericsson wrote:
> On 10/14/2009 02:36 PM, Russell Jennings wrote:
>> well, by "not working" i mean, the noc isn't updating nagios
>> correctly
>> that a host is up, which it should know from the poller. in my
>> nagios,
>> it still shows my given test host as "down" even though it should
>> update it as UP. When running latest on the poller, this is what
>> happens. rolled back to 6.2b4 and it works.
>>
>> I AM running two different versions. But this is out of need, as the
>> latest on both NOC and Poller doesn't work, and is where i get that
>> error message. It seems downgrading the poller is the only way i can
>> get data to flow again.
>>
>> as is, everything seems to be in harmony, at least as far as merlin
>> goes. Neither the daemon logs nor NEB (on either poller or noc) are
>> spitting out any heavy messages. But, this is only when the poller
>> version is old. When i run a later version, doesn't even need to be
>> THAT much later, things just stop working, and both logs on NOC and
>> Poller seem to have more errors. not sure what's a big deal in the
>> logs and whats not, i imagine level 6& 7 messages are all normal,
>> and
>> things like 4&3's are concerning errors?
>>
>
> That's correct.
>
>> So what should I make of this? I have a poller who is fine on 6.2b4,
>> but anything later (haven't pinpointed the exact point it breaks at
>> with versions) does not work. Could this be an actual problem (in
>> merlin), or is it more likely that though one way or the other, the
>> fault is with that particular server's config?
>>
>
> It's a problem with Merlin. The problem is rather straightforward
> actually. We disabled registering the module for host and service
> check results and rely solely on host/service status update events
> instead, but the receiving end of the module has no knowledge of
> such events and therefore cannot handle the results it receives.
>
>> aside from just compiling different versions and seeing what works
>> and
>> what doesn't, is there anything else i can do? or should i just try
>> running latest and post relevant errors from logs?
>>
>
> If what you're using now works for you, use that and I'll let you know
> in this thread when it should be ok to upgrade merlin versions again.
>
>> I am just trying to get a grasp of what i can/should do here. I know
>> being stuck on an older version is bad, but i am not sure what i can
>> do to help correct this.
>>
>
> Actually, viewing the logs from back then shows me it isn't all that
> bad. The major changes between those versions has to do with merlin's
> database support, so if the NOC is running the latest version you
> should be just fine.
>
> --
> Andreas Ericsson andreas.ericsson at op5.se
> OP5 AB www.op5.se
> Tel: +46 8-230225 Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.
> _______________________________________________
> op5-users mailing list
> op5-users at lists.op5.com
> http://lists.op5.com/mailman/listinfo/op5-users
More information about the op5-users
mailing list