[op5-users] troubleshooting merlin
Russell Jennings
russ at geekwhiz.com
Mon Oct 5 14:54:10 CEST 2009
Well, to clarify, only one VM is on at a time. They are alike in all
other regards, just different OSes, and one works and one doesn't.
When I shut down one and power up the other, everything starts
working. Which leads me to believe it is something with the
CentOS node in some way...
On Oct 5, 2009, at 8:46 AM, Andreas Ericsson wrote:
> On 10/05/2009 02:36 PM, Russell Jennings wrote:
>> So,
>>
>> I created a new VM (CentOS) and installed Nagios and then Merlin in it.
>> The problem I'm having is that the NOC isn't getting data from the
>> poller, or at least it's not ending up in Nagios.
>>
>> I have been tailing both logs, on the NOC and on the poller.
>>
>> Sometimes this message pops up and I have to restart the one complaining:
>>
>> in read_event: incomplete header read(). expected 64, got 0.
>> read() from NOC node noc1 failed: success
>> (Also, this line is confusing, with the "failed: success".)
>>
>
> "success" here means read(2) didn't return with an error code, but
> that it did get a zero read from the socket, which we consider an
> error since it usually means that the other end has silently gone
> away.
>
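
To illustrate the zero-length read described above, here is a minimal Python sketch. It is not Merlin's code (which is not shown in this thread); the local socket pair standing in for the NOC/poller connection and the helper name `describe_read` are assumptions made for the example. When the peer has closed its end, `recv()` returns zero bytes without raising an error, so the error number is still 0, which glibc typically renders as "Success".

```python
import os
import socket


def describe_read(sock, expected=64):
    """Read up to `expected` bytes and return (bytes_read, status_text).

    Hypothetical helper: a zero-length result is reported with the
    text for error number 0, since no actual error occurred.
    """
    data = sock.recv(expected)
    if not data:
        # recv() itself succeeded, but the peer has closed the connection.
        return len(data), os.strerror(0)
    return len(data), "ok"


if __name__ == "__main__":
    near, far = socket.socketpair()
    far.close()  # the other end "silently goes away"
    got, status = describe_read(near)
    print("expected 64, got %d. read() failed: %s" % (got, status))
    near.close()
```

The point of the sketch is only that "zero bytes, no error" and "error" are two different outcomes of a read, and a caller that prints the error text for the first case ends up with a message like "failed: success".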
>> other lines...
>> new connection inbound when one already exists. dropping old.
>>
>> and in the poller:
>> recv(7, (buf + total), 391, MSG_DONTWAIT | MSG_NOSIGNAL) returned -1
>> (resource temporarily unavailable)
>> Bogus read in proto_read_event() got -1 expected 524
>> read() from NOC noc1 failed: Resource temporarily unavailable
>>
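
The poller-side lines above show `recv()` being called with `MSG_DONTWAIT` and returning -1 with "Resource temporarily unavailable" (EAGAIN). On a non-blocking read that generally just means no data was ready at that moment. The following Python sketch shows the distinction; it assumes a POSIX system where `socket.MSG_DONTWAIT` is available, uses a local socket pair in place of the real connection, and `try_recv` is a hypothetical helper name. How Merlin itself treats this case is not something this thread establishes.

```python
import errno
import socket


def try_recv(sock, nbytes):
    """Non-blocking read: return the bytes received, or None if the
    socket simply has no data available yet (EAGAIN/EWOULDBLOCK)."""
    try:
        return sock.recv(nbytes, socket.MSG_DONTWAIT)
    except BlockingIOError as exc:
        # Same condition the log reports as "Resource temporarily unavailable".
        if exc.errno in (errno.EAGAIN, errno.EWOULDBLOCK):
            return None
        raise


if __name__ == "__main__":
    near, far = socket.socketpair()
    print(try_recv(near, 391))  # nothing has been sent yet
    far.sendall(b"event")
    print(try_recv(near, 391))  # data is now waiting
    near.close()
    far.close()
```

Note the three distinct results a caller can see: `None` (try again later), a non-empty byte string (data), and an empty byte string (the peer closed the connection).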
>> Now, the frustrating thing is that I have another VM running Debian (as
>> the node) with the same config, and it works fine. Both have the same
>> IP and the same config, so when I bring one down and the other up, it
>> works.
>>
>> So at this point I have no idea how to proceed. I would like to sort out
>> WHY this doesn't work, because the new VM I created is going to be (or
>> was going to be) the deploy node. However, I am out of ideas as to what
>> to do or try.
>>
>
> Well, if you run the two nodes simultaneously you will of course get an
> IP conflict in your network, so the behaviour of any network transport
> protocol can only be described as "undefined" when that happens.
>
> --
> Andreas Ericsson andreas.ericsson at op5.se
> OP5 AB www.op5.se
> Tel: +46 8-230225 Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.