[op5-users] Réf. : Re: Réf. : Re: Réf. : Re: merlind crash after loosing mysql connection

nicolas.raspail at bnpparibas.com nicolas.raspail at bnpparibas.com
Tue Sep 1 14:32:27 CEST 2009


op5-users-bounces at lists.op5.com wrote on 01/09/2009 13:46:07:

> nicolas.raspail at bnpparibas.com wrote:
> > op5-users-bounces at lists.op5.com wrote on 31/08/2009 17:04:24:
> > 
> >> nicolas.raspail at bnpparibas.com wrote:
> >>> p5-users-bounces at lists.op5.com wrote on 31/08/2009 10:38:27:
> >>>
> > 
> > <snip> 
> > 
> >> Ok. In that case it's not a configuration error. Can you try using 
the
> >> latest git snapshot (download it directly from git for simpler 
updates)
> >> and see if that solves this particular problem?
> >>
> >> The latest core code changes can be found in v0.6.2-beta11.
> >>
> >> Thanks for your reports. I really appreciate them :-)
> >>
> > 
> > Hi,
> > 
> > I have just compiled and installed the tarball from the git commit 
> > b0703a40b91d39b57d84d52ddb81a8e34933c362. I have modified the 
> > gen-version.sh script to add 
> > DEF_VER=v0.6.2-b0703a40b91d39b57d84d52ddb81a8e34933c362 
> > 
> > * When I start merlind, I get the following messages
> > 
> > [1251803549] 6: Initializing IPC socket 
> > '/bnp/apps/nagios/merlin/ipc.sock' for daemon [1251803549] 6: 
> > dbi_conn_query_null(): Failed to run [SELECT host_name, current_state, 

> > state_type FROM merlindb.host ORDER BY host_name]: no database 
> > connection. Error-code is 7 [1251803549] 3: Attempting to reconnect to 

> > database [1251803549] 6: Successfully ran the previously failed query 
> > [1251803550] 6: Primed object states for 0 hosts and 14639 services 
> > [1251803550] 6: Merlin daemon 
> > v0.6.2-b0703a40b91d39b57d84d52ddb81a8e34933c362 successfully 
initialized 
> > [1251803550] 6: Accepting inbound connection on ipc socket 
> > 
> > * After that, a large number (41) of php importer process are launched
> > 
> > [1251803550] 6: Executing import command 'php 
> > /bnp/apps/nagios/merlin/import.php 
> > --nagios-cfg=/bnp/apps/nagios/etc/nagios.cfg 
> > --cache=/bnp/apps/nagios/var/objects.cache --db-name=merlindb 
> > --db-user=merlin --db-pass=xxx --db-host=eqd-nagios-sql' 
> > <40 times>
> > [1251803554] 6: Handled 86 ipc events in 4.326 seconds
> > [1251803566] 6: Executing import command 'php 
> > /bnp/apps/nagios/merlin/import.php 
> > --nagios-cfg=/bnp/apps/nagios/etc/nagios.cfg 
> > --cache=/bnp/apps/nagios/var/objects.cache --db-name=merlindb 
> > --db-user=merlin --db-pass=xxx --db-host=eqd-nagios-sql' 
> > [1251803566] 6: Handled 6 ipc events in 0.006 seconds
> > [1251803566] 6: Handled 2 ipc events in 0.003 seconds
> > [1251803566] 6: Handled 128 ipc events in 0.083 seconds
> > [1251803566] 6: Handled 1 ipc events in 0.000 seconds
> > 
> > I have stopped merlind because the load on the server was very high.
> > Several minutes after I have stopped the merlind process, I can see 
php
> > running, but they finally disappeared.
> > 
> > I did not can test the reconnection after a failover of my mysql 
server
> > because of this new merlind behaviour :)
> > 
> 
> Can you please send me the log from your eventbroker module as well
> (the one call neb.log in the example config file)?
> 
> What I think is happening is that your config is simply so large that
> the import takes far too much time. I'll add a check to make sure we
> aren't running one import while another one's working.
> 

Hi

unfortunately, a wrong permission on the logs directory prevent nagios
to write the neb.log. Maybe a warning in nagios.log from the module would
be a nice feature :)

But I have corrected the permission, enabled again merlind for some
minutes and stopped it. And I have attached the file in this email.

Regards

Nicolas





This message and any attachments (the "message") is
intended solely for the addressees and is confidential. 
If you receive this message in error, please delete it and 
immediately notify the sender. Any use not in accord with 
its purpose, any dissemination or disclosure, either whole 
or partial, is prohibited except formal approval. The internet
can not guarantee the integrity of this message. 
BNP PARIBAS (and its subsidiaries) shall (will) not 
therefore be liable for the message if modified. 
Do not print this message unless it is necessary,
consider the environment.

                ---------------------------------------------

Ce message et toutes les pieces jointes (ci-apres le 
"message") sont etablis a l'intention exclusive de ses 
destinataires et sont confidentiels. Si vous recevez ce 
message par erreur, merci de le detruire et d'en avertir 
immediatement l'expediteur. Toute utilisation de ce 
message non conforme a sa destination, toute diffusion 
ou toute publication, totale ou partielle, est interdite, sauf 
autorisation expresse. L'internet ne permettant pas 
d'assurer l'integrite de ce message, BNP PARIBAS (et ses
filiales) decline(nt) toute responsabilite au titre de ce 
message, dans l'hypothese ou il aurait ete modifie.
N'imprimez ce message que si necessaire,
pensez a l'environnement.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.op5.com/pipermail/op5-users/attachments/20090901/a977b407/attachment-0001.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: neb.zip
Type: application/zip
Size: 3564 bytes
Desc: not available
Url : http://lists.op5.com/pipermail/op5-users/attachments/20090901/a977b407/attachment-0001.zip 


More information about the op5-users mailing list