<html><head><style type='text/css'>body { font-family: 'Times New Roman'; font-size: 12pt; color: #000000}</style></head><body>Alright, here's what I have. <br><br>--<br>Justin Brehm<br><br>----- Original Message -----<br>From: "Frieder Kundel" <frieder.kundel@gmail.com><br>To: mogilefs@lists.danga.com<br>Sent: Tuesday, April 29, 2008 7:38:12 AM (GMT-0500) America/New_York<br>Subject: Re: Nagios plugin?<br><br>the third check seems to be what I have in mind - currently I monitor the databases with <br> check_mysql_query -H ... -u mog -p .. -d mogilefs -q "select 1"<br><br>Are you willing to share your checkscript?<br>
<br>/fk<br><br><div class="gmail_quote">2008/4/28 Justin H. Brehm <<a href="mailto:jbrehm@icontact.com" target="_blank">jbrehm@icontact.com</a>>:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div>I've had three different checks that I've used and all seem to have flaws. <br><br>First one was a simple TCP port check on the ports that MogileFS has open. This is cool if you want to make sure the daemons are still running, but I noticed that there were cases when a DB could go down and the port remains open. <br>
<br>Next I wrote something that used 'mogtool' to test injections and extractions, however 'mogtool' does way more than I needed it to do and it would also tend to keep retrying in areas if mogile went down making the nagios plugin NRPE timeout.<br>
<br>The last thing that I wrote was script that uses the MogileFS::Client perl modules and does an injection, extraction and I then compares the in/out files size to simply check if we have the same file. This is what we've been using so far, however, I have seen an instance where the database was down and MogileFS::Backend would have a return code of '82' or something in that range and my nagios check was giving me the UNKNOWN status. That was a long night of moving some development databases, so I wasn't up to debugging it that night and haven't revisited yet.<br>
<br>What I'm planning on doing, because most of the problems that I've seen tend to revolve around the database side, will be modifying my last nagios plugin to do a 'select 1' query on the Mogile DB first and if that fails then to alert. At least I'll elimnate that first and then move on to testing whether the trackers are functioning, etc.<br>
<font color="#888888"><br>--<br>Justin Brehm<br>Systems Engineer<br>iContact.com</font><div><div></div><div class="Wj3C7c"><br><br>----- Original Message -----<br>From: "Frieder Kundel" <<a href="mailto:frieder.kundel@gmail.com" target="_blank">frieder.kundel@gmail.com</a>><br>
To: <a href="mailto:mogilefs@lists.danga.com" target="_blank">mogilefs@lists.danga.com</a><br>Sent: Monday, April 28, 2008 10:18:42 AM (GMT-0500) America/New_York<br>Subject: Nagios plugin?<br><br>Hi folks,<br><br>how do you monitor your mogile? Has anyone written a nagios plugin?<br>
<br>Best regards,<br clear="all"><br>Frieder Kundel<br>
</div></div></div></blockquote></div><br><br clear="all"><br>-- <br>Frieder Kundel<br>
</body></html>