<div dir="ltr">I've been thinking is something like this applied on SGE, a tool to monitor when jobs are launched, how much time did it runned, mem and other relevant data. When we've got a lot of scripts and many of them have a simple log system, detecting that something is working wrong or not working at all is quite painful. A basic output like xml or similar should be simple enough to tool authors get some script and create a small-tiny-micro-nano monitor tool adapted for they're needs.<br>
<div class="gmail_extra"><br><br><div class="gmail_quote">2014-05-30 18:39 GMT+01:00 Ryan Lane <span dir="ltr"><<a href="mailto:rlane32@gmail.com" target="_blank">rlane32@gmail.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div dir="ltr">Yeah, that seems like a good idea. In general projects always have the option on setting up more specific tools inside of their own project if the general tool can't easily meet their needs.<br></div><div class="HOEnZb">
<div class="h5"><div class="gmail_extra">
<br><br><div class="gmail_quote">On Fri, May 30, 2014 at 7:30 AM, Yuvi Panda <span dir="ltr"><<a href="mailto:yuvipanda@gmail.com" target="_blank">yuvipanda@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hello!<br>
<br>
I know that we have some form of icinga for Labs in general, but it is<br>
currently pretty dysfunctional and not very useful. Wondering if we<br>
should setup a separate icinga just for toollabs that not just<br>
provides general monitoring for admins, but also monitoring for<br>
individual tools (in a way that's easily customizable by he tool<br>
authors themselves). I wrote up a proposal for this a while ago<br>
(<a href="https://wikitech.wikimedia.org/wiki/User:Yuvipanda/Icinga_for_tools" target="_blank">https://wikitech.wikimedia.org/wiki/User:Yuvipanda/Icinga_for_tools</a>).<br>
I think this will help improve general reliability of all our tools<br>
and infrastructure.<br>
<br>
Thoughts?<br>
<span><font color="#888888"><br>
--<br>
Yuvi Panda T<br>
<a href="http://yuvi.in/blog" target="_blank">http://yuvi.in/blog</a><br>
<br>
_______________________________________________<br>
Labs-l mailing list<br>
<a href="mailto:Labs-l@lists.wikimedia.org" target="_blank">Labs-l@lists.wikimedia.org</a><br>
<a href="https://lists.wikimedia.org/mailman/listinfo/labs-l" target="_blank">https://lists.wikimedia.org/mailman/listinfo/labs-l</a><br>
</font></span></blockquote></div><br></div>
</div></div><br>_______________________________________________<br>
Labs-l mailing list<br>
<a href="mailto:Labs-l@lists.wikimedia.org">Labs-l@lists.wikimedia.org</a><br>
<a href="https://lists.wikimedia.org/mailman/listinfo/labs-l" target="_blank">https://lists.wikimedia.org/mailman/listinfo/labs-l</a><br>
<br></blockquote></div><br><br clear="all"><br>-- <br>Alchimista<br><a href="http://pt.wikipedia.org/wiki/Utilizador:Alchimista" target="_blank">http://pt.wikipedia.org/wiki/Utilizador:Alchimista</a><br>
</div></div>