Boston Linux & Unix (BLU) Home | Calendar | Mail Lists | List Archives | Desktop SIG | Hardware Hacking SIG
Wiki | Flickr | PicasaWeb | Video | Maps & Directions | Installfests | Keysignings
Linux Cafe | Meeting Notes | Blog | Linux Links | Bling | About BLU

BLU Discuss list archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

System monitoring

On Thu, Apr 22, 2004 at 03:39:28PM -0400, Cole Tuininga wrote:
> On Thu, 2004-04-22 at 15:21, dsr at wrote:
> > Have you evaluated mon?
> Nope - do you use it?  If so, what do you like about it?

I've used two dozen systems; mon is the one I keep returning to. It may
be masochism.

The good news: mon works pretty well as a primarily SNMP-based requestor
of values, which then evaluates certain conditions and takes certain
actions. In practice, this comes down to:

mon: ping productionhost
 (OK, it's up and on the network)
mon: run ssh.mon productionhost
 (OK, I can ssh to it)
mon: snmp get diskspace / productionhost
 (OK, it hasn't run out of space on /)
mon: snmp get process named oracle productionhost
 (OK, oracle is running)
mon: run simple-oracle-query productionhost
 (blast, that failed)
 (if it fails again within 1 minute, send pages to...)

If that's the level of functionality you want, this is a good bet. You
can add in web integration of various sorts, but basically, when mon
isn't sending alerts, everything should be good.


BLU is a member of BostonUserGroups
BLU is a member of BostonUserGroups
We also thank MIT for the use of their facilities.

Valid HTML 4.01! Valid CSS!

Boston Linux & Unix /