Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Sun Microsystems

Worldwide Performance/Usage Monitoring Software? 9

Wee asks: "I've got a need to monitor a bunch of Sun Solaris boxes worldwide for various load statistics. Things I need to see are disk I/O, CPU usage, RAM usage, IP traffic, etc. I need reports by the hour, day, week, month and year (as near real-time as possible). The reports have to customizable. I don't have a GUI running, so these stats need to be compiled from the console, and remotely. I can't figure out how to log system/network/load/usage statistics without loading the server in doing so. All the packages I've found so far are either limited or not appropriate in some way. I'd really like to hear how to monitor something without affecting the monitored thing. Since I need pretty precise numbers this is kinda important. I'm sure there's something somewhere that does what I need, but I can't seem to find it. If I have to use multiple tools and collate data then I will, but this wouldn't be ideal. "
This discussion has been archived. No new comments can be posted.

Worldwide Performance/Usage Monitoring Software?

Comments Filter:
  • I recommend Cricket:

    http://www.munitions.com/~jra/cricket/

    which is built around

    http://ee-staff.ethz.ch/~oetiker/webtools/rrdtoo l/

    plus whatever other tools and bits and bobs you find/create/get pointed to by everyone else.
  • If you want to spend bux, hit up HP and get OpenView. It'll monitor anything, anywhere, and HP will set it up for you. (for a premium, of course) You should be able to get a box and the software as a package deal.

    If you want free, I've heard good things about MRTG, bu
    t I don't know how customizable it is.
  • If you want to spend bux, hit up HP and get OpenView. It'll monitor anything, anywhere, and HP will set it up for you. (for a premium, of course) You should be able to get a box and the software as a package deal.

    If you want free, I've heard good things about MRTG, but I don't know how customizable it is.
  • Until recently, I worked on one of the major tools vendor's monitoring
    product. I'll avoid product plugs as I'm biased. There are a number
    of commercial products. HP, Tivoli/IBM, Platinum/CA, Compuware, and
    BMC all have products. There are also some open source packages,
    though I'm less familiar with them. All address much of your problem,
    but none of them will be an out of the box solution. Like as not the
    long term summarizing will remain your problem.

    However, I want to address to some issues that I see in your question,
    so you avoid some of the mistakes I've seen people fall into.

    First, wanting to be "real-time" raises a red flag with me. Be
    careful of wanting to collect data on a very fine granularity. In
    many cases (cpu utilization, run queue length) the numbers are really
    averages over time. Collecting them too often degrades their meaning.
    There's also a trade off between how often you collect data and the
    overhead of collecting it. Give serious thought to how much you
    "care" about short lived perturbations. Would you really do something
    about them? Also think about what the numbers you are collecting
    really mean over the time frames you collect them.

    Second, there is absolutely no way to collect data without impacting
    the system. You can minimize the impact a number of ways. Don't
    collect extraneous data. Use efficient means of collection. Offload
    data analysis and summation to a different machine. But, you can't
    eliminate the overhead altogether. The data is on the machine it's
    on, and that's where you need to get it.

    Third, don't worry too much about precision until you are sure what it
    is you are being precise about. By and large all any product can do
    is collect what the kernel has to offer and maybe add some value in
    terms of summarization and correlation. Give serious thought to what
    you really need to track. The more you understand what the OS and
    machine are up to the better off you are. There are a number of good
    books on tuning and internals.

    Most of all, remember that the point of the OS is to *use* the
    machine. Sure, it's to use it efficiently and fairly. You want to
    detect inefficiency and unfairness as well as any major anomalies, but
    to be fair about the stats, you have to take time to understand what
    the OS is up to and why the folks who wrote it collected the stat in
    the first place. I can't emphasize that point enough.
  • by martin ( 1336 )
    Hi

    I'd recommend Orca (http://www.geocities.com/ResearchTriangle/Thinkta nk/4996/) it does exactly what you want. Its based on rrd (already mentioned) and SE (the performance analysis programming language from SUN). Check out the Performance tuning COlumn of Sunworld (http://www.sunworld.com) as a good resource too.

    martin
  • The ucd-snmp [ucdavis.edu] package can produce the information you're looking for. Someone else has already mentioned mrtg, etc, which can be used to create graphs of the data that you want to collect from the ucd-snmp snmp agent...
  • is the way to go, was at least for me, and if you burrow into its pages, you get to see some real nice monitoring going on, including of water levels, IRC users, squid objects.. you name it.

    It groks SNMP, but, has a real simple way of running a program to return the critical values for a given thing that doesnt happen to run snmp agents.

    It simply handles year/month/week/day graphs without keeping boatloads of data.. the data is simply combined as it gets older, so log files do NOT grow.

    It takes very little cpu. A halfway decent box can monitor a hundred routers and create html graph pages for them.

    A lot of people have written patches & addons etc for it.

    It only costs a CD from cdnow.com to the author. and thats optional.

    Check mine out.. i put this up in 30 minutes. It monitors my eth0 interface, and i dont even run snmpd. My sdsl line [dslreports.com]

"Don't try to outweird me, three-eyes. I get stranger things than you free with my breakfast cereal." - Zaphod Beeblebrox in "Hithiker's Guide to the Galaxy"

Working...