SAM At A Glance
Production Environment
EXAMPLE PAGE captured on 21 Mar 2007 00:35:50

Monitor Level: Critical
Station           Host:Port                          Version     Up Since
station1-prd      dummy-node-1.fnal.gov:12345        v6_0_4_2    05 May 2007 12:30:00
station2-prd      dummy-node-2.fnal.gov:67890        v6_0_4_2    04 Nov 2005 10:00:41

Monitor Level: High
Station           Host:Port                          Version     Up Since
cranky-station
station3-prd      dummy-node-3.imperial.co.uk:00001  v6_0_3_9p1  25 Dec 2006 12:15:00
station4-prd      germany.uni-karlsruhe.de:99999     v6_0_3_9p1  15 Apr 2007 00:00:00

Monitor Level: Normal
Station           Host:Port                          Version     Up Since
stubborn-station
pouting-station
average-prd       dummy-node-4.anytown.edu:98765     v6_0_2_5    01 Dec 2006 15:31:54

Monitor Level: unknown
Station           Host:Port                          Version     Up Since

Server                                                       Host:Port               Version  Up Since
SAMDbServer.cluster_prd:SAMDbServer (Using: samdbs@fnalprd)  server1.fnal.gov:22222  v8_0_7   14 Apr 2007 14:44:04
SAMDbServer.blob_prd:SAMDbServer (Using: samdbs@fnalprd)     server2.fnal.gov:33333  v8_0_7   11 Nov 2007 11:11:11

What it means:
  server has never registered or has died and cannot be reached; OR, a station has only stagers (no station master, no fss) and is therefore unusable
  server is behind a firewall or some other network filter; OR, server is temporarily busy processing another request and could not respond to a ping before timing out; OR, server is hopelessly hung and about to crash
  server is alive and healthy, and can respond to a ping
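
The three conditions in the legend above can be read as a simple decision rule. The following is a minimal sketch of that rule, not the monitor's actual code: the names `Ping`, `Status`, `classify`, and the `stagers_only` flag are hypothetical, and it assumes a ping attempt can be reduced to one of three outcomes (replied, timed out, or unreachable).

```python
from enum import Enum


class Ping(Enum):
    """Hypothetical outcomes of a single ping attempt."""
    REPLIED = "replied"
    TIMED_OUT = "timed out"
    UNREACHABLE = "unreachable"


class Status(Enum):
    """Hypothetical labels for the three legend entries."""
    DOWN = "down"                      # never registered, died, or stagers only
    NOT_RESPONDING = "not responding"  # filtered, busy, or hung
    HEALTHY = "healthy"                # answered the ping


def classify(registered: bool, ping: Ping, stagers_only: bool = False) -> Status:
    """Map a server's registration state and ping outcome to a legend entry."""
    # First legend entry: never registered, cannot be reached, or a station
    # that has only stagers (no station master, no fss).
    if not registered or ping is Ping.UNREACHABLE or stagers_only:
        return Status.DOWN
    # Second legend entry: the ping timed out (firewall, busy, or hung).
    if ping is Ping.TIMED_OUT:
        return Status.NOT_RESPONDING
    # Third legend entry: the server answered the ping.
    return Status.HEALTHY
```

Under these assumptions, a station such as cranky-station, which is listed without a host, version, or start time, would correspond to `classify(False, Ping.UNREACHABLE)`.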

Station   Life Cycle   Monitor Level
woeful    active       ignore
ugly      active       ignore
mutt      active       ignore