As the monitoring team, we need to see things as they happen. This view shows us Nodes and interfaces with problems. Also a full hardware health overview and the graph on the right is a graph of the pollers pulling # of users on our VPN (the break in the graph was poller server repair earlier today) Normally a break or "Dip" in that graph means VPN issues and then i really start working. Critical Applications (DHCP Server Service & Wireless Radius Services begin monitored). Then High Errors and discards for today (Customized to filter out FW's and other appliances that discard packets) ; and Discards this hour for more granularity. Also we have the last 1- config changes, so each morning we can come in and see who did what to what and where. Polling Engine status has been inportant to us in the past, but that will be replaced after we confirm our pollers to be up to speed and an installation of another.
Now, if you were sitting here and i scrolled down.... which does happen from time to time on our big screens.
you would see also
1 :Interfaces with High % usage
2: Custom Property - Tracking Nodes with open tickets/work orders ; & ;
3: a UnDP Table showing Cisco devices with a VTP Mode value other than Transparent (3)
4: A Long list of all dependencies that are setup (but that one is way down there)
Image may be NSFW.
Clik here to view.
The idea here, being anything that breaks will show at the top, real issue or problem area's will also show within our problem fields and the errors/discards for the day. And quick access to config changes and (below that) current open tickets allows us to see if it is yet to be resolved or if a rogue engineer made a change last night and did not tell anyone.
CHEERS!