
|
|||
Monitor The Monitor MTM provides high confidence to NOC teams, letting them know that they are receiving the correct information at all times, proactively identifying issues and knowing how to resolve those issues as quickly as possible, protecting revenue, customer and data streams. The challenges these customers have is knowing that the systems they are reliant upon are running correctly, and in turn providing the information and alerts as required. The Abilisoft solution, via small footprint agents, continuously monitors the existing monitoring solution and provides key information via a web based console on the performance, availability and configuration of critical objects within the enterprise monitoring environment, alerting when appropriate. Abilisoft MTM & Netcool Health Analysis Views are supplied pre-configured for the landscape of Netcool/OMNIbus components - the ObjectServer, Probes, Gateways, WebTop and Process Control. The time taken for events to be delivered to your Netcool users is measured and recorded, enabling you to highlight architectural issues and plan ahead for capacity increases. This information also lets you ensure that you are meeting your event delivery service level agreements. SNMP Event Storms Typically, either a specific device will send the same event repeatedly, many times a second, or a group of devices will start to send a significantly higher number of events than your are expecting. Whatever the exact scenario in your environment, an event storm can be simply defined as the sudden receipt of a higher than normal number of events that may negatively impact upon service performance. The Abilisoft MTM solution will detect such an event storm, and notify you when the storm starts. An event storm is detected either by the number of events in the fault management database from a specific source increasing and breaching the specified threshold, or by the detection of a sudden increase in the Rate Of Change measurement of events from a specific source. The image below shows the number of events received via SNMP increasing and breaking the upper threshold. A pre-determined corrective action can take place once the event storm is detected by Abilisoft MTM, to help reduce the negative performance impact on the rest of the fault management system. An example of this would be to dynamically modify the SNMP collector configuration so that it discards non-critical events. The type of corrective action to take is dependant on your environment, and is customisable to meet your requirement. Measuring E2E Netcool Performance Taking this measurement allows you to assure your entire Netcool® environment in one easy step. It allows you to demonstrate that your Netcool service is functional, and delivering event data to your operators. You will also understand how long it takes for critical event data to reach your operators, and ensure that you stay within event delivery service level agreements that you have with your user community. The latency measurement is taken by injecting synthetic event data into your Netcool Probes, and tracking the movement of these specific events throughout the Netcool® architecture. The event is configured so that it will move through the various Netcool components that you have deployed, and end up in a WebTop display. Abilisoft MTM finds the event at the display layer, and establishes the how long it has taken the event to get there. Once the event has been tracked by Abilisoft MTM, it will automatically clear and delete itself. Abilisoft Rich Internet Application Interface The interface features a dynamic Explorer panel that is hidden by default to reduce the amount of screen real estate that is used. The Explorer panels are activated by moving your mouse near the edges of the screen. The image below shows the Abilisoft interface with multiple Explorer panels activated.
Information can be viewed in multiple ways, allowing each user to use the interface in the way that works best for them. The available views allow you to look at your data organised either by the type of component, the host platform or in groups that you have created. You can group by physical and logical locations, and service and component types, meaning that you can effectively organise things how you want them. Alerting & Escalating The Alerts dashboard view presents you with a view of all of the current Alerts, and any associated Escalations. Summary data showing you how many alerts have been acknowledged, and the distribution of Alert severity is also included. |
|||