SAN Management and Performance Monitoring with Predictive Intelligence
The goal of IntelliMagic Vision is to identify potential issues before they impact applications. This is achieved by an intelligent rating system where workload measurements are combined and compared with the capabilities of the systems. The resulting ratings are shown in dashboards that represent the health of your SAN Storage and Switches.
There are separate dashboards for the storage systems, for virtualization engines such as IBM Spectrum Virtualize (SVC), and for the Fibre Channel switches.
The dashboard indicates infrastructure health by showing the key ratings for that part of the environment. Dashboard ratings are based on the analysis of thousands of underlying data points, making the dashboards a very dense summary of hundreds of charts. The result is that issues and risks are flagged proactively, before application performance degrades.
Benefits and Capabilities of IT Infrastructure Monitoring
Overall Health Status
The dashboards show the summary; drill-downs allow you to explore details. The highest level dashboards are color-coded bubble dashboards, as shown below. The color and size of these bubbles show the ratings for the underlying metrics:
- green bubble – indicates a healthy situation
- yellow bubble – indicates that a problem is developing
- red bubble – indicates more severe risks or issues
With these visual cues, it is extremely easy to see the health status of the entire environment at a glance. IntelliMagic Vision can be set up to send emails automatically when a dashboard contains one or more yellow or red warnings.
The bubble dashboard shown above provides the most compact view, but the rating is actually based on a very detailed level of analysis. To get more information about an issue, you can click on the bubble to get to the next level of detail, where the bubbles are replaced by mini charts that show the metrics over time, as well as the rating and threshold values.
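The color coding described above can be pictured as a comparison of each measurement against a warning threshold and an exception threshold. The following is a minimal sketch of that idea only; it is not IntelliMagic Vision's actual rating algorithm, and the names `Thresholds`, `rate_sample`, and `rate_series` as well as the threshold values used with them are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical threshold pair for a metric where higher is worse."""
    warning: float    # assumed level at which the rating turns yellow
    exception: float  # assumed level at which the rating turns red


def rate_sample(value: float, t: Thresholds) -> str:
    """Rate one measurement as green, yellow, or red."""
    if value >= t.exception:
        return "red"
    if value >= t.warning:
        return "yellow"
    return "green"


def rate_series(values: list[float], t: Thresholds) -> str:
    """Return the worst rating seen over a series of measurements."""
    severity = {"green": 0, "yellow": 1, "red": 2}
    return max(
        (rate_sample(v, t) for v in values),
        key=severity.__getitem__,
        default="green",  # an empty series is treated as healthy here
    )
```

Taking the worst rating over the interval is one simple design choice; a real rating could just as well weigh how often and by how much a threshold is exceeded.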
Root Cause Analysis
When the dashboard shows a problem, you can click on one of the mini charts to get a full version of the chart, which also contains an explanation of the metric and thresholds, as well as recommendations on what could be done to address the issue. The border of the chart is colored in the same way as the bubble dashboard:
- green border means a healthy metric
- yellow is for early warning
- red indicates a larger issue
Each individual chart contains multiple drill-down options to go to the deepest level of detail in any direction. For example, you can drill down to find the cause of the large red circle for back-end write response time in the highest-level dashboard.
There are thousands of pre-defined charts and reports available in IntelliMagic Vision, grouped into logical sets. If you want to show a combination of metrics or filters that is not available out of the box, you can customize the existing charts, or define your own from scratch and add them to your favorite chart set. The thresholds that are used in the rating system are also customizable to fit your situation.
All charts and reports can be exported to CSV, HTML, PDF, PowerPoint and Splunk.
Many metrics are best shown as line or area charts, but some values are better viewed in a different way. The balance chart, for instance, is a great way to show (im)balance. In the example below, the Fibre Channel throughput for each port on an HDS VSP G-Series array is shown.
- the green rectangle shows the standard deviation
- the yellow area shows the minimum and maximum value over the entire period
This chart shows immediately that there is an imbalance between the storage ports: the first port, CL3-A, carries the majority of the workload.
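The statistics behind a balance chart of this kind can be sketched as follows. This is an illustration under stated assumptions, not the product's implementation: the function name `balance_summary` is hypothetical, the standard-deviation rectangle is assumed to span the mean plus and minus one standard deviation, and the sample values are made up.

```python
from statistics import mean, pstdev


def balance_summary(samples_by_port: dict[str, list[float]]) -> dict[str, dict[str, float]]:
    """Summarize per-port throughput samples for a balance-style chart."""
    summary = {}
    for port, samples in samples_by_port.items():
        avg = mean(samples)
        sd = pstdev(samples)  # population standard deviation over the period
        summary[port] = {
            "mean": avg,
            "std_low": avg - sd,   # assumed lower edge of the green rectangle
            "std_high": avg + sd,  # assumed upper edge of the green rectangle
            "min": min(samples),   # lower edge of the yellow area
            "max": max(samples),   # upper edge of the yellow area
        }
    return summary


# Made-up throughput samples in MB/s; only the port name CL3-A comes from the text.
samples = {
    "CL3-A": [300.0, 400.0, 500.0],
    "CL4-A": [40.0, 50.0, 60.0],
}
summary = balance_summary(samples)
busiest_port = max(summary, key=lambda port: summary[port]["mean"])
```

Comparing the per-port means side by side is what makes an imbalance visible: a port whose mean sits far above the others is carrying a disproportionate share of the workload.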
Example: Front-end or Host Adapter
Often the maximum throughput of a SAN storage system’s front-end (host) adapter is less than the combined maximum throughput of its individual ports, so an adapter can become the bottleneck while each port still appears to have headroom. Unfortunately, most tools do not report the cumulative throughput or I/O rates for an entire front-end adapter. IntelliMagic Vision automatically computes the sum of the key metrics for all Fibre Channel ports on a given front-end adapter to provide the utilization of the adapters.
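The aggregation described above amounts to summing port-level metrics per adapter and dividing by the adapter's capability. Here is a minimal sketch, assuming you already have a port-to-adapter mapping and a maximum throughput figure per adapter; the function name `adapter_utilization` and all of its parameters are hypothetical, not part of any product API.

```python
from collections import defaultdict


def adapter_utilization(
    port_throughput: dict[str, float],   # measured MB/s per port
    port_to_adapter: dict[str, str],     # which adapter each port belongs to
    adapter_capacity: dict[str, float],  # assumed maximum MB/s per adapter
) -> dict[str, float]:
    """Sum port throughput per adapter and return utilization as a fraction."""
    totals: dict[str, float] = defaultdict(float)
    for port, mbps in port_throughput.items():
        totals[port_to_adapter[port]] += mbps
    return {
        adapter: total / adapter_capacity[adapter]
        for adapter, total in totals.items()
    }
```

With this view, two ports that each look moderately busy can still add up to an adapter running close to its limit, which per-port reporting alone would not reveal.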