Management, please read and reward your true heroes
Often when you do the right things in performance and capacity management, your work goes unnoticed and unappreciated. In fact, if you are consistently pro-active with your storage performance and capacity management processes, you will have few opportunities to be in the spotlight. This is because pro-active management reduces the number of crises and, consequently, the need for heroic action!
For those who want to live dangerously, consider avoiding the following activities:
- Monitor storage for performance risk. More than just determining whether your storage has exceeded response time goals, also regularly analyze the actual storage system throughput against storage controller capability to understand how close you are to the performance cliff.
- Define Storage Performance Service Level Objectives (SLOs) for response time for different tiers with your business units and closely monitor the SLOs for each tier. Use pro-active analysis to determine whether the SLOs for a tier are endangered and make the appropriate adjustments to preserve SLO compliance (e.g. – adding spindles to the endangered tier).
- Monitor storage for imbalances in the front-end adapters and ports due to improper zoning, insufficient host-ports or improperly configured speeds on the fabric path. Provide recommendations to engineering to resolve imbalances, improve throughput, and reduce the probable impact to SLOs (e.g. – redistributing workload across existing ports to spread traffic more evenly).
- Monitor capacity trends to determine which tiers and pools are going to run out of space and provide regular and timely feedback to the storage management team about needed adjustments.
- Analyze the access density of the hosts to determine if they are utilizing the appropriate storage tier needed to meet SLOs at the lowest possible costs, e.g. move hosts with lots of capacity and very little I/O from higher to lower performance tiers.
While the title of this blog is a bit tongue-in-cheek, the tension between the ego affirming opportunities provided by reactive heroics and the ego depriving reality of proactive performance and capacity management is real. So why not reward storage management for meeting SLOs? If there is a cost of missing SLOs, then certainly there is a real reward for meeting them. Too often SLOs are only used in the negative sense. Rewarding storage management for identifying and reducing performance risk in the environment is a more positive approach. Providing the right tools and personnel to complete the job properly could make every day heroics routine.
For more information about pro-actively managing your storage performance and capacity please visit https://www.intellimagic.com
RMF/SMF Analysis - Get Ahead of the Curve with Statistical Analysis
This webinar demonstrates how applying statistical algorithms to meaningful metrics will automatically identify significant changes to the z/OS workloads and applications.
Simplify Mainframe Performance Management
IntelliMagic Vision simplifies mainframe performance management and allows analysts to examine their entire z/OS environment in a glance and quickly prevent or resolve application service disruptions.
IBM Technical University, Hollywood 2018 IntelliMagic Sessions
A recap of IBM TechU Hollywood, 2018 with links to all of the presentations and sessions we hosted.