Top ‘IntelliMagic zAcademy’ Webinars of 2023
As year-end approaches I wanted to continue the tradition of looking back on the IntelliMagic zAcademy webinars that resonated most with our mainframe audience.
Since 2020, IntelliMagic zAcademy has offered free, educational z/OS webinars to the mainframe community. In 2023, we reached our 50th zAcademy webinar and covered topics from Mainframe Cost Savings to extracting insights from MQ data, mainframe security through zERT, application performance, and much more. But a top-5 countdown list can only have five.
The Top 5 IntelliMagic zAcademy webinars of 2023, as determined by total registration and attendance numbers, are:
(Honorable Mention) Metro Global Mirror (MGM) Monitoring in GDPS Sites
Okay, I’m going to cheat just a little here. Just narrowly escaping the top 5, was this session where IBM legend Joe Hyde offered practical advice on monitoring the Recovery Point Objective (RPO) and the factors influencing it.
While GDPS automation provides local storage resiliency and remote failover capabilities, it is the customer’s responsibility to supply adequate hardware infrastructure aligned with their business needs, especially concerning data recency at the remote site in case of a failure.
By monitoring performance data, participants can gain assurance that business requirements are met and can proactively make changes if issues arise, ensuring a resilient and optimized IT infrastructure.
#5) How Mainframe Performance Teams are Solving Their Skills Gap Challenges
Kicking off the actual top 5 list is a webinar that covered a topic on everyone’s mind: the mainframe skills gap. In this webinar, Brent Phillips and Todd Havekost discussed how several mainframe sites are addressing the imminent retirement of experienced mainframe personnel.
This recording is highly recommended for any and all sites experiencing the skills gap or trying to proactively avoid it.
#4) Where Are All The Performance Analysts? – A Mainframe Roundtable
Launching our foray into the ’roundtable’ business, our 4th most attended session of the year was truly a meeting of the minds. With the likes of Martin Packer, Frank Kyne, Dave Hutton, and Jim Horne joining IntelliMagic’s own Todd Havekost and John Baker, these guys touched on everything performance – from costs, to labor, how to speak to management, AI, and much more.
Getting a group of experts like this on a single call is truly and honor and spectacle so if you haven’t already, I highly recommend viewing or listening to the recording.
#3) From Taped Walls to Your PC: z/OS Configuration Made Simple
z/OS Performance and Configuration data is very useful for understanding complicated issues and solving problems, but sometimes it is hard to fully grasp what the data is showing us. Traditional methods of viewing and understanding our z/OS configuration often involved physically taping the mapped-out-topology to office walls – something not feasible or desirable in today’s massively complex environments.
In our third most viewed webinar of the year, John Ticic and Todd Havekost discussed and demonstrated a breakthrough new method of interpreting and interacting with the LPAR, FICON, and Sysplex topologies.
#2) Oh Where Performance Will Take You: A Mainframe Roundtable
Our second ever zAcademy Roundtable hosted the likes of Cheryl Watson and Craig Walters amongst a rockstar group of performance and capacity analysts. So there’s no surprise that this comes in at #2 on our list.
Moderated again by John Baker, this roundtable event explored the journey of several renown mainframe performance analysts and offered insights and perspectives on timeless topics.
Panelists included:
- Cheryl Watson, Watson & Walker
- Craig Walters, IBM
- Dave Barry, UPS
- Jon Ulrich, HCSC
- Todd Havekost, IntelliMagic
#1) Unraveling the z16: Understanding the Virtual Cache Architecture and Real-World Performance
By and far the #1 most viewed zAcademy session of 2023 (and all time) was this session presented by John Baker and Todd Havekost.
During this ground-breaking discussion, John and Todd discussed the revolutionary changes brought about by the z16 processor architecture, and walked through the results of numerous recent upgrade analysis’ with surprising results.
With the introduction of virtual cache at levels 2, 3, and 4, the z16 marks the most substantial transformation in the z processor architecture since the z13. For any site who is considering migrating to the z16 (or already has), this is truly a can’t miss session on what results you can likely expect and how you can verify you received (and are receiving) the expected results.
Looking Towards 2024
If you haven’t yet watched any of the live sessions or recordings of this year’s zAcademy sessions, or if you’re a super-learner that signs up for every session, remember that all zAcademy webinars and recordings – past and future – can be accessed at www.intellimagic.com/zacademy/
IntelliMagic zAcademy will continue into 2024 with exciting insights and deep dives into several new areas of the z/OS mainframe. If you have a favorite session you want to tell us about, have questions about any of the material you saw, or if you have a recommendation on a topic you want us to cover in the future, send us a note at info@intellimagic.com, and we’ll get back to you!
Thanks for watching – tune in next year!
This article's author
Share this blog
Related Resources
News
What's New with IntelliMagic Vision for z/OS? 2024.2
February 26, 2024 | This month we've introduced changes to the presentation of Db2, CICS, and MQ variables from rates to counts, updates to Key Processor Configuration, and the inclusion of new report sets for CICS Transaction Event Counts.
Webinar
New to z/OS Performance? 10 Ways to Help Maintain Performance and Cost | IntelliMagic zAcademy
This webinar will delve into strategies for managing z/OS performance and costs. You'll gain insights into key metrics, learn how to identify bottlenecks, and discover tips for reducing costs.
Webinar
Integrating Mainframe Resource Consumption into the Business | IntelliMagic zAcademy
Discover how to apply FinOps in the mainframe ecosystem to better align IT resources with business objectives, optimizing costs and decisions.
Book a Demo or Connect With an Expert
Discuss your technical or sales-related questions with our mainframe experts today
Banco do Brasil Ensures Availability for Billions of Daily Transactions with IntelliMagic Vision
Company Overview
Banco do Brasil, with over 87,000 employees, 5,000 branches, and 81 million customers, is one of the largest banks in the world, processing over 15 billion transactions per day. Headquartered in Brasilia, Brazil, Banco do Brasil provides commercial and government services as well as a large variety of consumer services, including bill payment services, ATM loans, and checking, savings, and investment accounts.
> 87,000 employees | > 5,000 branches | > 81 millions of customers | > 15 billion transaction per day |
The Challenge
As the second largest financial services company in Latin America, Banco do Brasil has one of the largest and most complex IT infrastructure environments in the world. As a publicly owned bank with billions of daily transactions and millions of customers, there is no margin for system downtime or application disruptions.
Previous solutions to manage performance and conduct capacity planning were cumbersome, slow, required manual coding, and were not interactive or easy to train new hires on. They needed a solution that would allow them to keep up with modern demands, rising transactions, and expanding data volumes.
The Solution
For more than a decade, Banco do Brasil has used IntelliMagic Vision to monitor and manage the performance and availability of their entire end-to-end z/OS and SAN infrastructure environments.
“We use IntelliMagic Vision for z/OS on a daily basis to investigate bottlenecks and analyze performance problems. We also use IntelliMagic Vision for z/OS to improve our system and storage designs and better understand our environment. IntelliMagic Vision has been extremely helpful in post-mortem analysis.”
– Fabio Pereira, Banco do Brasil, Storage Manager
Banco do Brasil uses IntelliMagic Vision for z/OS Systems, CICS, Db2, Disk & Replication, Virtual Tape, as well as SAN Storage, Fabric, and VMware.
IntelliMagic Vision also met the organization’s core requirements with its:
- Built-in health insights to proactively avoid disruptions
- Extensive drill down capabilities
- Code free report builder
- Capacity Planning
- Intuitive graphical user interface
Business Results
IntelliMagic Vision enabled Banco do Brasil to streamline its performance management and capacity planning and enhance its overall business operations.
With IntelliMagic Vision, Banco do Brasil was able to:
- Proactively highlight and prevent potential availability issues
- Eliminate redundant tooling and use a single interface across infrastructure areas
- Reduce mean-time-to-resolution for problems
- Enhance communication and cooperation amongst different teams
To learn more about how Banco do Brasil uses IntelliMagic Vision, view the full review on TrustRadius.
Book a Demo or Connect With an Expert
Discuss your technical or sales-related questions with our mainframe experts today
Interactive FICON Topology Viewer Spotlights Configuration Errors
Having an accurate picture of FICON topology is essential for identifying configuration errors or sub-optimal configuration within the z/OS infrastructure. With the release of 12.5.0, IntelliMagic Vision introduced an interactive FICON Topology Viewer. Using the FICON Topology Viewer, performance analysts can now visualize, and interact with, their entire FICON infrastructure.
Until now, mainframe analysts hoping to achieve this visualization have relied upon manually printed, static visualizations of their topology – often taped to a wall – in order to evaluate their FICON configuration and spot errors. IntelliMagic Vision users are now able to save countless hours of manual examination in their analysis.
Use Cases
The FICON Topology Viewer helps analysts identify configuration errors, ensure that the infrastructure is configured correctly, and reveal undesirable infrastructure changes.
Example use cases include:
- FICON Channel Speed is Auto-Negotiated To A Lower Speed: This issue often occurs when a component in the FICON infrastructure cannot run at the faster speed. Using the FICON Topology view, analysts can look at the individual ports/connections for the CEC, FICON Switch and Disk, and determine where the problem is. This may be a microcode issue, a hardware component problem or simply a configuration issue.
- LPARs Running at a Different FICON Speed to the Same Disk/Tape Units: The FICON Topology view allows you so see if all of the LPARs and connections are running at the same and desired speed.
- Verify All LPARs Have Desired Number Of FICON Connections to the Specific Device: Typically, FICON connections to Disks and Tapes are defined for not only performance and throughput, but also availability. The FICON Topology Viewer allows analysts to verify that all LPARs have the desired number of FICON connections to the specific device.
- Verify the Infrastructure is Correctly Defined For Emergency Site Fail-Over: The FICON Topology view allows analysts to verify that the primary and secondary disk systems have the same infrastructure configuration on both sites.
- Verify the Configuration and Optimize Component Usage: Drilldowns are available in the FICON Topology Viewer to show specific charts for that specific component. For example: drilling down on a Disk can show the front-end adapter utilization in a very intuitive min/max/average chart for all of the adapters. This allows analysts to verify that all the components are being used and have a similar utilization.
- Identify System Outages or Offline FICON Channels: The time-selection and compare feature within the FICON Topology Viewer allows analysts to identify issues such as FICON channels being put offline (possibly due to error conditions) or system outages (LPAR IPLs).
The video below demonstrates the FICON Topology Viewer and how access to interactive data with the FICON topology enables analysts to easily spot changes and assess their configuration.
You Might Also Be Interested In:
News
What's New with IntelliMagic Vision for z/OS? 2024.2
February 26, 2024 | This month we've introduced changes to the presentation of Db2, CICS, and MQ variables from rates to counts, updates to Key Processor Configuration, and the inclusion of new report sets for CICS Transaction Event Counts.
Webinar
New to z/OS Performance? 10 Ways to Help Maintain Performance and Cost | IntelliMagic zAcademy
This webinar will delve into strategies for managing z/OS performance and costs. You'll gain insights into key metrics, learn how to identify bottlenecks, and discover tips for reducing costs.
Webinar
Integrating Mainframe Resource Consumption into the Business | IntelliMagic zAcademy
Discover how to apply FinOps in the mainframe ecosystem to better align IT resources with business objectives, optimizing costs and decisions.
Book a Demo or Connect With an Expert
Discuss your technical or sales-related questions with our mainframe experts today
Benefits of Analysis Across SMF Data Types
Escaping Data Silos Within SMF Data Types
Mainframe performance analysts rely heavily on the great insights provided by SMF measurement data into each component of the z/OS ecosystem. While there is extensive interaction and interdependence across many of the z/OS components, analysis of various SMF data types often relies on tooling that is unique to each data type.
Unfortunately, this has formed a barrier to collaborating on performance analysis across disciplines.
This article will show examples of how performance analysts can become more effective through having visibility into multiple types of SMF data.
Examples cited in this article are based on SMF data from:
- WLM and CICS
- Address Space and Db2 Accounting
- CICS and Db2
- MQ and CICS
Hopefully these scenarios will stimulate your thinking to identify many other situations where analysis performed by your teams can benefit from collaboration and using SMF data across disciplines.
No matter which subsystem you are primarily responsible for, we hope this article will help you blur the boundaries between the SMF ‘silos’ for each product.
IntelliMagic highly recommends subscribing to the quarterly Watson Walker Tuning Letter for the best z/OS technical articles in the industry.
Translating Application Performance Data into Business Outcomes on z/OS | IntelliMagic zAcademy
Get Notified of Upcoming Webinars
In a world where applications are at the forefront, it can be difficult to know which applications are critical to your business and how to protect them. The z/OS system manages a large number of applications, some of which are vital to the success of your business.
Performance analysts need to understand both the business and operational aspects of all applications to ensure z/OS manages them optimally. A key to a performance analyst’s effectiveness is to be able to translate what the business requires of the z/OS system and what the performance data is telling those in charge of the business.
In this webinar, we explore practical steps for identifying and prioritizing your business-critical applications, and how to optimize and report on their performance. We cover key considerations, such as CPU and Disk performance, defining service classes, and spotting relevant changes in the midst of overwhelming data.
You learn:
- How to identify and prioritize your “most loved” business-critical applications
- How to examine all aspects affecting the applications, including CICS, Db2, and Systems components
- Techniques for ensuring service classes are defined correctly
- Ways to spot and identify relevant changes through data analysis
- Strategies for protecting and reporting on performance
- Reporting the health of applications to those not close to the operation of z/OS
Watch this informative and engaging webinar to learn how to cut through the noise and ensure your mission-critical applications are functioning optimally on z/OS.
Sign Up for our Newsletter
Subscribe to our monthly newsletter and receive great quality content in your inbox on:
- performance tips and best practices from industry experts
- tutorials and walkthroughs
- latest industry news
- valuable resources
- upcoming events
- and more
Complete the Form to Sign Up
Related Resources
News
What's New with IntelliMagic Vision for z/OS? 2024.2
February 26, 2024 | This month we've introduced changes to the presentation of Db2, CICS, and MQ variables from rates to counts, updates to Key Processor Configuration, and the inclusion of new report sets for CICS Transaction Event Counts.
Webinar
New to z/OS Performance? 10 Ways to Help Maintain Performance and Cost | IntelliMagic zAcademy
This webinar will delve into strategies for managing z/OS performance and costs. You'll gain insights into key metrics, learn how to identify bottlenecks, and discover tips for reducing costs.
Webinar
Integrating Mainframe Resource Consumption into the Business | IntelliMagic zAcademy
Discover how to apply FinOps in the mainframe ecosystem to better align IT resources with business objectives, optimizing costs and decisions.
Book a Demo or Connect With an Expert
Discuss your technical or sales-related questions with our mainframe experts today
How to Find Sick But Not Dead (SBND) TS7700 Tape Clusters
Head lights on new cars are really bright and look great for the driver. Because of this, it is nearly impossible to know while driving the car that one of them has stopped working. Everybody else on the road can tell though, and the police will pull you over regardless of the type of car.
Just as headlights can provide excellent visibility, TS7700 grids look great too. Host jobs love the new TS7770s because they are bigger, better, and faster. It’s easy to tell if a host attached VTS has a problem since the host jobs fail or alerts are sent. If a VTS in a remote location completely fails, then an alert will also be sent. But it is much more difficult to detect if it is simply not functioning as intended. Like car headlights, a high availability grid is capable of hiding a sick but not dead cluster.
In this blog we will look at various ways of monitoring a grid to determine if remote clusters are not receiving replication data and why not. IBM provides a whole set of detection monitoring for host attached clusters which are Sick but not Dead (SBND), but these are not discussed here.
What are ‘Sick But Not Dead’ Clusters?
The TS7770 combines the performance of disk-based operations with the capacity and scalability of physical tape to deliver high availability storage. Using virtualization, VTS clusters have replaced the older technology of physical tape drives and media.
Clusters are connected to form a grid which can automate movement of data from local to remote data centers. Data is replicated to multiple clusters to provide high availability and offset storage of business-critical data. Because of the complex nature of a grid, it can be time consuming to verify the availability of each cluster within the grid. Due to the redundancy in the grid, it may not be apparent for many days or weeks that the cluster is not functioning.
Individual VTS clusters only rarely fail, but they can stop performing their intended function. Remote clusters are there to receive data from the clusters in the production data center. If a remote VTS cluster is not able to receive data it has stopped performing its function.
This is what IBM calls Sick But Not Dead (SBND) Clusters.
Increasing Deferred Queue Age and Replication Backlog
Increasing Deferred Queue Age and Replication Backlog is the primary way to determine if remote clusters are not receiving data. Viewing Replication data from Receiving Clusters is a great way of detecting non-functioning clusters. IntelliMagic Vision groups these charts together into a multi-chart as seen below.
- Replication Backlog will increase as data is written to a local VTS and has not been replicated to this cluster.
- Average Deferred Queue Age will show how much time the delay in replication is.
- Inbound Total Copy Data Rate will show how fast data in coming to this cluster.
The amount of data and how long it takes to replicate will vary by site.
IntelliMagic thresholds can be set to allow normal processing to operate in the green. When thresholds are exceeded, data is taking too long to replicate. Depending upon how a site customizes their thresholds, it may take days to reach these thresholds as they are usually set well above normal operating values.
If there is Replication Backlog and the data rate is zero or lower than normal, then network links may need to be reviewed. If data rate is 0, the cluster may be unable to receive data, either because of cache utilization or VTS hardware problem.
Average CPU/Disk Utilization
Another indication of a Sick But Not Dead cluster is when either Average CPU or Disk Utilization is near 0 and stays there while Replication Backlog is increasing.
It is normal for CPU and Disk activity to be low during a period of no tape activity, but if the cluster utilization stays near 0 when data is awaiting replication to this cluster, then it should be investigated.
This chart shows normal activity, but for SNBD clusters, this could drop to near zero activity during periods of heavy workload because the cluster is unable to receive data.
VTS Cache Utilization
VTS Cache Utilization can be an indication as to why the cluster in a remote location is unable to receive data. Obviously if the utilization reaches 100%, there is no space for data. If the cluster has back-end tape, the library, drives, and media should be investigated to ensure they are operational. If there is no back-end tape, then the investigation moves on to why too much data has been retained in the cluster.
If the above chart reaches 100% utilization no data can be written to the VTS.
Possible reasons for cache full conditions
- Growth has used up excess capacity
- I.E. Not enough space in cache or data is not migrating to physical tape
- Erroneous growth caused by host datasets or GDGs which should have expired
- Accidental or non-calculated host data sets kept too long
- Flash Copy Cache is in use
- DR testing is still actively using disk cache
- Back-end tape is not pre-migrating volumes
- Volumes on disk cache cannot be removed if they have not pre-migrated.
- TS7700 management classes updated to keep volume copies on more clusters
- Accidental or non-calculated data copies
Compressed Data In Use
One way to determine if growth is the culprit for high cache utilization is to review the host Tape Management Catalog analyzing active virtual volumes.
The IntelliMagic Vision Tape Volumes report set has many reports summarizing tape volume activity by volume group. If a volume group has increasing used GB, then more data is retained within the tape grid. This view of a year of data can help spot trends and isolate the problem.
Drilldowns to storage groups, system, job and program level or a combination can be done quickly and easily.
All Data Flows In and Out of the VTS’s Cache
This report is a complete view of data movement within a cluster. If a tape attached cluster is not writing to the tape pool, then the back-end tape system can be investigated.
For host attached clusters, Virtual Device Write Throughput shows data actively being written to the cluster. Outbound copy data rate shows data being replicated to other clusters in the grid. Inbound copy data rate shows data being replicated to this cluster. Write Rate to Pool shows data that is being pre-migrated. Various configuration parameters can influence data movement within the grid, but for a healthy grid, data should be flowing as your configuration allows.
Review These Essential Remote Clusters Performance Reports to Find SBND Clusters
It is always a good idea to check your headlights by doing a walk around your vehicle before stepping into the driver’s seat where that broken headlight isn’t even noticeable. Just as it is a good idea to review remote clusters replication performance reports and not just rely on the VTS heartbeat indicating basic communications.
Putting this data into IntelliMagic Vision allows for potential problem indicators to become as clear as the road is when using two headlights. Putting these reports on a dashboard is a simple and easy way to monitor the health of all your VTSs. The below video covers how to review these key metrics and demonstrates how they may appear in a custom dashboard.
Using IntelliMagic Vision for TS7700 Performance Analysis
IntelliMagic Vision for TS7700 automatically compares the hardware views (via BVIR data) with the workload metrics, providing you with insight into how the standalone or gridded hardware is handling the work and replication between boxes.
This article's author
Share this blog
You May Also Be Interested In
News
What's New with IntelliMagic Vision for z/OS? 2024.2
February 26, 2024 | This month we've introduced changes to the presentation of Db2, CICS, and MQ variables from rates to counts, updates to Key Processor Configuration, and the inclusion of new report sets for CICS Transaction Event Counts.
Webinar
New to z/OS Performance? 10 Ways to Help Maintain Performance and Cost | IntelliMagic zAcademy
This webinar will delve into strategies for managing z/OS performance and costs. You'll gain insights into key metrics, learn how to identify bottlenecks, and discover tips for reducing costs.
Webinar
Integrating Mainframe Resource Consumption into the Business | IntelliMagic zAcademy
Discover how to apply FinOps in the mainframe ecosystem to better align IT resources with business objectives, optimizing costs and decisions.
Book a Demo or Connect With an Expert
Discuss your technical or sales-related questions with our mainframe experts today
Are My Remote Clusters Receiving Replication?
When troubleshooting remote cluster issues or trying to determine if your remote clusters are receiving replication, a few key reports will provide most of the insights necessary. These reports include:
- Replication Backlog
- Logical Volumes for Copy
- Average Deferred Queue Age
- Average Immediate Queue Age
- Inbound Total Copy Data Rate
- Average CPU / Disk Utilization
- VTS Cache Utilization
- Compressed Data on Logical or Physical Volumes
- Data Flows in and out of Cache
In this video, we walk through these reports and provide some insight into what to look for when trying to determine if your remote clusters are receiving replication.
Video Transcript
When trying to determine if your remote clusters are receiving replication, there are really just a handful of key reports you should check.
You likely aren’t going to need to check these reports all that often, but it is still important to make sure you’re keeping an eye on them to ensure remote VTS(s) are operating properly and receiving replication data in a timely manner. Deferred copy throttle will delay replication to remote clusters but should only be active during periods of high host activity.
For this video I used IntelliMagic Vision to create a custom dashboard that has these key reports.
The first key report is actually a group of minicharts covering Replication data for Receiving Clusters. I can drill into any of these reports to explore them further, but having them all in this view makes it easy to assess the replication health at a glance.
If deferred copy throttle is not the cause, then if there is Replication Backlog and the data rate is zero or lower than normal, then network links may need to be reviewed. If data rate is 0, the cluster may be unable to receive data, either because of a full cache utilization situation or a possible hardware issue.
IntelliMagic Vision makes it easy to spot warnings or exceptions with it’s built-in ratings that you can see by the colors around the charts.
Another indication of a replication problem is when either Average CPU or Disk Utilization is near 0 and stays there while Replication Backlog is increasing. It is normal for CPU and Disk activity to be low during a period of no tape activity or if deferred copy throttle is slowing replication, but if the cluster utilization stays near 0 when data is awaiting replication to this cluster, then it should be investigated.
VTS Cache Utilization can be an indication as to why the cluster in a remote location is unable to receive data. Obviously if the utilization reaches 100%, there is no space for data. If the cluster has back-end tape, the library, drives, and media should be investigated to ensure they are operational. If there is no back-end tape, then the investigation moves on to why too much data has been retained in the cluster.
This report is a complete view of data movement within a cluster. If a tape attached cluster is not writing to the tape pool, then the back-end tape system can be investigated.
For host attached clusters, Virtual Device Write Throughput shows data actively being written to the cluster. Outbound copy data rate shows data being replicated to other clusters in the grid. Inbound copy data rate shows data being replicated to this cluster. Write Rate to Pool shows data that is being pre-migrated. Various configuration parameters can influence data movement within the grid, but for a healthy grid, data should be flowing as your configuration allows.
The dashboard helps keep all of these reports in a single location, and any user can share the dashboard with their coworkers. If there were any issues, we could simply click on any of the reports and drill down into the root cause.
And there you go. Keep an eye on these reports to determine if your remote clusters are receiving replication or not. I hope this video helped.
Check us out at intellimagic.com to learn more about z/OS performance and IntelliMagic Vision.
You May Also Be Interested In
News
What's New with IntelliMagic Vision for z/OS? 2024.2
February 26, 2024 | This month we've introduced changes to the presentation of Db2, CICS, and MQ variables from rates to counts, updates to Key Processor Configuration, and the inclusion of new report sets for CICS Transaction Event Counts.
Webinar
New to z/OS Performance? 10 Ways to Help Maintain Performance and Cost | IntelliMagic zAcademy
This webinar will delve into strategies for managing z/OS performance and costs. You'll gain insights into key metrics, learn how to identify bottlenecks, and discover tips for reducing costs.
Webinar
Integrating Mainframe Resource Consumption into the Business | IntelliMagic zAcademy
Discover how to apply FinOps in the mainframe ecosystem to better align IT resources with business objectives, optimizing costs and decisions.
Book a Demo or Connect With an Expert
Discuss your technical or sales-related questions with our mainframe experts today
IntelliMagic and Watson & Walker: Partners in Mainframe Optimization
Company Overview
Watson & Walker is a vendor-independent mainframe consultancy firm that provides practical mainframe performance and measurement advice. Founded by Cheryl Watson and Tom Walker in 1991, the Watson & Walker team is one of the best-known and respected z/OS performance and measurement advisory groups in the world. Cheryl Watson’s Tuning Letter, named after the co-founder and mainframe legend, is the most comprehensive body of practical and impartial z/OS tuning and measurement advice ever published.
The Challenge
As the very nature of their business, Watson & Walker provides consulting services for clients spanning the globe, but their existing method of SMF performance analysis required manual coding, wasn’t dynamic or interactive, and couldn’t keep up with the pace of the demands from their work.
Watson & Walker needed a way to quickly navigate and analyze the mainframe environment in order find performance optimization and cost reducing opportunities for their clients.
Watson & Walker embarked on a research project to find the perfect SMF performance analysis tool available on the market. To meet Watson & Walker’s strict criteria, their chosen solution needed to:
- Be powerful, robust, and interactive
- Save time conducting analysis
- Offer great customer support
- Be up to date on the latest mainframe technology and metrics
The Solution
After analyzing the solutions on the market and consulting with customers from each of the solutions, Watson & Walker decided upon IntelliMagic Vision for z/OS. Since 2016, Watson & Walker has used IntelliMagic Vision for z/OS with their consulting efforts when helping customers and generating reports.
“At the conclusion of our research, we found that IntelliMagic Vision, without a doubt, is heads and shoulders above anything else. We wouldn’t go anywhere without IntelliMagic Vision; it is powerful, easy to use, and is now a vital part of our toolkit when creating presentations, classes, Tuning Letters, or customer reports.”
– Cheryl Watson, CEO and Founder, Watson & Walker
Business Results
The partnership between Watson & Walker and IntelliMagic allows mainframe customers to benefit from IntelliMagic Vision’s built-in intelligence and Watson & Walker’s unrivaled services.
Using IntelliMagic Vision, Watson and Walker was able to:
- save significant time by applying their experience to the analysis, not in trying to extract and organize the information out of SMF
- more quickly and accurately spot tuning opportunities for cost reduction and performance optimization
- take advantage of the built-in health insights to identify warnings and issues prior to their deep-dive analysis
“What I liked most about IntelliMagic Vision was its automatic identifying of the key performance indicators, the KPIs, and the ability to drill down. The whole idea of looking at this boundless information from SMF, is that you don’t want to spend time looking at all that information. All you want to know is, is everything okay or is something wrong? So the ability to drill down, look at only one of the items instead of multiple is pretty powerful and takes no programming.”
Related Resources
News
What's New with IntelliMagic Vision for z/OS? 2024.2
February 26, 2024 | This month we've introduced changes to the presentation of Db2, CICS, and MQ variables from rates to counts, updates to Key Processor Configuration, and the inclusion of new report sets for CICS Transaction Event Counts.
Webinar
New to z/OS Performance? 10 Ways to Help Maintain Performance and Cost | IntelliMagic zAcademy
This webinar will delve into strategies for managing z/OS performance and costs. You'll gain insights into key metrics, learn how to identify bottlenecks, and discover tips for reducing costs.
Webinar
Integrating Mainframe Resource Consumption into the Business | IntelliMagic zAcademy
Discover how to apply FinOps in the mainframe ecosystem to better align IT resources with business objectives, optimizing costs and decisions.
Book a Demo or Connect With an Expert
Discuss your technical or sales-related questions with our mainframe experts today
Closing the Gap on Mainframe Application Profiling | IntelliMagic zAcademy
In today’s age of ‘app’ proliferation and data overload in most areas of z/OS infrastructure performance, one would think we would have access to endless statistics on application behavior. While this is largely true in the distributed space, it remains an elusive goal in our beloved mainframe.
The simple fact is distributed application programmers and mainframe system folks speak very different languages (when they speak to each other at all). DDF transactions don’t ‘look’ like CICS transactions – but they are both important to the business.
In this session, John Baker and Gabe Tully try to break down the barriers between z/OS and distributed systems and communicate specific methods both sides can use to classify workloads properly.
z/OS Health Insights, Ratings, and Thresholds with IntelliMagic Vision
Learn how to automatically highlight any developing problems in your z/OS infrastructure
This white paper demonstrates how IntelliMagic Vision can be used to obtain intelligence about threats in your z/OS infrastructure that are likely to lead to service disruptions for your users and applications or replication health.
The Health Insights will automatically highlight any developing problems. You can configure the thresholds to match the needs of your workload, such that IntelliMagic Vision will become more and more intelligent about your environment as you continue to use and configure the product.
IntelliMagic Vision provides Health Insights for:
- Disk Storage Systems
- Storage Groups
- System and Workload Manager
- Log Streams
- Coupling Facility
- Cross-system Coupling Facility (XCF)
- Db2
- CICS
- MQ
- TCP/IP
- FICON and Channels
- TS7700
Gain insight into how these Health Insights and over 2000 charts can help you identify and solve performance issues.
To download this white paper, please complete the form to the right, and we will send the file to your email.
Overall z/OS Performance & Health Status
The goal of IntelliMagic Vision is to identify potential issues before they impact applications. This is achieved by an intelligent rating system where workload measurements are combined and compared with the capabilities of the systems. The resulting ratings are shown in health insights charts that represent the health of your entire end-to-end z/OS infrastructure.
The health insight reports indicates infrastructure health by showing the key ratings for each part of the environment. Ratings are based on the analysis of thousands of underlying data points, making the health insight charts a very dense summary of hundreds of charts. The result is that issues and risks are flagged proactively before application performance degrades.
The dashboards show the summary; drill-downs allow you to explore details. The color and size of these bubbles show the ratings for the underlying metrics:
- a green icon indicates a healthy situation
- a yellow icon indicates that a problem is developing
- red icons flag more severe risks or issues
With these visual cues, it is extremely easy to see the health status of the entire environment at a glance.
Investigate Performance Details
The health insights chart shown above provides the most compact view, but the rating is based on a very detailed level of analysis. To get more information about an issue, you can click on any icon to get to to the next level of detail. The icons are replaced by several related mini charts that show the metrics over time, as well as the rating and threshold values.
Root Cause Analysis
When the chart shows a problem, you can click on one of the mini charts to get a full version of the chart, which also contains an explanation of the metric and thresholds, as well as recommendations on what could be done to address the issue. The border of the charts is colored in the same way as the health insights chart:
- a green border means a healthy metric
- yellow is for early warning
- red indicates a larger issue
Each individual chart contains multiple drill-down options to go to the deepest level of detail in any direction, for example to find the individual RAID array that was so busy that it caused a large red circle for drive response time in the highest level dashboard.
Customizability
There are thousands of pre-defined charts and reports available in IntelliMagic Vision, grouped into logical sets. If you are interested in showing a combination of metrics or filters that is not available out of the box, you can customize the charts, or define your own from scratch and add it to your favorite chart set. The thresholds that are used in the rating system are also customizable to fit your situation.
All charts and reports can be exported to CSV, HTML, PDF, PowerPoint and Splunk.
Balance Charts
Many metrics are best shown as line or area charts, but some values are better looked at in a different fashion. The variance chart, for instance, is a great way to show (im)balance. In the example below, you see the FICON throughput per port.
- a green dot indicates average throughput for the port
- a green rectangle that shows the standard deviation
- a yellow area that shows the minimum and maximum value over the entire period