Executive Summary
We are in the midst of a rapid growth of IT management systems capability, responding to changing business needs. IT management systems are beginning to address the needs of additional layers of management accountability.
At the same time, they are expanding the range of functionality offered at each layer. Harnessing the full potential of these IT management systems will require significant organizational change.
More fundamentally, the real economic value of these IT management systems will not be realized until we can more effectively create explicit linkages between the performance of elements of the IT infrastructure and the operational and financial performance of the business.
Rather than full transparency of performance data, we need much more selective and highly targeted visibility into the performance of our IT systems, shaped by a deep understanding of the economic leverage points of our business.
Business needs to force major architectural shifts
Around the world, competitive pressures are intensifying. The current recessionary environment in key regions of the world certainly doesn't help matters. But this is isn't about recessions. As painful as they may be, they are only temporary.
Something much more fundamental is going on. Forces that have been at work for decades are reshaping the business landscape. In particular, continued technology innovation and broad-based public policy shifts favoring freer markets are making it easier for companies to challenge each other around the world. The bottom line? A lot more uncertainty, the need for continued cost reduction and the imperative to find new sources of growth to continue to create economic value.
To respond to these needs, senior executives are finding that they must design and manage very different businesses. They seek much more flexibility within their own operations to cope with uncertainty. They also find that they must increasingly rely on other companies to supplement their own capabilities and add more value to their customers. The scope of coordination no longer stops at the edge of the enterprise. In fact, it often extends through several levels of business partners.
Flexibility and collaboration -- that's challenging enough. Layer on top of that the need for real-time management capabilities. This in part is a requirement for flexibility -- companies need to reduce the lead-time required to respond to unanticipated changes in the environment. But, more fundamentally, it is about reducing cycle times across all operations to deliver operating savings and greater customer satisfaction.
These needs are forcing a fundamental shift in IT architectures. No sooner have we started to become comfortable with an "n-tier" architecture than we find that a new architecture is emerging to respond to these new business needs. This new architecture -- a distributed service-oriented architecture -- is only in its earliest stages of development.
Companies are finding that flexibility and efficiency both require that IT resources be made available as services -- accessible anywhere whether these resources are local, in distant reaches of the enterprise, in business partners or located in specialized third party service providers.
We are no longer dealing with IT architectures that are defined by the boundaries of the firm -- to be effective, they need to extend across a broad range of enterprises. This new architecture enables virtualization of resources -- making resources available as needed. In the process, it promotes more efficient utilization of resources since these resources can be allocated dynamically to the areas of highest demand.
IT management systems will need to adapt
This shift in IT architectures poses significant challenges for IT management systems. These management systems will need to evolve rapidly to keep pace with the demands of the new architecture.
Services built upon virtualized and distributed IT resources significantly increase the complexity of problem identification and resolution. At the same time complexity is going up, the time available to address and resolve problems is shrinking. Real-time enterprise operations are highly sensitive to any downtime.
IT management systems will need to adapt to the demands along two very different vectors:
- They will need to steadily expand the scope of management focus -- IT management systems will need to address the needs of multiple levels of decision-makers within the organization, extending from the managers of specific IT elements (e.g., servers, networks, databases, applications) to the COO and CFO
- They will need to expand functionality at each level of management focus -- IT management systems will need to deepen capabilities around reactive problem identification and resolution while at the same time enhancing the ability to anticipate potential issues before they occur
Expanding management focus
We are all familiar with the paradox: we have too much data to effectively use and, when we need it, the right data rarely seems to be available. One way to resolve this paradox is to focus on understanding the specific needs of various types of decision-makers. Rather than trying to provide overall transparency, a much more effective approach is to provide focused visibility into the data that matters the most at the appropriate time. Of course, focused visibility assumes that one clearly understands the needs of various categories of data users. This will be a central challenge for IT management system providers over the next several years.
NSM platforms: IT management systems have evolved significantly already. Emerging from discrete tools to manage performance of specific elements of the IT infrastructure, these systems now provide integrated views of performance across broad segments of the IT infrastructure. These Network and Systems Management (NSM) platforms still focus on the performance of individual elements in the IT infrastructure, but help to aggregate the performance view for managers of IT operations. They are very effective in providing quick alerts regarding performance issues and helping managers of specific IT elements to trouble-shoot the problems.
TSM platforms: Technology Service Management (TSM) platforms expand the scope of management focus. Rather than focusing on the performance of individual elements of the IT infrastructure, these TSM platforms provide views of performance of specific IT services from an end-user perspective.
Take the example of an "available to promise" service provided to the sales force. Many companies run into two issues as the focus shifts to service-level performance. First, what is the expected level of performance for specific services? Service-level agreements (SLAs) help to make these end-user expectations explicit and provide benchmarks against which to measure performance. Second, this service might cross multiple networks, draw on several databases, rely upon many servers, and invoke more than one application along the way. If the service does not offer performance consistent with SLAs, few companies today have the capability to tie the service to all the elements of the IT infrastructure involved in delivering the service. The result is long delays in diagnosing the problem, much less making the service functional again.
The primary user of these platforms is the service "owner" in the IT group who is accountable for delivering performance as specified by an SLA. Many IT management system providers are seeking to address this need today. Two of the most significant opportunities here are IT service view tools and SLA compliance tools. IT service view tools help IT managers to automate the process of identifying the various elements in the IT infrastructure that support delivery of a specific IT service. SLA compliance tools help to automatically monitor SLA-defined performance metrics and send alerts when performance begins to vary outside the prescribed range.
BPPM platforms: But the scope of management focus needs to expand even further. Companies need Business Process Performance Management (BPPM) platforms.These are not platforms to manage the business process itself -- there's a host of Business Process Management platforms available to do that.
Instead, these BPPM platforms are designed to rapidly identify performance issues among the various IT services coming together to support specific business processes.
The "available to promise" service is only one of many services that support a Lead to Order business process. As indicated earlier, real-time enterprise performance demands faster problem diagnosis and resolution -- operational cycle times are compressed, information access demands rise, and operations need to respond more quickly to unanticipated changes. If any of the technology services supporting a business process go down, the economic impact on the business is far greater than in the past.
The user of these platforms would be the operations managers "owning" key business processes and accountable for their smooth functioning. Unlike the two lower layers of management platforms, non-IT managers will use this management platform, and it must be designed to address their needs.
BOPM platforms. There's even a broader scope of management focus. Business Operations Performance Management (BOPM) platforms provide an integrated view of all the resources required to support the operations of a business. Once again, these platforms are not designed to manage the operations -- they provide the capability to monitor financial and operational key performance indicators of the business.
For example, they might help senior management to determine that a growing turnover rate in certain customer segments is due to manufacturing production slippages in specific assembly plants and that these are, in turn, due to the introduction of a new product that has not been appropriately calibrated in the automated assembly equipment. The platform would provide a rapid assessment of the financial impact of these operating issues and help to troubleshoot the source of the problem.
We are no longer talking about narrowly defined IT management systems -- these BOPM platforms would cover a much broader scope than simply the IT services required to run the business. These BOPM platforms will draw heavily upon the functionality offered by business intelligence systems, but they will provide additional value by tying key operating and financial performance indicators to underlying IT services and other operating resources.
This will help the users of these BOPM platforms to not only identify performance issues, but to pinpoint the actions required to address these performance issues. The users of this platform will be chief operating officers and chief financial officers seeking to spot broad performance issues that cut across the core processes of the business.
These layers of IT management systems act like geological sediments, with the lower layers supporting the upper layers. For example, as Technology Service Management platforms emerge, they will not replace NSM platforms. Instead, they will provide an additional layer of management that actively draws upon the resources and information of the NSM platforms. In fact, the upper layers of IT management systems could not operate without the lower layers already in place. Put another way, these are nested platforms that build upon the resources and capabilities of the lower layers.
Expanding functionality in each layer
We have just discussed IT management systems in terms of the need to provide a steadily expanding scope of management focus addressing the needs of more senior executives within the company. At the same time, each layer of IT management systems will need to steadily expand the range of functionality offered to help identify and address performance problems.
Automating problem resolution: Today, NSM platforms provide a robust set of tools to identify performance issues among individual elements of the IT infrastructure and to discover the root causes of the performance issues. These systems are much more limited in terms of their capability to either offer transitional workarounds or to fix the problems causing the performance issues. Often, these steps require expensive and time-consuming manual labor. While we will never take human intervention completely out of the equation, there are significant opportunities to automate the problem-resolution phase. At the first level, NSM platforms can provide enhanced tools to recommend specific problem-resolution actions based on problem-resolution histories and knowledge bases. Automating the resolution itself will likely take longer, especially in cases when additional elements of IT infrastructure need to be provisioned and configured. By automating this phase, however, NSM platforms could substantially reduce the cost of problem resolution and to shorten the time necessary to restore performance to acceptable levels.
Moving from reactive to proactive problem identification: NSM platforms were largely designed to help IT managers address problems as they surfaced. As these systems accumulate more performance histories of elements in the IT infrastructure in specific kinds of operating environments, management platforms can offer more proactive approaches to problem identification. By providing appropriate analytical tools, these performance histories can be assessed to identify conditions that are likely to create certain kinds of performance issues. Management can then take the appropriate steps to either modify the conditions or to adapt the IT infrastructure to the conditions so that performance issues can be averted.
Expanding beyond enterprise boundaries: NSM platforms do a great job of monitoring performance of IT elements within the enterprise. As business processes extend across enterprises, managers have a growing need to assess performance of IT elements in infrastructures of other enterprises. Currently, NSM platforms have a difficult time providing this level of visibility.
Prioritizing problem resolution: We all have scarce resources, especially these days. One of the key business challenges in this environment is to prioritize our efforts so that we maximize financial impact given the resources available. This is certainly true in the domain of IT system and service management. But, there is a problem. How do we know which are the most critical elements in the IT infrastructure to focus on when multiple elements malfunction at the same time? IT managers may have a rough intuitive sense of priority in more static IT architectures, but what tools do they have to prioritize in more dynamic architectures? Today, this server may be used to support a mission-critical, supply-chain operation, while tomorrow the same server may be deployed to support an employee survey regarding cafeteria offerings.
In many respects, this functionality is the most challenging to provide. Here is where the two dimensions of functionality improvement for IT management systems come together. The NSM layer of management systems cannot provide robust prioritization tools until the other layers in the management systems stack have been deployed. To prioritize problem resolution at the level of individual elements of the IT infrastructure, managers must first understand how the various elements knit together to support specific IT services; they must then understand the role that these services play in supporting specific business processes, and, finally, they must understand the impact of these business processes on the financial performance of the corporation.
To deliver this prioritization capability, IT management systems will need to simultaneously pursue a bottom-up, and top-down management approach. The top-down approach is essential to define the broad business context and performance impact of IT systems. The bottom-up approach is necessary to provide the granularity required at the level of individual IT elements to address and resolve performance issues. Neither approach in isolation can deliver the functionality required by management. This is especially true as IT architectures become much more dynamic and the relationships across elements change rapidly to deliver specific IT services.
Most of the preceding discussion focused on the expansion of functionality at the NSM layer of IT management systems. Of course, this same expansion of functionality will be required at each additional layer of IT management systems.
New organizational approaches will be required to facilitate enhanced IT management
The expanded functionality of IT management systems discussed above will not deliver real business impact unless it is accompanied by significant organizational change. New management roles, governance structures, and skills will be required to harness the economic value offered by these more robust management systems.
Management roles: Most IT organizations today focus on defining clear management accountability for individual elements of the IT infrastructure. Far fewer organizations have systematically assigned management accountability for IT services defined from an end-user perspective -- for example, the "available to promise" IT service mentioned earlier. These IT service managers would have responsibility for negotiating SLAs with appropriate business users of IT services and then managing the performance of the IT services to ensure they remain within the prescribed ranges.
Increasingly, these IT service managers will need to be adept at negotiating SLAs, not only within their enterprise, but also with managers in other enterprises. Users in business partners will often access IT services offered by a company. For example, the "available to promise" service discussed earlier might be used not only by the sales force within the enterprise, but also by sales people within various third party distribution channels. Many of the elements required to deliver the IT service may be outsourced to third parties so the IT service manager will also need to negotiate SLAs with these service providers as well.
At an even higher level, organizations will need to assign explicit IT management accountability and business management accountability for core operating processes within the business. The IT manager and the business manager of a core operating process like customer relationship management need to work closely together to ensure that business performance objectives are met. As these core processes span a growing number of business partners, these core process managers will also need to become adept at coordinating activities and IT services in a much more loosely coupled manner.
Without these management roles firmly in place, it will be hard to find the right decision-makers for each of the layers of IT management systems described earlier. Without the right decision-makers operating the platforms, the data provided will not result in the management actions required to deliver maximum business impact.
Governance structures: Centralized, top-down management structures have prevailed within IT organizations. Effective use of the enhanced functionality of IT management systems will require a very different governance structure that builds much more on federation principles. In any federation, decision-making authority is lodged in the lowest layers possible. Clear distinctions are made between what is local and what must be escalated to higher levels for resolution. Adhering to the principle of the least escalation possible, appropriate escalation and conflict resolution procedures are explicitly defined in advance.
Centralized, top-down IT management systems will find it impossible to cope with the growing complexity as IT architectures become more distributed and dynamic. Single, centralized correlation engines cannot perform all the analytics required. Even if this were technically possible, this single centralized correlation function would not be feasible in distributed IT architectures spanning multiple enterprises.
Federation arrangements -- both in the IT management systems and in the broader organization -- are the only way to accommodate the reality of multiple decision-making nodes. The distinct layers of IT management systems accountability outlined earlier can help to support a federated management model, provided that these layers are designed so that decision-making and action are kept as decentralized as possible. The role of the upper layers of IT management systems is to provide guidance regarding prioritization and insight regarding interdependencies. It is not primarily to perform problem identification and problem resolution.
Federated IT management systems will, of course, have a hard time functioning if they are inserted into highly centralized organizations. Organizations will need to define and implement corresponding federation arrangements, both within individual companies and across companies that are becoming more linked in terms of IT infrastructure.
Skill building: Harnessing the economic value of these new IT management system capabilities will require new organizational skills as well. As new layers of IT management systems accountability are defined, it will be harder and harder to draw the line between IT management skills and business management skills. Even at the layer of technical services management, the ability to define services and SLAs in end-user terms will require both deep IT skills and a deep understanding of the business context.
IT skills will also need to evolve. The ability to map elements of the IT infrastructure to specific services is difficult enough in a static, n-tier IT architecture. It becomes far more challenging in a dynamic, service-oriented architecture when the various elements required to support a service may change with each instance of service delivery.
Web services technology will play a significant role in accelerating the move to a dynamic, service-oriented architecture. This same technology can help to define standardized interfaces between applications and IT management systems, and thereby simplify the task of connecting into various elements of the IT infrastructure. Unfortunately, few organizations today have deep Web services technology skills. Even fewer have deep service-oriented-architecture skills. Both of these skill sets will need to be developed to realize the potential of new IT management system capabilities.
Need to move in rapid incremental stages
Few organizations today have the systems or capabilities outlined above. There is, in fact, a significant gap between the systems in place now and the capabilities and potential that will be achieved in the years ahead. To effectively exploit this potential, companies need to have a clear vision of the systems and capabilities required. Very few companies today even have an explicit vision of the potential, so this is a necessary ingredient of any program to develop these capabilities.
But vision alone is useless -- in fact, it can become a distraction and excuse for lack of action. Many IT managers fall victim to the "architecture trap": they insist that, if detailed architectural blueprints are not fully defined, then any near-term initiatives should be put on hold until this is done. They also begin to believe that the entire architecture must be in place before any business value can be delivered. If they're not careful, these architectural initiatives can morph into ten-year mega-projects that rarely deliver the expected returns. What companies need is a broad architectural vision regarding IT management systems capability that can help to focus their near-term initiatives. If the blueprints get too detailed, they run the risk of becoming obsolete before they are complete.
Real impact is achieved when a high-level architectural vision regarding IT management systems capability is coupled with a strategy of rapid incrementalism. This approach focuses on defining short-term initiatives (at most, with a six- to twelve-month horizon) with clear business impact that can accelerate movement towards the longer-term architectural vision. These short-term initiatives speed up learning and, in turn, help to refine the longer-term architectural vision based on the new learning. The key is to stage the architectural migration in a way that delivers tangible business value at each step. This helps to build conviction and support for later stage initiatives. In an ideal world, the architectural migration becomes self-funding, although in practice this is difficult to achieve fully.
Of course, doing this effectively requires a deep understanding of the business context and the various ways in which IT systems management capability can deliver tangible business value. For example, as companies move to develop more technical service management capability, the real challenge is to understand which services have the most significant impact on the operational and financial performance of the firm.
Don't try to move to technical service management across the board -- start narrowly with a core business process and figure out where technical services become key leverage points in the performance of the process.
At the early stages, it may even be necessary to use back of the envelope calculations since detailed operational and financial data may not be available in a form that is helpful. Remember the old Pareto rule? Find the 20% of the services that deliver the 80% of the economic value to the process. At this point, figure out which of these 20% of the services are easiest to manage -- both in terms of the ability to map to the underlying elements of the IT infrastructure and the ability to affect SLA performance through enhanced problem identification and resolution tools. Start here. Depending on how evolved the IT management capabilities are today, it may even be necessary to define appropriate SLAs as a first step.
The key is to figure out what can be realistically done in a six to twelve month time frame and what specific business benefits can be delivered at the end of this period. In this economic environment, the more the business benefits can be framed in terms of operating savings -- operating cost reduction or operating asset savings -- the more likely the short-term initiatives will be funded. Quantify the business benefits and then measure them. If these benefits materialized, document it and publicize it. Build support for the next wave of initiatives. If these benefits did not materialize, then figure out why not and incorporate this learning into the next wave of initiatives. Be clear about what has been learned and why the next wave of initiatives is more likely to produce the expected results.
The high level architectural vision will be very helpful in focusing the near-term efforts so that they produce both near-term business benefits and accelerate the movement towards a much more powerful IT systems management architecture. It helps to prevent scatter-shot initiatives, none of which receive sufficient resources or attention to have real impact. On the other hand, the discipline of seeking six to twelve month tangible business benefits helps to reduce the risk of feature creep or mega-project black holes. The key is to get started, but to have some idea of where you are headed.
IT management performance diagnostic
- How systematically can we tie individual elements of the IT infrastructure to specific IT services delivered at the end-user level?
- How systematically can we tie specific IT services to the core operating processes of the business?
- Can we quantify the financial and operating impact of the malfunction of specific IT elements?
- How much of the data generated from our current IT management systems is actually used for problem identification and resolution?
- Is the IT systems management data that is actually used easily accessible and presented in the most useful way to reduce the expense and lead-time required for problem identification and resolution?
- How much staff time is consumed in IT systems problem identification and resolution? Is this more or less than it was three years ago? Why?
- Are we primarily operating in reactive mode when dealing with IT systems performance issues?
- How well can we identify and resolve problems with elements of the IT infrastructure that are critical to our business performance but reside outside the boundaries of our enterprise?
Action Plan
First month
Assemble a team focused on upgrading your company's IT management system capabilities
Conduct a diagnostic of current IT management system capabilities, focusing on
- Where the major expenses are incurred in current IT performance management
- Where the most severe IT system performance shortfalls occur in terms of impact on the operating and financial performance of the business
- Where major new business initiatives are likely to create the most stress in existing IT management systems
Develop a high level IT management system architectural vision along the two dimensions outlined above -- layers of management accountability and categories of functionality available at each layer
Second month
Meet with senior IT and business management to build alignment around the high level IT management system architectural vision -- refine the architectural vision as required
Identify a series of initiatives designed to accelerate movement toward this architectural vision while at the same time delivering tangible business benefits within a six to twelve month time frame -- be sure to include organizational initiatives designed to address major capability gaps.
Third month
Create a portfolio of near-term initiatives prioritized based on magnitude of business impact and amount of effort and resources required to support the initiatives
Develop a business case for each of the near-term initiatives, quantifying anticipated business impact
Define business impact milestones for each of the near-term initiatives and quantify the base line performance for later measurement of impact
Launch first wave of initiatives
--
More information is available at www.johnhagel.com; contact John Hagel at john@johnhagel.com.