“Electric Power Resilience: The Challenges for Utilities and Regulators,” Yale Journal on Regulation Bulletin, August 2019.

Original Post

Electric Power Resilience: The Challenges for Utilities and Regulators

(https://www.yalejreg.com/bulletin/electric-power-resilience-the-challenges-for-utilities-and-regulators/)

Few people doubt that the United States will continue to experience long-lasting electric power outages affecting a large number of people and businesses (e.g., outages from Superstorm Sandy, Hurricane Maria in Puerto Rico, and severe hurricanes in Florida).¹ Some industry observers believe that the resilience of the U.S. electric-power network is deficient, and if industry spends additional funds on improving its resilience, the benefits would outweigh the costs.²

Customers, the media, and the public have taken more interest in scrutinizing utilities’ responses to weather-related disasters and cyberattack threats.³ There is a full-court press at the federal, state, and local levels to bolster the resilience of electric power.⁴

The common perception is that the benefits are too large to ignore. This belief seems plausible on its surface. After all, the damages from a long power outage can be devastating to a locale, affecting almost everyone.⁵ Possible damages include economic and noneconomic components. An example of the latter is the inconvenience and discomfort caused by an outage. Imputing a dollar value on the noneconomic effects is difficult—not much more than a guess. Any event that causes prolonged power outages over a large area is extremely costly—both for restoration and to the economy, and in terms of citizens’ health, safety, and general welfare.

Studies have shown that electricity outages, on a kilowatt hour (kWh) basis, have far higher costs than both the price of electricity and the cost of producing and delivering electricity.⁶ Outage costs include spoiled food, lost productivity, lost business revenues, and inconveniences. On net, then, electric customers are worse off when they experience an outage, even when their electricity bills decline. This is no surprise for anyone that has experienced an outage, especially for a long duration.⁷

Both the utility provider and electricity consumers can mitigate the damages from extended power outages. Resilience should be an integral goal of utility planning to determine long-term investments and other measures. Through pricing, a utility can determine how much its customers are willing to pay for different levels of resilience. Customers can also self-insure and take other protective actions to mitigate the damage from long-term outages.

The widely recognized Coase Theorem favors the imposition of liability on those parties who are able to address a problem at the lowest cost.⁸

It may be true that customers can more cost effectively make investments and take other adaptive actions to mitigate the effects of long-term power interruptions. But solely maximizing efficiency avoids the question of who should bear the risk from an “equity” perspective.⁹ At the other extreme lies strict liability. Under strict liability, the utility alone is responsible for all long-term service interruptions, irrespective of the cause and cost. Imposing all liability on a utility might both be inefficient and unfair to the utility and its shareholders. It presumes that either (1) the utility could prevent and mitigate the damage at a lower cost than customers could (an efficiency argument), or (2) fairness demands that the utility absorb all costs, since the occurrence and the actual consequences of service interruptions are largely under the utility’s control. Each condition, or both, may fail to hold.

This Essay argues that, for an electric power system, achieving a socially optimal level of resilience poses more challenges to utilities and their regulators than achieving optimal reliability and thus requires a special form of regulatory analysis. A host of factors—flawed decision-making (e.g., probability neglect), ambiguity over the definition of resilience and the scope of activities to which it is relevant, the high uncertainty of the benefits from improving resilience, the difficulty of measuring resilience and establishing a benchmark, and the optimal institutional arrangement involving both utility planning and a consumer-driven approach—complicate the tasks of both utilities and regulators. This Essay discusses each factor in turn.

I. Differentiating Resilience from Reliability

A fundamental problem in developing policies that guide utility-resilience investments is the lack of a consensus on the definition of resilience and deficiencies in measuring and assessing resilience on various scales. Experts and other observers disagree over the scope of resilience. Should it include only actions to restore power service? Or does it encompass avoiding service interruptions as well? What most differentiates resilience from reliability is the attention paid to utility activities once a service interruption begins, a presumed longer interruption period, and a wide selection of mitigating actions.

A. Ambiguity Over the Definition of Resilience

Various definitions of resilience abound, differentiated by the nature and scope of actions directed at making a power system more resilient.¹⁰ Often the term resilience is used loosely and inconsistently.

One generally accepted definition of resilience is that it measures the performance of a system under threat or stress, for example, power grid performance under severe weather conditions or a cyberattack. ¹¹ According to this definition, a resilient power system possesses the capability to absorb a disruptive event and still continue to operate. If service is interrupted, resilience turns to the ability to mitigate the damage or social cost. Sources of system disruption from a major event are both natural (e.g., severe storms) and human (e.g., cyber and terrorist) threats. A resilient power system has the ability to withstand and recover from both malicious and inadvertent cyber and physical attacks.

The above definition of resilience contains two components. The first is static, which is the ability of a power system to remain functional when shocked. The second is dynamic, which involves hastening the speed of recovery from a disruption.¹² The first component relates to keeping the lights on. The second involves restoring service quickly when the lights go out and, considering the economics, at the minimum cost to consumers and society. Static resilience and dynamic resilience are substitutable. By spending more money to prevent service interruptions, the operator can reduce the money spent on managing an outage and restoring service.

Utilities commonly engage in four broad activities related to resilience: (1) sustaining operation of the power system; (2) restoring service; (3) planning and preparing for future extended interruptions; and (4) adapting to future events based on past experiences.¹³ The last activity, often overlooked, refers to the process in which, after a disruptive event happens, a utility learns from that experience. This can result in the utility modifying its future actions in response to a disaster, which will improve resilience and thereby reduce the duration of a service interruption.

The 2017 U.S. Department of Energy report on Electricity Markets and Reliability recapitulates the unique challenges associated with resilience:

Recent severe weather events have demonstrated the need to improve system resilience. The range of potential disruptive events is broad, and the system needs to be designed to handle high-impact, low probability events. This makes it very challenging to develop cost-effective programs to improve resilience at the regional, state, or utility levels. Planning, practice, and coordination on an all-hazards basis and having a mix of resources and fuels available when a major disturbance occurs are both essential to fast response.¹⁴

The Edison Electric Institute has compiled a listing of recent studies, programs, and policies on grid hardening and resilience for distribution systems in response to large storms. It notes that no single solution exists to make all systems more resilient; rather, “utilities and their regulators must look at the full menu of options and decide the most cost-effective measures to strengthening the grid and responding to storm damages and outages.”¹⁵

B. A Two-Part Definition of Reliability

The North American Electric Reliability Corporation (NERC) disaggregates reliability into two parts. The first, operating reliability, is “the ability of the electric system to withstand sudden disturbances,” such as electric short circuits or unanticipated loss of system components.¹⁶ Operating reliability identifies short-term operational aspects of the system, which overlap with maintenance of system functionality when confronted with an extreme event like a hurricane. It relates to that element of resilience that prevents the system from going down and interrupting service to customers.

The second part, adequacy, is “the ability of the electric system to supply the aggregate electric power and energy requirements of the electricity consumers at all times, taking into account scheduled and reasonably expected unscheduled outages of system components.”¹⁷ Reliability is generally measured by interruption metrics.¹⁸ It is a binary concept of system performance: the lights are either on or they are not. Reliability, therefore, concentrates on a system’s general capability to provide power with as few service interruptions as possible.

C. The Uniqueness of Resilience

Resilience, as discussed supra, focuses more on one-time extreme events with the potential for widespread and long-lasting damage. A more resilient system can better withstand an extreme event. With a disruptive event, the damage is smaller and recovery is faster. Moreover, resilience extends beyond the duality that characterizes reliability by allowing for intermediary positions between service and no service.

First, consumers experience benefits of resilience and reliability investments differently. Resilience-improving actions attempt to mitigate the damage done by extreme (i.e., not routine) circumstances. Customers enjoy most of the benefits from investing in resilience only after an extreme event. Indeed, with luck, in a given year, they may realize no benefits other than a perceived reduction in risk due to these investments—for example, comfort in knowing that they would suffer less damage if a severe storm happens. In contrast, because investments in reliability enhance routine operations and conditions, customers should see their benefits within a few years.

Second, improving resilience requires a greater diversity of activities because of resilience’s broader scope. Resilience-improving conduct occurs over a broad time range. Some, like preparation and planning, occur before an event is on the horizon. Others, like the retention of service and service’s quick recovery, only occur after an event happens.¹⁹ Activities also vary in priority because of their greater benefits, lower costs, or both. For example, many observers consider two-way communication with customers during a long-term outage as critical and relatively low cost.²⁰ Customers can better adapt if they know how long a service interruption will continue. Their behavior will likely differ if, say, the estimated time to restoration is eight hours versus thirty minutes. Other commonly cited resilience improvements include more redundancy (e.g., spare parts), hardening of distribution lines (e.g., upgrading poles and structures with stronger materials), distributed energy resources (DER) and microgrids, remote-controlled switches, system design accommodating recovery, mutual assistance programs, security measures, a diversified and integrated grid, training and workforce development, smarter operation of distribution component, and better communications with customers. ²¹

Finally, resilience improvements require more effort on the part of utility companies. To combat a major event, the utility may have to repair damaged overhead lines, transformers, and substations.²² It also may have to grapple with service interruptions throughout its system and for a large number of customers. These possibilities make resilience-based outages more challenging to confront and analyze than reliability-based (e.g., less than twenty-four hour) outages.²³

Resilience is, however, not the sole purview of utility operators. A holistic perspective of resilience views how power operators, electric consumers, and the general economy respond to a disruption—even perhaps through utility-community joint efforts that require outside help from the government.²⁴ Holism also involves how electric consumers react to an extended service outage and what precautions they took prior to the outage to lessen the actual harm they otherwise would suffer.²⁵ These actions represent adaptive responses to a major threat.²⁶ Any regulatory policy should recognize that a utility and its customers, along with the community assisted by the government, can jointly contribute to mitigating the damages from an extended service interruption. These entities possess complementary expertise and skills that can efficiently and equitably mitigate the damage done by extended outages to utility customers and the community.

II. Flawed Decision-Making

A. Probability Neglect

Evidence from different contexts has shown that “probability neglect,” namely the sole focus on outcome and disregard for its probability, helps to explain excessive reactions to low-probability, catastrophic events.²⁷ That is, the responses to tragedies and other highly damaging events often occur right after outrageous incidents with high public exposure. The result is a failure to apply rigorous, analytical approaches to managing risks. Resilience of electric power is susceptible to this conundrum.

Because policymakers and system operators rightly fear extended service interruptions and blackouts—they would face the brunt of criticism—they may not think twice about burdening electric customers with the cost of avoiding them. They will tend to err on the side of excessive caution that translates into higher electricity prices for their customers.²⁸ The problem is that they may view improved resilience to be beneficial from their perspective when it is not from society’s.

The mindset of many decision-makers and industry observers<em> seems to be</em>: we can’t let this happen again, no matter how slim the chances are and the economics.²⁹ Moreover, their risk perceptions are exaggerated, derived from high-profile publicity given to such events. The policy discourse over electric power resilience exemplifies this flawed thinking.

Prudent decisions on resilience require consideration of the probability of events, whether calculated objectively with historical data or determined subjectively. Assessing the economics of improved resilience should account for the likelihood of the future frequency and duration of extended outages covering a large area. Otherwise, how can decision-makers conduct a valid cost-benefit review of costly investments and other actions?

B. The Precautionary Principle

Analysts often refer to the precautionary principle in setting environmental, safety, and other public policies across different industries and contexts.³⁰ The precautionary principle warns that society is gambling when it acts to prevent a potential harmful event only under stringent conditions (e.g., a highly certain future). It assigns a benefit to prevention even with inconclusive risk. The application of the precautionary principle to electric power resilience seems appropriate. The reason is that, even though it becomes difficult to assign a probability to a catastrophic event and measure, with reasonable accuracy, the benefits from actions to enhance resilience, the event could cause severe damage to electric consumers and the regional area.

1. What Is the Precautionary Principle?

According to the precautionary principle, the optimal decision in a world of less-than-perfect certainty and large risk from the status quo calls for new action. Uncertainty exists where decisionmakers lack reasonably accurate estimates or forecasts for the benefits from enhanced resilience. The precautionary principle strategy mirrors a “min-max” approach—i.e., minimizing the maximum harm that can result from an adverse event³¹—which is most appropriate for situations where the outcomes can afflict substantial damage to property and human life, in addition to being highly uncertain.

Under the precautionary principle, even in the face of uncertainty, society should expend resources today to mitigate the chances of severe problems in the future.³² The implication for resilience is that society should spend some amount of money today to enhance resilience and avoid a worst-case scenario, notwithstanding the high uncertainty over the benefits.

To wit, the precautionary principle says that society takes an inordinate risk when it attempts to prevent a potential harmful event only under certainty. It supports society erring on the side of caution in protecting the general public from risk, adopting a “better safe than sorry” stance that insures against catastrophic events. For resilience, the precautionary principle would recognize both the possible extensive damage to society from threats to the electric power grid and the inconclusive nature of the costs from service disruptions.

A conservative interpretation of the precautionary approach aligns with the “real options theory.” According to this theory, decision-makers would “hedge” by deferring costly actions until they acquire more definitive information necessary to reduce the chances of making the wrong decision (e.g., overspending on resilience).³³ This wait-and-see posture can help avoid uneconomic actions. Decision-makers would delay undertaking a major initiative until they know more about the risk level of threats. This cautious approach would result in some spending today on enhancing resilience as an insurance against the possibility of a catastrophic outcome.

2. Critiques of the Precautionary Principle

The precautionary principle is not uncontroversial.³⁴ Critics note its shortcomings compared to a cost-benefit analysis that accounts for uncertainty and the risk aversion of the decision-maker. How much money should society spend today to mitigate the consequences of major threats to the electric power system? Should a utility spend a hundred million dollars or two billion dollars for mitigation? Unlike a cost-benefit analysis, the precautionary approach provides little guidance.

When people purchase insurance, they at least implicitly compare the premiums with the expected cost of an adverse event.³⁵ Should society not have an idea of the expected benefits from spending money today to reduce the damages from long-extended power outages? But the precautionary principle places the burden on those would devote little or no resources toward mitigating a hazard where reasonably accurate information about its consequences and the probability of its occurrence is nonexistent.³⁶ The default option is that society should act to avoid a risk with potentially catastrophic consequences, irrespective of the likelihood of the event. That is why a prominent scholar harshly said that the precautionary principle “offers no guidance—not that it is wrong, but that it forbids all courses of action, including inaction.”³⁷

Indeed, the precautionary principle may not represent an economically rational way to guide socially desirable policy, especially when it involves society spending large sums of money today. Assume that the utility spends substantial sums of money to improve its resilience. Although the investment would reduce the expected damages from a severe storm, that money might have been better spent on alternative utility activities designed to improve safety or reduce other risks. The benefits of the paths not taken constitute the opportunity cost of spending the money on resilience.³⁸

So even if a course of action is tenable, rational behavior would limit spending for preventing the possibility of future harm from catastrophic events. In the face of uncertainty, the best policy may be to avoid the worst outcome irrespective of the probabilities for different scenarios. This rationale assumes a risk-averse society and severe damages from an event.

III. Uncertainty as a Complicating Factor

Uncertainty differs from risk in that the probability of occurrence is not quantifiable, thereby requiring subjective judgment by decision-makers.³⁹ Under uncertainty, a common approach is to describe various hazard scenarios or assign them a probability based on the decision-maker’s personal assessment.

Uncertainty can warrant action, but decision-makers should exercise caution when doing so. When deciding to perform a costly action with uncertain benefits, economic or noneconomic, people often hesitate, and they hesitate rationally. Decision-makers need to continuously acquire better information, whether from credible modeling or more informal sources, to increase the chance that they take socially desirable actions in the face of uncertainty. This seems to be true for electric power resilience.⁴⁰

To make socially optimal—or even effective—decisions on resilience requires reasonably accurate information about the true cost of service interruptions for both utility customers and the local area. As of today, methods that estimate willingness to pay (WTP) are inadequate to inform the benefit-cost analysis of “resilience investments.”⁴¹

A. “Black Swans”

Analysts label power interruptions of long duration as a high impact, low probability (HILP) event, also known as a “Black Swan” event. A Black Swan event poses special challenges for decision-makers because of its (1) far-reaching impact; (2) poorly understood risk (“uncertainty”); (3) costly mitigation; and (4) the unclear role for industry, power customers, and government in sharing the responsibilities for mitigating the effects of a major event.⁴²

HILP events can have macroeconomic and other societal impacts (e.g., nonmonetary inconvenience). The likelihood and magnitude of societal impact increases as an interruption endures for an extended period over a large area.⁴³ As the extent of an outage and the dispersal of its effects increases, the benefits of any actions to improve resilience become harder to measure.

B. Addressing Uncertainty

Decision-makers⁴⁴ face a high degree of uncertainty about the effects of major disruptions on an electric power system. Because of scant data to draw upon, predicting their frequency and the damage they might cause is pure speculation.⁴⁵ Compared with natural threats to electric grid reliability, such as extreme weather, cyber threats are more difficult to anticipate and address.

One source of greater uncertainty that has proliferated recently is more extreme weather, which some experts predict to worsen in the future.⁴⁶ Another relatively new source of uncertainty is cyberattacks; utilities have little clue when they will occur and with what likelihood.⁴⁷

Uncertainty inevitably forces decision-makers to rely heavily on value judgment. These value judgments frequently are part of the evaluation of the benefit side of “resilience” investments.⁴⁸ It also makes strict cost-benefit tests less feasible. ⁴⁹ Analysis of resilience thus differs from evaluating reliability concerns, which are more accurately described as risks. The likelihood of inadequate capacity to meet demand is derivable from historical data for both system peak load⁵⁰ and power-plant outage rates. Reliability is also more precisely measurable and is less ambiguous in its definition.⁵¹ An Electric Power Research Institute (EPRI) report summarizes the current state of cost-benefit analysis for electric power resilience, and demonstrates how much these analyses differ:

In conventional cost-benefit analysis, prospective investments can be evaluated by comparing the costs and benefits expressed in present-value currency terms, which make comparisons straightforward. Resiliency investments are considered to avert the consequences of events characterized by low probability, uncertain timing, and high severity (while the costs are certain and large). If costs or benefits are not known with certainty, then the analysis must account for this from an expected risk perspective. Risk is traditionally defined as a function of the hazard (i.e., probability) and the consequence. Consequence can be further described as a function of exposure and vulnerability . . . . [T]here is no unifying perspective or framework for cost-benefit analysis of resiliency efforts, though there is much interest in advancing the state of the art. ⁵²

One way to handle uncertainty is to know the customer’s implicit WTP for enhanced resilience in order to justify investments or other actions. While decision-makers cannot expect to eliminate uncertainty, they can incorporate it (e.g., by using subjective probabilities) into WTP calculations and cost-benefit analyses. The decision-maker can then better understand how uncertainty affects the cost-effectiveness of investments and other resilience-improving measures under different scenarios.

One unavoidable question is: how much is society willing to pay for increased resilience? The answer depends on the avoidance of lost welfare that would otherwise occur. Lost welfare measures what analysts refer to as the “Value of Lost Load” (VOLL). VOLL estimates can help to set resilience targets and to allocate monies toward different measures to enhance resilience. A risk-averse society would be willing to spend more than the expected loss in welfare. The tough chore for decision-makers is to calculate the risk tolerance of different customers. For the utility-planning approach, discussed infra Section V.A, analysis requires aggregating the disparate risk preferences across utility customers into a single standard for society.

Knowing with reasonable accuracy how much customers and society would be willing to pay for avoiding long service interruptions, therefore, is critical for prudent decision-making. But presently, willingness-to-pay information is too imprecise to render much value.⁵³ For example, VOLL estimates are highly uncertain and specific to local conditions, outage duration, and scope. Besides, almost all studies focus on outages of twenty-four hours or less.⁵⁴ The benefits of investments in resilience to counter interruptions of long durations (e.g., multi-day service outages) are consequently dubious and much more uncertain than the benefits for investments in electric power reliability.

Uncertainty often causes suboptimal behavior. People sometimes act overconfidently by overstating the sureness of their decision, saying something like: “We just know that spending more money on resilience is cost-beneficial. We observe how damaging long-term power outages can be, so we can never throw too much money at trying to mitigate their effects.” Others may rationalize inaction because of the high degree of uncertainty, stating, “Since we have highly imprecise estimates of what the benefits will be, we shouldn’t throw any money at improving resilience until we get better estimates.” Both responses can lead to irrational behavior and a socially undesirable outcome.⁵⁵ The arguments are akin to the policy issues facing climate change, with divergent positions taken.⁵⁶

IV. Metrics for Resilience

A major if not central purpose of regulation is to induce high-quality performance from public utilities.⁵⁷ To achieve that objective, regulators should measure and evaluate utility actions. Performance depends on how well utility management uses available resources. Yet factors outside utility management’s control also affect performance.⁵⁸

In the context of this Essay, regulators lack the ability to determine the minimum costs compatible with a certain level of resilience (if measurable). Utilities inherently have better information that motivates them to overestimate the cost of resilience. This incentive is more acute when utilities receive zero or minimal benefits from better managing their costs or improving their resilience. That is, the higher the allowed costs, the less risk there would be that unanticipated additional expenditures (e.g., cost overruns on underground distribution lines) would result in the utility earning a return on capital below its allowed return.

The challenge for regulators is to determine what constitutes a well-performing utility.

What do they consider acceptable performance? Regulators must address this question if they are to exploit fully the information contained in performance metrics to take appropriate action, including those metrics relating to resilience. Measuring performance trends in the absence of a standard, for example, greatly constrains what actions regulators should take. How can they rightly penalize a utility without a benchmark against which to evaluate a utility’s performance?⁵⁹

A. Challenges to Developing Resilience Metrics

Developing metrics for resilience is inherently difficult but is critical for decision-making.⁶⁰ It involves measuring how well system operators prepare for and deal with rare events without any history (therefore, with significant uncertainty).⁶¹ As amplified in one study:

Reliability metrics measure grid operations during expected outages that could occur under relatively normal conditions. However, reliability metrics typically do not include outage information when low-probability, high-consequence events such as storms, earthquakes, and cyber-attacks occur. As the hazard landscape continues to change, historical data used for reliability calculations may not be suitable for characterizing future potential outages because emerging threats can differ significantly from historical precedents.⁶²

According to the National Academy of Sciences, “[w]hile reliability metrics are relatively well established and widely used in electricity system planning and operation, the development of agreed-upon metrics for resilience lags significantly behind.”⁶³ Decision-makers have found it difficult to assign standards to each of the activities advancing resilience, let alone measure their effectiveness. From the perspective of utility customers, a number of metrics are relevant. They relate to the frequency of long interruptions; the duration of long interruptions; the affected population; survivability (the provision of essential service); and lost welfare, like inconvenience and economic losses.

B. Limitations of Resilience Metrics

1. A Cautionary Note

Appropriate use of performance metrics, even where they accurately measure the outcome of utility actions, depends on the regulator’s ability to separate the effects of external and internal factors on performance. For resilience, several factors are relevant, some internal to a utility’s control and others outside utility management control. The challenge for regulators is to distinguish between these internal and external factors when deciding on whether a utility’s actions are prudent. Without this separation, applying performance metrics for regulatory decision-making becomes more difficult, even counterproductive. One example is comparing two utilities’ time to restore service after an outage. It may well be that the utility taking the longer time faces more challenging physical and environmental conditions. It would be unfair to penalize or reprimand that utility without considering those conditions. Regulators should therefore exercise caution in using performance metrics mechanically or as the sole source of information for evaluating a utility’s performance.

Regulators should pay special attention, however, to those utilities exhibiting abnormal or “outlier” performance, which might lead to more detailed inquiry. A metric, therefore, can act as a guide to future regulatory scrutiny and remedial actions. Metrics function best as a gross indicator signaling a potential problem warranting further inquiry.

2. An Example of Metrics’ Limits: The Utility of Input Metrics

For example, a serious limitation of input metrics (e.g., money spent on vegetation management) is that they provide no indication of how well a power system will operate in the event of a disruption or of the effectiveness of specific actions to enhance resilience.⁶⁴ The real indicators of a power system’s resilience are the consequences that befall the system during and after a disruptive event. The utility can either estimate the consequences,⁶⁵ account for the inherent uncertainties, or measure consequences after an event.

Input metrics also fail to consider the tradeoffs between different processes or standards. Assume that a utility meets a mandated regulatory standard for hardening its infrastructure to mitigate damage from a severe storm. Perhaps the money invested in infrastructure could have been better spent on emergency and planning activities or on better communications with customers during an event. That is, the actual dollars spent by the utility could have been used for greater improvements of resilience; the opportunity cost of the infrastructure investments exceeded their benefits.

V. Alternative Institutional Arrangements

A “resilience” strategy should focus on the consumer. Later, this Part argues that focusing on the individual consumer is a more economically efficient and equitable approach than lumping all the customers together, such as under the utility planning approach, to make decisions on resilience.

The decision to spend money on improving resilience should hinge on the risk-adjusted expected value to consumers. In a market-based environment, the resiliency of electric service will depend more on the value that consumers place on different levels of resilience. Under this “bottom up” approach, pricing and market incentives become major factors. Overall, customer-oriented strategies incorporate flexible, market-based rules accommodating the demands of individual customers.

But with long-term and massive power service interruptions having spillover effects on the economy, there is a macroeconomic effect (and noneconomic effects like safety, health and inconvenience) with a sum cost greater than the aggregate costs suffered by utility customers. When an industrial firm has no electric service, the relevant cost to the firm is the loss of profits. In contemplating whether to purchase backup service, the firm compares the cost of that service to the benefits (which would be avoided profit losses). But this private benefit would fall short of the social benefit, which would include employees not losing work and wages and other economic and noneconomic benefits external to the firm. The total benefit to the local economy would exceed the benefits to the firm. Thus, the profit-maximizing firm would be underinvesting, since it would not internalize some of the benefits that would result from having backup service. In other words, society should be willing to pay more for avoiding a massive outage than what the sum of individual customers would be willing to pay. This gap would then call for some action to close it, action that should be taken by the government or by utilities.

A. Utility Planning Approach

1. Causes of Suboptimal Outcomes

From a public-interest perspective, the ultimate question is whether a “resilience problem” exists; namely, either utilities are spending too much on the resilience that they presently have (however it is measured), spending too little for enhancing their resilience, or are overly resilient today.

On one hand, deficient resilience can exist because of the net-positive externalities—for example, the social benefits, which may include public health and safety—exceed the benefits to electricity customers and the utility itself. On the other hand, utilities may be overspending on resilience because of excessive caution and probability neglect.

Utilities can also spend too much on the resilience desired because of the absence of a sequence of actions based on cost-effectiveness. Some analysts have questioned the cost-effectiveness of underground lines; for example, these lines can cost three to four times more than overhead lines of equal distances. Although underground lines can reduce the frequency of service interruption, outage duration is typically longer than with overhead lines because of the greater difficulty in repairing underground lines. Other measures (e.g., investment in an outage management system) taken by one utility can be cost-effective while the same measures may not be for another utility. One example is underground distribution lines in areas like Florida, where long-term, weather-related outages are more frequent than in other areas, would tend to be more economical.

Under conventional ratemaking, utilities may have an incentive to inflate their rate base to improve resilience (assuming that the authorized rate of return exceeds a utility’s cost of capital⁶⁶)—in regulatory jargon, gold-plating⁶⁷—by spending excessively to boost their profits. They may approach their regulators with key actions and investments to improve resilience without explicitly considering their costs or effectiveness. The likely outcome is utility customers paying for resilience at an amount more than either the benefits they receive or the amount they should pay, because of utilities’ cost inefficiencies. For example, communicating better with customers during an outage may be less expensive than investing in new distribution hardware.

2. Alternatives to Cost-Benefit Analysis

If a problem exists, what might utilities do? How do we know if improved resilience is net beneficial to society? These questions are tough to answer, especially when relying on conventional cost-benefit analysis. Other analytic approaches may provide a fuller picture.

One of these approaches, break-even analysis, asks the following question: if we know the costs of increasing resilience and the costs of electric outages to customers and society, how large does the probability of an event, combined with the effect of resilience investment on the costs of the event’s consequences, have to be to balance benefits and costs? Break-even analysis becomes more valid when there is a high degree of uncertainty over the probability distributions of relevant outcomes. With high uncertainty, even when examining different futures, policymakers have little clue of the probability for each future.⁶⁸

Another strategy would be to expend resources today to mitigate the small chance of a future disaster (i.e., the “fat tail” part of a probability distribution), even when the benefits are highly uncertain.⁶⁹ This resembles the earlier-described “min-max” strategy that aims to minimize the maximum harm that can result from an adverse event. Even a “fat tail” approach, however, might lead to underinvestment in resilience. Long-term service interruptions could result in damages far greater than what is presently considered likely. One policy implication is that instead of viewing “resilience” actions as a cost-benefit question, decision-makers should consider them as a form of insurance against a catastrophe that might happen, but with an unknown likelihood.⁷⁰ Yet this begs the question of how much utilities should spend in total and for each resilience-improving measure. These are the basic challenges facing both utilities and their regulators.

B. Customer-Driven Approach

A customer-driven approach involves utilities offering customers service-differentiated pricing.⁷¹ Such pricing has the potential to optimize the response to service interruptions by allowing utilities to charge a premium to customers who value uninterruptible service at the highest level. These customers, to the extent technically feasible, will have their service cut off only after the utility interrupted service to other customers.⁷² Service-differentiated pricing considers explicitly a customer’s WTP.⁷³ Evidence shows that customers suffer widely varying costs from power interruptions.⁷⁴

For example, some retail customers are very tolerant of variations in power quality and power interruptions, while other customers are less accepting of these conditions. Customers would therefore be willing to pay different amounts for protection against interruptions.⁷⁵ One prime example is interruptible rates for customers who are willing to accept less reliable service in return for a lower rate.⁷⁶ Critical peak pricing and peak-time rebates also illustrate where customers are willing to tolerate lower reliability for savings on their electricity bills.⁷⁷

In comparison, the central planning, or top-down alternative previously mentioned—whereby a utility makes network-wide investments to increase resilience for all customers—can be extremely expensive. It would also raise a “fairness” issue: those customers who impute a relatively low value on increased resilience would subsidize other customers.

One widely acceptable practice is to minimize service interruptions of critical services. This may require the use of distributed generation and microgrids for essential load like police and gas stations, hospitals, and cell towers, and taking nonessential loads offline. Another action may be to ensure that essential services have backup systems. There are different ways to maintain essential services when power is out, and the most economical ones ought to be part of a “resilience” plan.⁷⁸

Noncritical customers can mitigate the damage from extended outages, in other ways than purchasing insurance from a third party. Many industrial customers who find service interruptions extremely costly have a direct connection to the bulk power system and backup generation. Other customers can purchase a backup generator, solar photovoltaic systems with smart islanding inverters, or install Powerwall batteries.⁷⁹ Residential customers can prepare for an outage by buying extra batteries, flashlights, and blankets, and mitigate losses by purchasing surge protectors. Enhancing resilience is therefore a split responsibility between utilities and their customers.⁸⁰

VI. Policy Implications

The overall goal of resilience should be to minimize the lost value to electric customers and society from service interruptions caused by external natural and human threats, net of costs. In other words, actions to improve resilience should minimize the social costs from a disruptive event, which requires accounting for both the benefits and costs.

To quantify the social costs with reasonable accuracy, utilities will need to develop new data and models. These additional analytical capabilities can identify areas of greatest risks and system vulnerabilities, allocate resources more efficiently, and help prioritize investments. Research and development for promising technologies and mechanisms, like improvements to control systems and distribution automation, holds the key to improving resilience in the long term.

Achieving socially optimal resilience poses more of a challenge for utilities and their regulators than reliability does. Resilience covers a wider array of diverse activities, and its benefits are more uncertain. Undertaking a cost-benefit analysis to evaluate different actions is more specious for resilience than for reliability. For example, reliability concerns are more accurately described as risks. The likelihood of inadequate capacity to meet demand is derivable from historical data. Reliability is also more precisely measurable and is less ambiguous in its definition.

To wit, a fundamental problem in developing policies that guide utility resilience investments is the lack of a consensus on the definition of resilience and deficiencies in measuring and assessing resilience on various scales. Experts and other observers disagree over the scope of resilience: should it include only actions to restore power service or does it encompass avoiding service interruptions as well? This Essay supports the latter definition.

A customer-driven approach with service-differentiated pricing is the most promising path to pursue if one wants to know how much customers are willing to pay for resilience. But because of the external benefits like macroeconomic effects, public health and safety from higher resilience, and the technical and political difficulty for a utility to differentiate services across individual customers, it would be ill-advised to rely on market forces alone to achieve a socially optimal outcome.

One important function of public utility regulators is to identify any undue barriers to enhancing resilience and those actions that can most cost-effectively eliminate or mitigate them. Such barriers exist when (1) they cause economically inefficient and socially harmful outcomes, and (2) their mitigation passes a cost-benefit test, making them amenable to public-policy intervention. One possible barrier derives from the perception that utilities could expend substantial money on enhancing resilience with the benefits only realized after a major event. Trying to quantify those benefits prior to investments is extremely difficult: what is the probability that customers will realize benefits, when will they realize them, what are the chances of a major event occurring, and how much do customers benefit when a major event occurs? It would seem hard for a utility to justify expenditures on resilience to a regulator, especially when trying to compare them with the expected benefits to its customers.

With utilities strongly motivated to enhance resilience—perhaps excessively—regulators face two tasks attuned with the public interest. The first is to make sure that utilities implement the most cost-effective actions. The second is to prevent utilities from “gold-plating” their rate base.

In closing, in an ideal world, utilities could justify proposed “resilience” actions by making demonstrated and verifiable benefits at the least cost. But since this is next to impossible, decisions affecting resilience require subjective judgment by utilities and their regulators—more so than for decisions relating to reliability. This implies that any action relies heavily on the decision-maker’s judgment devoid of precise or even reasonably accurate quantification. Utilities (and regulators) need to prioritize “resilience” actions despite the absence of useful estimates of benefits or cost effectiveness.