Read "Assessing Policies for Retirement Income: Needs for Data, Research, and Models" at NAP.edu

« Previous: 4 Data Needs

Page 132 Cite

Suggested Citation:"5 Development of Projection Models." National Research Council. 1997. Assessing Policies for Retirement Income: Needs for Data, Research, and Models. Washington, DC: The National Academies Press. doi: 10.17226/5420.

Page 133 Cite

Page 134 Cite

Page 135 Cite

Page 136 Cite

Page 137 Cite

Page 138 Cite

Page 139 Cite

Page 140 Cite

Page 141 Cite

Page 142 Cite

Page 143 Cite

Page 144 Cite

Page 145 Cite

Page 146 Cite

Page 147 Cite

Page 148 Cite

Page 149 Cite

Page 150 Cite

Page 151 Cite

Page 152 Cite

Page 153 Cite

Page 154 Cite

Page 155 Cite

Page 156 Cite

Page 157 Cite

Page 158 Cite

Page 159 Cite

Page 160 Cite

Page 161 Cite

Page 162 Cite

Page 163 Cite

Page 164 Cite

Below is the uncorrected machine-read text of this chapter, intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text of each book. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

Development of Projection Models As the concern about retirement income security and the costs to provide it increases over the next few years, decision makers will look to projection models to estimate the likely costs and other effects of alternative policy proposals. Such models will require good data and a solid base of research knowledge with which to project demographic and economic trends and behavioral responses to policy changes and other factors. Outputs from analytical models of the behavior of individuals and employers that receive general consensus should be inputs to projection models. Otherwise, when policy proposals are expected to elicit a behavioral response, projection models will fall short in estimating the likely effects. In the preceding two chapters, we identified critical deficiencies in both data and research. As we discuss below, available projection models are also deficient in many respects. The question is how to allocate limited resources for developing better capabilities for retirement-income-related policy analysis. As we stress throughout this report, the U.S. Department of Labor (DOL) and other relevant agencies should give priority to effecting needed improve- ments in data and analytical research before making significant investments in broad-based projection models. Such investments will be misplaced until there is better understanding of key behaviors such as employer decisions about hiring and compensation policies and individual decisions about savings and consump- tion and better data with which to project trends in those behaviors and other factors. Until efforts to improve data and research bear fruit, we recommend that agencies make limited investments in special-purpose models that use the best available data to answer policy makers' specific questions. At the same time, agencies should consider the kinds of broad-based projec 132

DEVELOPMENT OF PROJECTION MODELS 133 lion modeling capabilities that it will be important to develop in the future. In this chapter, we address: types of projection models that could be useful; the dimensions of models that should be considered in planning for improved projec- tion capabilities; the capabilities and limitations of existing models; long-term issues in developing models of employers and employer behavior; long-term issues in developing improved individual-level microsimulation models; issues in developing validation capabilities in new and current models; and strategies for projection modeling in the near term with existing models or one-time spe- cial-purpose models (including improvements to the model maintained by the Social Security Office of the Actuary). CURRENT MODELS AND THEIR USES Types of Models Projection models vary widely in scope, complexity, and modeling strategy. Some of the major types are time-series models, cell-based models, microsimu- lation models, macroeconomic models, and computable general equilibrium mod- els. Some models incorporate multiple approaches: for example, in the Macro- economic-Demographic Model (MDM), a macroeconomic growth model interacts with cell-based models of population growth, family formation, the labor market, and other model components. Other models reflect primarily one approach but make use of the outputs of other kinds of techniques; see Appendix C for descriptions of some models. Times-Series Models Time-series models project aggregate historical data into the future on the basis of a function, which can be simple or complex, of the behavior of the data in previous periods. For example, future labor force partici- pation rates for older men can be projected to continue to decline on the basis of trend data from 1950 to 1995. However, the longer the projection period, the more hazardous it is to assume that past trends will continue unchanged into the future. Indeed, for the example just noted, there is currently debate about whether the trend toward early retirement among men has peaked. Cell-Based Models Cell-based models divide the population into groups, such as 5-year age categories by sex, and develop separate projections for each group for example, separate projections of life expectancies, retirement age probabilities, and so forth. The separate projections may be based on time-series analysis, modified by assumptions about the likely behavioral effects of policy changes (e.g., the effects on retirement age of reductions in initial Social Security benefits). Because a cell-based model reflects differences among groups, the aggregate outcome it projects may differ greatly from the outcome of a simple time-series projection for the population as a whole.

34 ASSESSING POLICIES FOR RETIREMENT INCOME The Office of the Actuary in the Social Security Administration (SSA) uses cell-based techniques to develop 75-year projections of the balance in the trust funds. Such techniques are also used in actuarial models of employer pension fund balances and in models used by the Census Bureau, SSA, and others to develop population projections. Microsimulation Models Microsimulation models use individual records for large samples of the population, estimating outcomes for each individual on the basis of that person's characteristics and aggregating the results. Such mod- els usually employ stochastic or probabilistic methods to generate distributions for such outcomes as retirement decisions. DYNASIM2 and PRISM are two dynamic microsimulation models that have been used to evaluate alternative Social Security and employer pension policies. They use dynamic aging techniques to project their samples forward in time. In dynamic aging, characteristics of persons in the sample are altered year by year through the application of transition probabilities to simulate who in the sample will live, die, marry, divorce, stay on the same job, take another job, retire, and so on. Policy simulations are then performed on the samples for 1 or more years. The transition probabilities may come from a variety of sources, including time- series analysis (e.g., for life expectancy) and behavioral analysis (using simple or complex analytical models). Thus, for example, a behavioral equation in DYNASIM2 relates labor force participation to age, race, sex, education, disabil- ity, marital status, presence of children, and spouse's earnings and includes a complex error term by which labor force participation changes for an individual are linked across time and with changes in hours and wages. Another microsimulation projection technique is called static aging, in which the characteristics of the sample are altered by adjusting the sample weights to reflect the expected distributions in the projection year before conducting policy simulations. This method is typically employed for short-range projections (e.g., of tax liabilities or welfare program participants in the next 5 years) and not for the longer range projections that are often required for retirement-income-related policy modeling. Microsimulation models can estimate the effects of more detailed policy changes for a larger number of groups than is possible with cell-based models. In addition, Burtless (1996) argues that microsimulation provides an organizing framework. It requires model developers and users to think through all of the behavioral links and interactions. It also helps them identify gaps in research knowledge and data and set priorities for filling in key gaps. However, micro- simulation models have costs as well as benefits, which must be evaluated vis-a- vis other modeling approaches in the context of users' information needs. In particular, they tend to be complex and difficult to use. Both microsimulation and cell-based models can be developed for organiza- tions (e.g., employers) as well as for individuals. However, with the exception of

DEVELOPMENT OF PROJECTION MODELS 135 the Pension Insurance Management System (PIMS) model recently developed for the Pension Benefit Guaranty Corporation (PBGC) (see Appendix C), we know of no policy projection models of retirement-income-related behavior of employers. Macroeconomic Forecasting Models Macroeconomic forecasting models are used to make projections for the U.S. economy. They use systems of simul- taneous equations, estimated with historical time series, to forecast the effects of aggregate factors, such as rising inflation or changes in the federal budget deficit, on aggregate outcomes, such as gross national product or unemployment. Computable General Equilibrium (CGE) Models CGE models estimate the longer run effects of proposed policy changes on prices and quantities in markets, taking into account behavioral responses and feedback effects between supply and demand. For example, an increase in marginal tax rates may decrease work effort, thereby leading to reduced labor supply, thereby leading to higher wage rates and increased labor supply until a new equilibrium is reached. As generally implemented, CGE models are macro rather than micro in nature, in that they are not usually disaggregated on many dimensions. However, unlike macroeco- nomic forecasting models, CGE models are based on explicit theories of the behavior of households and employers rather than on relationships among aggre- gates. Dimensions of Models There are three key dimensions on which to evaluate projection models: · the degree to which the model provides accurate estimates of policy out- comes, in that the estimates approximate what would occur if a proposed policy change were implemented; · the amount of information provided in the model output on the uncer- tainty of the model estimates and their sensitivity to key assumptions; and · the degree to which the model incorporates best current professional judg- ment about the underlying behavior for example, the best judgment about the appropriate functional form and parameter values with which to estimate changes in personal savings in response to changes in employer pensions or Social Secu- rity. Obviously, it is highly desirable that models be accurate, provide estimates of uncertainty and sensitivity to key assumptions, and incorporate best profes- sional judgment (or be able to use alternative specifications to reflect important disagreements). But determining the accuracy of model outputs is a difficult task at best. Also, models have to date rarely included the capabilities with which to

136 ASSESSING POLICIES FOR RETIREMENT INCOME estimate uncertainty, conduct sensitivity analyses, or incorporate alternative model specifications; consequently, little model validation work has been done. With recent advances in computing technology and statistical methods, it is now possible to effect significant improvements in model capabilities in these areas (see "Validation" below). Other dimensions of projection models include: · scope, in terms of the types of policies that a model can simulate and the kinds of outcome measures for which it can provide information; · detail, in terms of how well a model can represent all of the provisions of current law and estimate the effects of changes to one or more provisions; · extent of disaggregation of model outputs, that is, the extent to which the model can estimate the distributions of outcomes for groups of interest, variously defined, in addition to average tendencies; · time period, whether the model can develop projections for 5 years or 75 years; · extent of feedback loops, which permit estimating not only the first-order effects of a policy change, but also the second-order effects, taking into account behavioral responses to the new policy; · ability to link to other models, either by supplying outputs for other mod- els to use as inputs or by accepting other models' outputs as inputs; · amount of elapsed time that is required to obtain outputs from the model on one or more policy options, including options that are close variants of current policies and options that differ markedly in one or more respects; · extent of openness and transparency, that is, how usable and understand- able the model is to people other than the developers; · ease of use and modification as needed; and · extent of portability, from one computing hardware and software environ- ment to another. The first five dimensions listed above scope, detail, disaggregation, time period of projections, and feedback loops have to do with the breadth and depth of model capabilities. Other things being equal, it is desirable to have a model that can simulate a broad range of policy options and, for each option or combi- nation of options, estimate a broad range of outcomes. Other things being equal, it is also desirable to have a model that can evaluate proposals for fine tuning one or more aspects of complicated sets of program provisions, that can develop projections over long time horizons, and that can provide distributions of outputs for particular groups of interest in addition to population means. As an example, in considering a policy change that increases or decreases employer pension benefits, decision makers are likely to be interested in the effects not only for all workers, but also for workers in different industries, age groups, and earnings categories and by size-of-employer categories. Finally, a projection model is

DEVELOPMENT OF PROJECTION MODELS 137 more useful to the extent that it can estimate the second-round effects of policy changes, which, because of behavioral responses, are likely to differ from the initially estimated effects. For example, changes designed to increase employer pension coverage, if they also increase employer costs, could lead employers to opt out of pension plans and hence reduce coverage over time. Desirable though these dimensions are, there are costs to increasing the breadth and depth of model capabilities. For example, models that simulate a wide range of policy outcomes and options are likely to be time and resource intensive to develop and apply. Also, modeling of feedback effects is quite difficult, both operationally and because of the uncertainty of behavioral re- sponses to policy changes. Hence, it may be cost-effective to have different models for answering questions about particular types of policy changes, as well as different models for estimating means versus distributions, effects over shorter versus longer periods, and second-order as distinct from first-order effects. In- deed, the various types of modeling strategies listed earlier, such as cell-based and microsimulation approaches, reflect choices about how much and what kinds of detail to include in models. With a range of models, it becomes critical to consider ways to facilitate links between them (the sixth dimension listed). The last four dimensions listed response time, openness and transparency, ease of use, and portability have to do with operational features that determine how readily a model can be applied during the course of a policy debate, modi- fied as needed to incorporate additional capabilities, and validated and improved. These dimensions, which cut across model types, also have to do with how large a user community develops for a model, which, in turn, is a major factor in how widely a model is used and how rigorously it is evaluated. Features of models that facilitate a higher level of performance on these dimensions include adequate documentation, public availability of the model to analysts, a user-friendly interface that allows people who are not programmers to operate the model, and other design elements that permit ready modification of the model to simulate newly proposed policy options in a consistent manner and to facilitate multiple runs and specifications for model validation purposes. Many of these features (e.g., real-time response mode or a graphical, point-and-click user-friendly interface) appear more obtainable when a model is designed for microcomputer or workstation computing environments than when it is designed for mainframe or supercomputing environments. It clearly seems important for models to provide timely response and to be open, easy to use, and portable, whatever their type or breadth and depth of capabilities. However, the costs of achieving high levels of performance on these dimensions must also be considered. As an example, it is typically time and labor intensive to produce good documentation, although, with forethought, it is pos- sible to build features into models to facilitate the documentation process. Also, the nature of an agency's information requirements may dictate trade-offs be- tween, say, the ease of use of a model and its capabilities. For example, a

138 ASSESSING POLICIES FOR RETIREMENT INCOME particular set of applications may require computing power that is not available from microcomputers or workstations (e.g., CGE models may require super- computers to attain the level of desired detail of inputs and outputs). In this case, the only modeling option is to invest in a model that has limited access. An alternative is to implement a simpler version of the model on a microcomputer and a more elaborate version or a version that will process larger numbers of sample cases on a supercomputer. Assessment There are many projection models available today that relate in some way to retirement income security. Some of these models were developed for very narrowly targeted applications, such as actuarial cell-based models that are de- signed to project the balance of assets and liabilities for a specific company's pension plan. Although these kinds of models serve important purposes, we have not reviewed them because of their lack of generality. Also, our review does not extend to health care financing projection models, although such models are very important to a full assessment of retirement income security, nor to tax revenue models, such as that used by the U.S. Treasury Department.1 Rather, we have looked primarily at models that can address somewhat broader retirement-in- come-related policy questions, for example, the effects of tax policy changes on accumulations in Individual Retirement Accounts (IRAs) and pension accounts over people's working lives. Our overall conclusions about existing models are mixed. Some models appear to have useful features for some kinds of applications but, on balance, we conclude that existing models are not adequate to fully inform the retirement income policy debate. For some important topics, there are currently no available relevant models. Specifically, there are no models of employer behavior with regard to benefit offerings or demand for older workers that could provide input for individual- based models of labor supply and retirement asset accumulation. Rather, existing employer-based models address such issues as pension funding effects on share prices and employer pension policy within a corporate financial setting. The new PIMS microsimulation employer-based model has been developed for the PBGC, but it has a limited focus. PIMS estimates corporate bankruptcies among em- ployers with defined benefit pension plans on the basis of simulating key eco- nomic variables and employer characteristics. It also simulates plan participants and funding status. Finally, it simulates the implications of projected bankrupt 1See Office of Technology Assessment (1994) for an evaluation of models that were used in the recent health care reform policy debate.

DEVELOPMENT OF PROJECTION MODELS 139 cies of companies with underfunded plans for the PBGC fund that insures such plans. For individuals, there are microsimulation models that are broad in scope and provide the capability for detailed, disaggregated analysis. They could be very useful for retirement-income-related policy analysis, but they generally suf- fer from several deficiencies: they have not been adequately validated and are not designed to facilitate validation; they are based on old or limited data sets; they do not reflect best current professional judgment about underlying behavior; they have little facility for implementing alternative behavioral specifications when there are important disagreements; their provision for feedback loops be- tween policy changes and behavioral responses is weak; they generally have little provision for ready exchange of inputs and outputs with other models; they are not well documented; and they are not open and transparent or readily portable, although they now are generally implemented with personal computing technol- ogy. CGE models that have been developed to answer such questions as the intergenerational benefits and costs of alternative Social Security financing sys- tems are impressive in their treatment of the interactions of the household, gov- ernment, and private sectors of the economy and in their use of up-to-date infor- mation and research knowledge. However, these models deal primarily with "big picture" issues in a highly stylized manner and primarily for the Social Security system, not the much more heterogeneous world of employer pensions. Also, to date, these models have not addressed the limitations of the life-cycle model of savings and consumption (see Chapter 3), and they are usually structured as equilibrium models without unemployment, particularly long-term unemploy- ment, among older workers. These models do not provide distributional detail, nor can they address the effects of very specific policy changes or combinations of changes. Finally, there are as yet no links between these models and cell-based or microsimulation projection models that would provide additional specificity. A positive trend is that newer models generally follow design principles that encourage use and validation. For example, PIMS, the employer-based model, operates on personal computers and has a design that facilitates model validation. However, newer models, such as PIMS, SSASIM, the Solvency and Individual Retirement model (SIR), and the model under development by Wolf et al. (1995), tend to be limited in scope. SSASIM is a cell-based model similar in structure to the SSA actuarial cost model that uses Monte Carlo methods to characterize uncertainty about forecasts of Social Security trust fund balances. SIR is a cell- based model similar to the SSA actuarial cost model that provides limited distri- butional information for different types of workers. Wolf is developing a micro- simulation model primarily for simulating kinship networks and disability status of the elderly. Because individual-level microsimulation models potentially offer very im- portant capabilities for retirement-income-related policy analysis, we review as

140 ASSESSING POLICIES FOR RETIREMENT INCOME pects of two of the existing dynamic models DYNASIM2 and PRISM in Appendix D. The discussion focuses on some weaknesses of these models, not to single them out for criticism, but to support our case for important areas of future development for this type of model.2 LOOKING TO THE FUTURE We recommend priority attention to data collection and research and analytical modeling over the next few years rather than to significant investments in broad- based projection models. However, we believe it is important to establish goals for projection model development and, in some instances, to move forward on design work, so that implementation can progress expeditiously once invest- ments in data and research bear fruit. Specifically, the federal government should establish goals for the development of employer models (using cell-based or microsimulation techniques) and for a new general-purpose individual-level microsimulation model. Other retirement income modeling tools are needed as well, but it is clearly important to have the projection capabilities that employer models and a new individual-level microsimulation model can provide. Employer Models Employer hiring and compensation policies are critical to retirement income security, most obviously affecting the extent of pension coverage and benefits, but also affecting Social Security benefit levels and the opportunities to contrib- ute to such earnings-linked savings vehicles as IRAs. Employers also often provide other important benefits, such as retiree health care and disability insur- ance, as well as opportunities for part-time, post-retirement work. A full assess- ment of the likely effects of many retirement-income-related policy proposals will not be possible without considering what is happening to and with employ- ers. Shifts in employment across sectors that have historically differed in wage and benefit levels can affect retirement income security, as can employer re- sponses to changes in policies and other factors that alter incentives (e.g., pension simplification may increase the probability that small businesses will set up pension plans). Employer-based projection models are clearly needed for much retirement income policy analysis. In some instances, decision makers will want informa 2For other assessments of microsimulation models, see Citro and Hanushek (1991); the papers cited in Cohen (1991a); Employee Benefit Research Institute (1980a, 1980b, 1981); Reno (1993:29- 31); Ross (1991); and U.S. General Accounting Office (1986). Burtless (1989) and Grossman and Watts (1982) focus on behavioral elements of microsimulation models. Anderson (1996) and Hollenbeck (1995) review microsimulation and other projection models that are relevant for retire- ment income policy modeling.

DEVELOPMENT OF PROJECTION MODELS 141 lion about how many employers will take a certain action within a specified time period (e.g., set up a pension plan) and how many workers will be affected. Such information could most likely come directly from an employer model that in- cludes such characteristics as industry and size. In other instances, decision makers will want to know the longer term implications for retirement income flows (e.g., from pension accumulations) for different groups of workers; for this information it will most likely be necessary to produce outputs from employer models that can feed into individual-level models for workers and their families. At present, there are no models that can project the distribution of employer pension and related benefit offerings or the distribution of employer demand for workers. Moreover, there is little basis on which to develop such models in the near term: the needed data and research knowledge are simply lacking (see Chapters 3 and 4~. Resources spent on model development cannot be cost- effective until more complete data are available (e.g., about the distribution of employer benefits) and until there is much better understanding of the factors that influence hiring and compensation decisions for different kinds of employers. Gaining such understanding will require considerable elaboration and testing of alternative theories about employer behavior. The development of employer- based projection models is an important goal for the Department of Labor and other agencies, but one that can only be realized over the long term. It is important to keep this long-term model development goal in mind as decisions are made about investments in employer-related data and research. In this regard, it could be useful to consider alternative projection model designs and strategies and their requirements. One question is whether to start with a cell-based design that classifies employers on a limited number of important dimensions or to opt for a microsimulation model. A cell-based model would be easier to implement and have fewer data requirements than a microsimulation model, but it would have more limited capability for analysis and for simulation of behavioral response. It could also be useful to bring together researchers and people with experi- ence in developing projection models to consider possible model designs on the basis of anticipating what research results may show. For example, what would be the design and data requirements for a projection model if employer costs are taken to be the key factor in hiring and compensation policies? What would be the requirements if employee preferences are also deemed to be an important factor? As new data about employers become available and are analyzed, DOL could sponsor workshops to discuss research findings and their implications for the development of employer-based projection models. Clearly, to make progress toward the goal of model development, it will be important for DOL to give priority to the work of the interagency task force on employer data (see Recom- mendations 5-9, Chapter 4) and to putting an employer data collection plan on as fast a track as possible.

42 ASSESSING POLICIES FOR RETIREMENT INCOME Recommendation 14. The U.S. Department of Labor should consider the development of employer models as an important long-term goal. Retirement-income- related policy analysis requires models of employer behavior that can project the likely future distribution of benefit plan offerings and the demand for older workers. However, there is little basis on which to design, let alone implement, such models until investments have been made in needed data and behavioral research. Microsimulation Models of Individuals and Families Policy makers often want to know the effects of proposed policy changes on population groups, categorized by age, sex, race or ethnicity, income level, and the like. Cell-based models can provide distributional detail, but the cells must be defined in advance and are difficult to change in response to user needs. In principle, large-scale microsimulation is a much more flexible tool with which to provide detailed information about distributional effects, limited only by sample size constraints on the reliability of small cell estimates. Broad-based micro- simulation can also address important interactions among policies, such as how a change in Social Security benefits will likely affect benefits from employer pen- sion plans that are linked to Social Security. Dynamic microsimulation can in principle forecast the long-term effects of policy changes, including behavioral responses: for example, how a change in Social Security benefits will likely change retirement and savings behavior and, ultimately, affect retirement income levels for different cohorts of workers. We believe that a general-purpose, dynamic microsimulation model is needed to project the distributional effects of policy alternatives on the retirement income security of individuals and families over the medium term and for modeling interactions among policies. Other, more limited modeling tools are often appro- priate and cost-effective for specific policy questions. However, as Burtless (1996) argues, it is important to have the capability to address all three legs of the retirement income stool Social Security, employer pensions, and personal sav- ings in an integrated modeling framework that can provide needed distribu- tional detail. Such a modeling framework is also useful because it can help structure related analytical work. The effort to develop and apply a large-scale micro- simulation model will invariably identify behavioral interactions and processes that need to be better understood. It will also help determine which parameters are crucial for analysis and which are less important, and it can suggest how concepts and variables should be consistently defined and measured to be useful for modeling purposes. Unlike the situation for employer models, there are existing individual-level

DEVELOPMENT OF PROJECTION MODELS 143 microsimulation models notably DYNASIM2 and PRISM that have proven useful for analysis of previous pension and Social Security reform proposals (see Appendices C and D). However, these models use obsolete databases March Current Population Survey (CPS) files from the 1970s that were exactly matched with Social Security earnings records and obsolete behavioral parameters, such as retirement probabilities estimated with data from the old Retirement History Survey (RHS). More important, because they were originally implemented with costly and cumbersome mainframe computer technology, these models are not well structured to simulate the effects of policy changes on key behaviors, such as labor force participation. They are also not well developed in other respects (e.g., they do not handle personal savings in any detail).3 Rich new data sets, notably the Health and Retirement Survey and the Asset and Health Dynamics Among the Oldest Old survey (HAS/AHEAD), are begin- ning to provide data with which to reestimate behavioral parameters and address such critical issues as the differences in savings rates across and within cohorts. These new data and associated research can provide the basis for a new indi- vidual-level microsimulation model that can address a wide range of retirement- income-related policy issues. Relevant agencies should consider the develop- ment of such a model as an important long-term goal. We believe that it will prove more cost-effective in the long run to develop a new simulation model than to try to update existing models. It is not simply a question of updating existing models with new data and parameter values, but instead of significantly modifying and augmenting their capabilities in a number of ways. Starting fresh is likely to be a more fruitful strategy. However, elements of existing models could contribute to a new model. Specifically, the demo- graphic components (births, deaths, marriages, etc.) of existing models are gener- ally well developed and could form the basis of the demographic components of a new model with updated parameters. Also, the program accounting compo- nents of existing models for example, the calculation of Social Security ben- efits given an earnings history and the determination of eligibility for Supplemen- tal Security Income (SSI) are generally well specified and could likely be imported into a new model. We caution that it would be premature to begin now to construct a new individual-level microsimulation model. The priority need is for investments in data and research, and it would be difficult to invest in a large, complex model at the same time. Moreover, the results of new data collection and research may 3Another individual-level microsimulation model is CORSIM, which began as an adaptation of DYNASIM2 and has since been redesigned. Operating on both personal computers and super- computers, it provides detailed simulations of demographic histories. However, it has limited capa- bility for simulation of Social security or employer pension policies, and its asset accumulation modules lack behavioral components (see Appendix c).

44 ASSESSING POLICIES FOR RETIREMENT INCOME suggest a different design from the design that would be adopted for a model on the basis of current knowledge. Another reason for delaying construction of a new microsimulation model, while data and behavioral modeling issues are addressed, is that it will then likely be possible to build a new class of projection models that are significantly im- proved over simple extensions of existing models. This possibility exists because of advances in microcomputing technology (including personal computers and even more powerful workstations) that are providing much greater power at cheaper cost than in the past and that are bringing computing applications closer to the end user. These advances, in turn, have spurred the development of new methodologies that promise to improve important model functions that have often been neglected in the past, in particular: validation of model outputs (which includes sensitivity analysis and estimation of uncertainty); documentation of model components, parameters, and data; provision for multiple outputs, includ- ing outputs that alert the user to aspects of the model that are more or less robust; and other features that make a model more open and transparent, easy to use, and accessible to a wide range of users. Examples of relevant new methodologies include powerful computer-based statistical techniques for estimating the uncer- tainty in model outputs; graphical user interfaces that make it possible for ana- lysts to experiment more readily with policy options and alternative model speci- fications (e.g., varying estimates of parameter values); and hypertext language that can support the development of user-friendly, computer-searchable docu- mentation, which makes a model more accessible to users. These and other innovative approaches to important model functions are still evolving. Instead of trying to work out a model structure and design on the basis of today's technology and methods, which would lock in the use of technologies that may soon be superseded, it seems worthwhile to await further developments. When new data and research results become available with which to build new models, then choices can be made about how best to take advantage of the latest technological developments for improved model functionality and design.4 Of course, there will always be new knowledge and methods that make any current state of the art of projection modeling obsolete in one or more respects. We are not thereby arguing that new microsimulation model development should be deferred indefinitely. To the contrary, improved models are needed and should be developed. However, we do argue that, at the present time, the defi- ciencies in needed data and research are so great that resources must be concen- trated first and foremost on filling data and knowledge gaps before investing in model development. 4citro and Hanushek (l991:Chap. 6) discuss principles of microsimulation model design and development; cotton and Sadowsky (1991), Cohen (ladle), and Hollenbeck (1991) discuss, respec- tively, relevant advances in computing technology, use of new techniques for estimation of uncer- tainty, and requirements for model documentation.

DEVELOPMENT OF PROJECTION MODELS Model Requirements 145 We stress that construction of a new microsimulation model would be premature; however, consideration of the requirements for such a model could help structure relevant data collection and research. We assume that a new individual-level microsimulation model would be dynamic: that is, it would use dynamic aging techniques to project a broad range of behaviors and processes (see "Types of Models", above). Important characteristics for a new dynamic model include the capability to develop detailed projections in key areas, to simulate a wide range of relevant policies and behavioral interactions, to estimate the effects of health care coverage and costs on the distribution of retirement income, and to interact with employer-based models and with models that simulate longer run effects. Detailed Projection Capabilities A new individual-level microsimulation model requires detailed demographic, labor supply and retirement, pension cov- erage and benefits, and savings projection capabilities as integral components. Existing dynamic microsimulation models have well-developed demographic modules that could likely be adapted for a new model. For example, the mortality module in DYNASIM2 is richly specified: the mortality equation includes mari- tal status and education as determinant variables as well as age, race, and sex. It would be relatively straightforward to evaluate and modify, if necessary, the specific parameter values in existing demographic modules to reflect current best professional judgment. In contrast, the labor supply and retirement modules in existing models require major revamping for use in a new model. For example, the equations in DYNASIM2 that determine labor force participation, hours, and wages were derived from 1970s data in the Panel Study of Income Dynamics (PSID) and lack policy parameters. The equations for the retirement decision were estimated from the RHS completed in 1979; they include expected Social Security and pension benefits but not other personal savings, and they assume that disability status is exogenous. The hours of work equations in PRISM were estimated from matched CPS files that are now many years out of date; they primarily take the form of simple year-to-year transitions among hours categories. This type of equation is easy to implement, but, over time, its use will negate the very purpose of microsimulation, which is to preserve distributional detail. Such a formulation captures limited information about previous work patterns, so that it will not properly project the work choices of individuals at the extremes of the distribu- tion, such as people who rarely or always worked. Neither DYNASIM2 nor PRISM has a capability for labor supply and retirement decisions to interact with labor demand. The choice of an appropriate functional form and parameter values for equa- tions to include in new labor supply and retirement modules should look to the results of labor supply research with HAS/AHEAD. Such research will undoubt

146 ASSESSING POLICIES FOR RETIREMENT INCOME edly include further development of analytic dynamic programming models (see Chapter 3), which might serve as the labor supply and retirement decision com- ponent of a new microsimulation projection model. Because dynamic program- ming models require reestimating labor supply decisions at each time period in light of all available information (e.g., recalculated expected Social Security benefits), they are currently likely to be computationally intractable for projec- tion modeling purposes. However, if they became more tractable and could be extended in some important ways (e.g., to take account of pension benefits), they could well support a major advance in labor supply and retirement projection capabilities. The pension coverage and benefits components of existing models are also far from satisfactory for a new model. PRISM simulates more features of pen- sion plans and captures more distributional variation than DYNASIM2, by draw- ing on a sample of Summary Plan Descriptions filed with Form 5500s. However, this sample has not been updated since 1983, and neither PRISM nor DYNASIM2 reflects the important changes that have occurred in the distribution of types of pension plans since then. Ideally, given the frequency of changes in pension and other benefit offer- ings, the pension component of an individual-level microsimulation model would obtain inputs from an employer-based model of benefit distributions. Since it may take longer to develop sophisticated new employer-based projection models than new individual-level models, the pension component of a new individual- level model would necessarily be somewhat undeveloped, without full capability for dynamic projections. However, some improvements could be effected (both in a new model and in existing models). For example, data could be pieced together from HAS/AHEAD and other surveys to obtain an up-to-date baseline distribution of pension plan types for different age cohorts and other characteris- tics of workers. Also, a crude capability could be implemented to update the distribution of employer types by size and industry, on the basis of aggregate trends, to use to project job characteristics and pension plan features historically associated with different employers. In addition, a crude capability could be developed to project shifts in pension plan types within and among employer classes and to use the estimated distributions to assign pension coverage to work ers. Finally, the personal savings components of existing microsimulation mod- els are very undeveloped. DYNASIM2 and PRISM project IRA contributions but not other kinds of personal savings.5 Research with HAS/AHEAD and other sources will be required to provide clearer answers about the appropriate analyti 5The long-term care submodel of PRISM assigns asset portfolios to people when they reach age 65, and then in each year of the projection it makes a simple adjustment of housing asset values by an assumed inflation rate and of nonhousing asset values by an assumed constant rate of dissaving.

DEVELOPMENT OF PROJECTION MODELS 147 cat model of savings and consumption that can form the basis of a sophisticated projection module for personal savings. Policy Simulations and Behavioral Interactions A new individual-level microsimulation model should be able to simulate a wide range of policies that relate to the major sources of retirement income: Social Security, employer pensions, IRAs and other forms of personal savings, SSI and other relevant transfer programs, and post-retirement earnings. Moreover, a new model should be able to simulate behavioral responses that can alter the initial or first-round effects of policy changes. Although it is probably not feasible to simulate the equilibrium effects of a policy change, it should be possible to simulate second- round effects that are consequent on important behavioral responses to a pro- posed policy. Existing dynamic models, in order to reduce computational burdens, have adopted a strategy in which demographic and employment life histories are dy- namically generated and only then used as the basis for retirement-income-re- lated policy simulations. (Alternatively, demographic and labor force transition equations do not include such policy variables as expected Social Security or pension benefits.) This strategy greatly constrains the capability to simulate behavioral responses to policy changes or other factors not included at the stage of generating the life histories (see Burtless, 1989, on this point). We believe that a new model needs to be more dynamic in this respect: that is, it should include the capability for policies to interact with demographic and employment behavior (e.g., simulating the effect on labor force participation of younger workers of significant changes in Social Security or employer pension provisions). Simi- larly, a new model should have a capability for policies to interact with savings and consumption behavior. Health Care Coverage and Costs Health care expenditures not covered by insurance (including premiums) are a potential major drain on retirement income, but simulating the effects of health care policy initiatives on medical care needs and expenditures is a daunting task. It would simply be infeasible to try to incorporate a health care policy simulation model into a new individual-level retirement income model. However, a new retirement model could usefully include a component perhaps off-line from the main model that estimates the portion of retirement income that remains after expected health care expenditures for different groups of the elderly population. This component would distribute out-of-pocket medical care expenditures among the elderly to agree with out-of- model projections of overall health care costs and the extent of insurance cover- age under different health care policy proposals. Interactions with Other Models A new individual-level microsimulation model should be able to exchange inputs and outputs with other types of models.

148 ASSESSING POLICIES FOR RETIREMENT INCOME Specifically, a new model should accept distributions of jobs and benefit offer- ings from employer-based models of labor demand and compensation policies to use in simulating the labor supply and pension benefits of workers. In this way, the time path of labor force participation, pension accumulation, and other per- sonal saving can reflect changed behavior by employers. An individual-level model cannot make use of inputs of jobs and benefit offerings from employer-based models until such models are available, but the design of the individual-level model should facilitate easy future implementation of links with employer models. Whether the design should allow for repeated iterations between worker and employer behavior is a difficult question. Feed- back loops between labor supply and demand, and the consequent effects for pension benefits and other savings, are desirable analytically. However, there may be problems of implementation and difficulty in interpreting results. A new individual-level model should also have a capability to interact with longer run effects models, such as macroeconomic models and CGE models. Burtless (1996) outlines a relatively straightforward macroeconomic model that could accept outputs from a dynamic microsimulation model to use to estimate the effects on gross output and national savings of policy changes that affect Social Security taxes and benefits, employer pensions, and other personal sav- ings. He notes that current projections of the Social Security and Medicare trust fund (from cell-based models) ignore the likelihood that trust fund shortfalls may adversely affect national saving and investment due to the crowding-out effect of increasing the federal budget deficit. In turn, such effects on saving could de- press real wage growth and thereby tax collections for Social Security and Medi- care, as well as pension accumulations and other savings. Burtless (1996) also notes the importance for long-term retirement income security projections of allowing important variables such as inflation; unem- ployment; market rates of return on housing, stocks, and other assets; and birth and death rates to exhibit cyclical variation. Existing dynamic microsimulation models usually calibrate their year-to-year simulations to match aggregate pro- jections of births and deaths, employment and unemployment, wage growth, interest rates, and inflation. Most often, they use aggregate values from the intermediate trust fund projections of the SSA Office of the Actuary. However, the latter projections assume that critical economic and demographic variables eventually attain stable values, making no allowance for sudden and unexpected swings (e.g., the intermediate projections assume no economic recessions over a 75-year period). While an assumption of convergence to stable values may be acceptable for aggregate projections, it is much less so for projections in which distributional outcomes within and across cohorts of workers and retirees are of interest. Hence, a new individual-level microsimulation model should have a capability to incor- porate cyclical (or random) variations in key aggregates. Such a capability is particularly important in light of policy interest in plac

DEVELOPMENT OF PROJECTION MODELS 149 ing more responsibility for retirement and health care planning on individuals. Policy changes in this mode include proposals to increase incentives for defined contribution employer pension plans and other personal savings, to allow tax- advantaged medical savings accounts, and to privatize all or part of Social Secu- rity collections. Such policies increase the potential for unexpected gains and losses in retirement income security for particular age cohorts and other groups due to such factors as variation in price inflation, ups and downs in the economy, and stock market booms and busts. Database To Be Used The choice of a primary database on which simulations are performed is critical to the development of a new individual-level microsimulation model with a full range of capabilities. Existing dynamic models use such databases as a 1973 exact-match of the March CPS and SSA earnings histories (DYNASIM2~; a 1978 exact-match of the March CPS and SSA earnings histories, which in turn is exactly matched to the March 1979 CPS and the May 1979 CPS pension supple- ment (PRISM); and decennial census public-use microdata sample files (CORSIM). A model under development by Wolf and his colleagues (Wolf et al., 1995), which focuses on kinship networks and disability status rather than retirement income security per se, uses the 1987-1988 National Survey of Fami- lies and Households as its primary database (see Appendix C). Existing models also draw on a variety of other data for such inputs as transition probabilities (to age or project the database records forward in time) and control totals (to cali- brate the projections on key aggregates). The choice of an exact-match March CPS-SSA file as the database for DYNASIM2 and PRISM was to obtain employment and income information for a large population sample linked with earnings histories to use in calculating Social Security and employer pension benefits. However, the lack of a more recent public-use exact-match file is clearly a problem for these models, which must simulate about 20 years of historical data before they can begin projecting into the future. Aggregate information, such as employment and wage rates, can be used to keep the simulations in line with actual historical experience for a handful of key variables, but there is no way to ensure that the models accurately reflect the interrelationships among all of the important variables for that 20-year period. Even if a new public-use exact-match March CPS-SSA file is developed, however, we question the desirability of continuing to use such a file as the primary database for a new retirement-income-related dynamic microsimulation model. The advantages of the March CPS as a modeling database are that it is a very large sample of the entire population (60,000 households, recently cut back to 50,000), which permits long-term projections, and that it has extensive infor- mation on family characteristics, employment, and income. At the same time, it

150 ASSESSING POLICIES FOR RETIREMENT INCOME has several disadvantages in terms of data content and quality. It has no informa- tion on consumption or on asset holdings (except for a home and what can be inferred from reports of interest and other asset income), limited information on disability status and health insurance coverage, very little information on sample members' employers, and no information on such variables as retirement plans and expectations that research may determine are important to include in labor supply and savings equations. Information on pension coverage is collected in a different supplement that is fielded every 5 years and for which the sample only partially overlaps with the March sample. In addition, there are problems with the quality of the income information, including high nonresponse rates and significant underreporting of income amounts for sources other than earnings. As we argue with regard to such model functions as validation and documen- tation, it makes sense to wait to consider the choice of a primary database until new data and research results can support the development of a new generation of models. Some of the possible alternatives that are now on the horizon and may then be worth investigating include the PSID, the Survey of Income and Program Participation (SIPP), and extracts of HAS/AHEAD and other panel surveys. Panel Study of Income Dynamics The PSID has a rich set of variables and covers the entire age range of the population. However, it has a limited sample size (9,000 households), and it would need to be matched with Social Security earnings records. Survey of Income and Program Participation Beginning in 1996, SIPP will have an expanded sample size of about 37,000 households; a new panel will be introduced every 4 years. SIPP obtains extensive information on a 4-month, annual, or biannual basis for income and assets, health and disability status, use of health care and health insurance coverage, pension plan coverage and retire- ment expectations, and family and work histories. Also, evidence indicates that SIPP obtains more complete reporting for most income sources than the March CPS (see, e.g., Coder and Scoon-Rogers, 1994~. In the past, nonresponse rates to questions on asset values have been high in SIPP, but, beginning in 1996, SIPP will use the HAS/AHEAD bracketing technique for assets and interest income, which should increase response. SIPP has limited information on sample mem- bers' employers; a supplement is being tested that will ask employers to provide information about the employee's pension and health insurance plans and other fringe benefits, but funds have not been provided for the supplement. Further improvements would need to be made to the SIPP questionnaire to make it suitable for use as a database for a new individual-level dynamic micro- simulation model. For example, assuming that the HAS/AHEAD questions on subjective probabilities (e.g., of one's own life expectancy) prove to have analyti- cal power, similar questions would need to be added to SIPP. The modular design of SIPP, with a core income and employment module and a series of

DEVELOPMENT OF PROJECTION MODELS 151 rotating topical modules, lends itself to such additions. In contrast, it is very difficult to add questions to the March CPS income supplement because of the overriding need to keep the supplement from impairing response to the main labor force survey (which is the source of the official monthly unemployment statistics). In practice, however, SIPP has not proven to be as flexible to modify as its design suggests. A major constraint on questionnaire changes and experimenta- tion has been the concern of the Census Bureau to standardize data collection and processing procedures for a large survey. This situation could change in the future as agencies seek to use SIPP for more kinds of policy analysis: for example, DOL is considering a transfer of funds from the CPS to SIPP for an improved supplement on pension coverage, and the SIPP questionnaire is being altered to make it more suitable for poverty measurement. Finally, SIPP would need to be matched with Social Security earnings records. SSA is currently using exact matches of the 1984 and 1990 SIPP panels with SSA data for limited kinds of retirement income modeling (see "Near-Term Modeling Strategies," below); however, these files are not available to research- ers outside SSA. Plans are under way to append year of death and type of Social Security benefit from SSA records to SIPP files for public release, but there are no plans as yet to make available SIPP files with linked SSA earnings data. Extract of HAS/AHEAD and Other Panel Surveys Data from HRS/ AHEAD and other panel surveys (for age groups that HAS/AHEAD does not cover), linked with SSA earnings variables, could be used for microsimulation modeling. The major advantage of HAS/AHEAD as a primary database is that it will be used for much of the analytical modeling that will presumably underlie the behavioral components of a new projection model. Researchers will mine the rich data in HAS/AHEAD, including previously unavailable information (e.g., expectations, detailed descriptions of pension plans), to develop improved ana- lytical models of retirement-income-related behavior. Ideally, a projection model should have available the same rich data for performing simulations as were used analytically. The projection model should not have to fall back on behavioral parameters that, of necessity (due to the limitations of the primary database), have been reestimated with fewer input variables than were incorporated in state- of-the-art research. The principal drawback to the use of HAS/AHEAD for simulation purposes is that it does not contain information for people younger than age 51 (except for some spouses of sample members). Because people may retire as young as ages 55-60, this limitation restricts the time horizon of projections to 5-10 years (al- though time horizons can be longer for projections that pertain to older people, say, those aged 65 and older or 70 and older). A possible solution to this problem is to merge other data sets with HAS/AHEAD. The new and filler HAS/AHEAD cohorts will be introduced in 1998-1999, and first-wave data should be available

152 ASSESSING POLICIES FOR RETIREMENT INCOME for them by 1999-2000. By merging data from these cohorts with data from the original HAS/AHEAD cohorts, the National Longitudinal Survey of Youth (NLSY), and NLS-Young Women, it would be possible to obtain a sizable data- base for women aged 33-40 and 44 and older and for men aged 33-40 and 51 and older in 1998 (see Figure 4-1 in Chapter 4~. Assuming that the missing years could be filled in by some procedure (which would be relatively easy for women and more difficult for men, given a larger missing age range), the resulting database would permit projections of retirement behavior for 20 years or more into the future. (A 20-year projection could estimate retirement status and in- come security for people aged 53 and older in 2018; a 30-year projection could provide the same estimates for people aged 63 and older in 2028.) If HRS/ AHEAD is to be merged with other panel surveys for projection modeling pur- poses, it will be critically important to review the survey questionnaires to iden- tify ways to make them as compatible as possible. Design Considerations Each of the requirements and features discussed above stretches the capabilities of existing dynamic microsimulation models. Considerable thought will be needed to determine potentially useful approaches for implementation: for ex- ample, how best to capture the inherent heterogeneity of employer and worker behavior in linkages between individual and employer-based models. While agencies should remain focused on data collection and basic research in the near term and not work on the development of new large-scale projection models, some continuing discussions about design could help pave the way for the devel- opment of the next generation of models. Such discussions could help rationalize individual data collection and analytical modeling efforts by indicating priorities for projection modeling and raising issues that relate to the feasibility of imple- menting data and research results in new projection models. Discussions about model design issues should involve multiple agencies, including DOL and SSA, and people from both the academic and projection modeling communities. Analysts and modelers have had little communication in the past, and links must be forged. A useful, low-cost activity could be to involve microsimulation modelers in the workshops on early results that are being held to present findings from HAS/AHEAD. They could be asked to comment on re- search papers from the perspective of the feasibility of using analytical models and research results in a microsimulation context. Microsimulation modelers could also contribute to discussions among agency staff and the principal in- vestigators of the major individual-level panel surveys on changes and improve- ments in their content and design.

DEVELOPMENT OF PROJECTION MODELS 153 Recommendation Large-scale, general-purpose microsimulation in principle is a useful technique for projecting the distributional effects of policy alternatives on the retirement income security of individuals and families and for modeling interactions among policies (e.g., between Social Security or Medicare and employer benefits). How- ever, needed data and estimates of key behavioral parameters are lacking for a useful model, and existing models do not provide an adequate structure on which to build an improved model. 15. The relevant federal agencies should consider the development of a new integrated, individual-level microsimulation model for retirement-in- come-related policy analysis as an important long-term goal, but construc- tion of such a model would be premature until advances are made in data, research knowledge, and computational methods. Some continuing design discussions could help structure relevant data collection and research. Some elements of existing projection models might contribute to a new model. VALIDATION Validation is important for the data sources that are used for retirement-income- related analytical and projection modeling and for the models themselves. This section discusses validation issues for projection models. In view of the long time periods for which projections are often required and the range and complex- ity of policies and behaviors that are typically simulated, it is critically important to devote a significant portion of available projection modeling resources to validation activities. Validation Goals and Methods As we indicate above, projection models of whatever type and range of capa- bilities should: (1) provide accurate estimates of policy outcomes; (2) provide as outputs information on the uncertainty of model estimates and their sensitivity to key assumptions; and (3) incorporate best current professional judgment about the underlying behaviors. These are very stringent criteria. Indeed, for the first criterion, one might ask how one can know whether a model's estimates are accurate when the policy change has not yet happened and may never happen (see Citro and Hanushek, 1991:72-78; Chap. 9~. Clearly, one cannot know at the time a model is estimating the effects of a proposed policy change that the estimates are, in fact, accurate. However, one can conduct a series of ex post external validation studies to establish a degree of confidence in a model's track record. Sometimes it is possible to compare a model' s estimates with what actually occurred. More commonly, it is necessary

154 ASSESSING POLICIES FOR RETIREMENT INCOME to conduct "ex post forecasting" studies, in which one puts oneself in the place of an analyst who, say, 10 or 20 years ago, was asked to simulate a policy change to take effect in some future year. One chooses the "future" year to be in the recent past so that measures of what happened are available from administrative records or other sources. Correspondingly, one chooses the policy change to be the actual policy in effect during the comparison year. (An alternative approach is to use a current model to develop "backcasted" estimates of policies for earlier years.) Because differences between a model' s estimates and what actually occurred may be due to economic or social changes that the model could not have been expected to forecast (e.g., an unanticipated recession or an unexpected increase or decrease in fertility rates), it will be helpful to conduct ex post forecasting (or backcasting) studies that use actual values for the factors that could not have been anticipated and focus on the accuracy of the model with regard to other elements. Also, in many instances, the important criterion will be whether the model accu- rately projected the differences in the outcomes under two or more policy alterna- tives and not the more stringent criterion of whether it accurately projected the levels of outcomes for a specific alternative. In addition to studies that help establish confidence in a model's track record, it is important to develop estimates of the range of uncertainty of a model's estimates. On average, a model may produce unbiased estimates, but the results of any particular model run may have a wide band of uncertainty around them. The sources of error in model simulations can be summarized in four broad categories: (1) sampling variability in the input database, which is only one of a family of possible data sets that could have been obtained; (2) sampling variabil- ity in other inputs, such as imputations, regression coefficients, and control totals; (3) errors in the database and other inputs; and (4) errors due to model . . ^. . m~sspecl~lcatlon. Advances in statistical methodology that take advantage of modern comput- ing power have made it possible to estimate the uncertainty in model estimates due to such sources as sampling variations in the input data. One such technique is the bootstrap (see Cohen, l991c), which, simply put, measures variability by using the observed sample distribution in place of the unobserved population distribution. The strength of this approach is that variance estimation becomes a function of computing power rather than an exercise in solving multidimensional integrations for complex estimators and distributions. For complex models, like dynamic microsimulation models, sample reuse techniques can be used to measure the uncertainty in a model's estimates due to the two kinds of sampling variability. However, the uncertainty due to the two kinds of errors may be difficult to put into a probabilistic context. Sensitivity analysis, which is carried out by developing and running one or more alternative versions of one or more model components, can help assess the uncertainty in model estimates due to the choice of a particular model structure or specification.

DEVELOPMENT OF PROJECTION MODELS 155 For example, one could use sensitivity analysis to investigate the effects of a particular equation for simulating the labor supply response to a change in the retirement age for Social Security or employer pensions. In its simplest terms, sensitivity analysis is a diagnostic tool for ascertaining which parts of an overall model could have the largest impact on results and therefore are the most impor- tant to scrutinize for potential errors that could be reduced or eliminated (see Citro and Hanushek, 1991:89-96; Chap. 9~. Sensitivity analysis is particularly important for retirement income modeling in view of the long time horizon for most projections, which will compound any errors in model specification or assumptions. For various reasons, external validation studies, estimation of uncertainty, and sensitivity analysis have rarely been carried out for projection models.6 His- torically, one reason was that such studies were very costly and time-consuming. However, if models are implemented in computing environments that provide ready, cost-effective access to users for multiple runs and model respecifications, then validation becomes very feasible. Another reason there are few estimates of uncertainty is that decision makers usually want only point estimates, without measures of uncertainty. It is impor- tant, therefore, to develop innovative ways to convey estimates of uncertainty that decision makers will take into account in debates about policy alternatives (see Citro and Hanushek, 1991:86-88~. This task may be easier for retirement- income-related policy projections because decision makers have somewhat more experience with estimates of uncertainty for such projections than for estimates on other topics. Thus, because of the long time horizon, the SSA Office of the Actuary routinely presents high-cost, intermediate, and low-cost scenarios for the balance in the Social Security trust funds under current law. However, this approach to bounding uncertainty is problematic (see "SSA Model Validation," below). Moreover, there is no comparable record of regularly projecting the long-term funding status of employer pensions, even though this source of retire- ment income may be more uncertain than Social Security (see Burtless, 1993; also, Schieber and Shoven, 1993, develop such a projection for defined benefit and defined contribution plans). Finally, an essential feature of an acceptable projection model is that it incorporate best professional judgment about relevant factors, particularly for aspects of the model to which estimates are sensitive. "Best" professional judg- ment is not always the newest theory or latest academic model, which may be too new to have been properly evaluated. Also, because there is a range of profes- sional opinion in many areas, it will be important to conduct sensitivity analyses of alternative specifications before selecting a particular approach to use for a 6See Cohen (1991a) for a review of validation studies that have been done of social welfare policy analysis microsimulation models, including DYNASIM2 and PRISM.

156 ASSESSING POLICIES FOR RETIREMENT INCOME central or base case. Yet it is clear that when important changes have occurred- for example, in worker productivity or savings rates or mortality rates a model that uses data or estimates of behavioral parameters that do not reflect those changes will not be helpful in informing the policy debate. Also, given that relatively little work has been done to date to conduct external validation studies or sensitivity analyses of models, the extent to which a model incorporates best professional judgment is often the only indicator of the likely level of model performance. SSA Model Validation The Social Security actuarial cost model, which uses cell-based methods to carry forward the results of time-series analysis, is an exception to our statement that model validation and assessment of uncertainty are rarely performed. The Office of the Actuary's 75-year projections of trust fund balances under current law, which are revised every year, are often evaluated by comparing them to what actually occurred in the short run and by looking at the patterns of yearly revision (see, e.g., Schieber and Shoven, 1993~. Also, the Office of the Actuary routinely prepares trust fund balance projections for low-cost, intermediate, and high-cost scenarios, thereby providing a type of sensitivity analysis (similar to that pro- vided for Census Bureau population projections). SSA also provides sensitivity analyses of the separate effects of key assumptions. In these analyses, the high- cost, intermediate, and low-cost values are used for one assumption, such as fertility, while the intermediate values are used for all other assumptions. However, as Burtless (1993) points out, the assumptions underlying the low- cost and high-cost scenarios are not chosen so as to bound the likely range of potential experience in any manner that approximates a statistical confidence interval. Hence, they are less useful to policy makers, analysts, and the public because they do not indicate the probability of a best-case or worst-case outcome relative to the intermediate projection. Also, the assumptions may be internally inconsistent. For example, the growth rate of real wages (for which a higher value is a favorable outcome for the Social Security system) may be negatively correlated with real interest rates (for which a higher value is also a favorable outcome for the system). Finally, key assumptions of the central or intermediate scenario may not always reflect current best professional judgment. For example, the mortality forecasts developed by SSA are at one extreme of the range of available fore- casts: they represent a more pronounced slowing of the rates of mortality decline observed in this century than is forecast by others (see Lee and Skinner, 1996~. The implication of the SSA forecasts is that people will not live as long as forecast by others and, hence, that the growth in the elderly population, particu- larly of the oldest old (those aged 85 and older), will be less pronounced. Conse- quently, the use of the SSA forecasts, even from the high-cost scenario, could

DEVELOPMENT OF PROJECTION MODELS 157 result in overly optimistic assessments of the ability of employer pensions or personal savings to provide adequate retirement income support over retirees' life spans. The use of the SSA forecasts is also problematic for assessments of the Social Security system, even though the relevant population estimate is not the number of elderly per se but the old-age dependency ratio (people aged 65 and older as a ratio of people aged 20-64~. While a lower mortality rate than that forecast by SSA will increase income to the system (more younger people will be alive to contribute payroll taxes), it will also increase payouts from the system, and the relative increase in outgo will exceed the relative increase in taxable payrolls (see Board of Trustees [OASDIi, 1994: 132-133~. Lee and Carter (1992) provide a credible alternative to the SSA mortality forecasts. Their method essentially projects into the future the rate of mortality decline observed over the twentieth century, which has been fairly steady despite periods of faster and slower progress. SSA's method extrapolates cause-specific death rates over the preceding 20 years, which necessarily results in a marked slowing in the rate of mortality decline, as the more rapidly declining causes of death come to claim a smaller and smaller share of total mortality. Of course, it is impossible to know which of the two methods will ultimately prove more accurate. A major advantage of the Lee and Carter approach, noted by Lee and Skinner (1996), is that it provides probability intervals describing the uncertainty in their extrapolative method. However, the Lee and Carter estimates of uncertainty do not allow for model specification error or the possibility of "structural breaks" in time series, such as the sharp decline in age-specific mortality rates that occurred between the nineteenth and twentieth centuries. Hence, Lee and Carter probably underestimate the extent of uncertainty in their forecasts of mortality rates. Recently, Lee and Tuljapurkar (1994) extended the methods of Lee and Carter to develop stochastic population forecasts with consistent and meaningful probability intervals for all items forecast (e.g., age categories of the population, taking account of all demographic processes). Lee and Tuljapurkar formulated models of age-specific fertility (from Lee, 1993) and age-specific mortality (from Lee and Carter, 1992), in which they estimated a single time-varying parameter for each process by using standard time-series analysis techniques. (Net immi- gration could also be handled in the same manner, but it was treated as determin- istic at the level assumed in the Census Bureau projections.) They then used these fitted models of stochastically varying rates as inputs to the population renewal process and carefully tracked the propagation and sources of error. They used analytic methods to find both the expected value of the forecast and the probability intervals. Similar results could be obtained with Monte Carlo methods, that is, by repeated stochastic simulations using the stochastic models of vital rates. The SSASIM model, recently developed with support from SSA, uses Monte Carlo techniques to characterize uncertainty in the Social Security trust fund balance

158 ASSESSING POLICIES FOR RETIREMENT INCOME forecasts (Holmer, 1995a, 1995b; see also Sze, 1995, and Appendix C). SSASIM implements stochastic processes for all of the key demographic and economic variables that are used in the SSA actuarial cost model in a general framework that can accommodate different expectations (specified by the user) about the dynamics of each variable over time and in association with other variables. For example, a policy analyst can assume that such variables as rates of mortality decline follow a consistent trajectory over the span of the projection period (as does the SSA actuarial cost model) or that the time trend will exhibit cyclical fluctuations. The analyst's assumptions are used to specify the means, standard deviations, and correlation coefficients of a multivariate normal distribution: multiple runs with different random draws from the distribution for a particular set of assumptions generate the expected values of variables and the probability bounds. The use of scenarios by SSA and the Census Bureau to convey uncertainty about forecasts stands in contrast to the use of probabilistic methods. To build scenarios, an analyst develops one forecast on the basis of the "expected" trajec- tories for each demographic or economic process. The analyst then also develops alternative trajectories for each process above and below the expected trajectories and combines them in various ways to produce high and low forecasts with which to bound the intermediate or expected forecast. For example, because trust fund balances are influenced by the old-age dependency ratio, SSA combines low fertility with low mortality for its high-cost scenario. The Census Bureau, on the other hand, combines high fertility with low mortality into a high scenario for total population growth. As a consequence, the Census Bureau's high and low estimates of the elderly population cover a wider band than the SSA estimates, while the Census Bureau's high and low estimates of the old-age dependency ratio cover a narrower band. The SSA high- and low-cost scenarios for the population aged 65 and older and the old-age dependency ratio cover a somewhat narrower band than the 95 percent probability intervals generated by Lee and Tuljapurkar (1994~. One could use the Lee and Tuljapurkar upper and lower bounds to produce new population scenarios. However, as Lee and Skinner (1996) point out, the prob- lem with any method of constructing scenarios is that there is no means by which to attach probability intervals to the resulting forecasts. The scenario approach assumes unrealistically that the high, medium, or low trajectory for each de- mographic process is followed consistently over the entire time span. In contrast, the stochastic forecasts of Lee and Tuljapurkar treat fertility and mortality as random variables at every iteration, producing true probability bounds for the projections at each period (e.g., each 5-year interval). Although stochastic methods may underestimate the uncertainty in demo- graphic and economic forecasts, depending on how the parameters of the stochas- tic process are estimated, they represent a major step forward over the scenario approach for conveying estimates of uncertainty to policy makers. The use of

DEVELOPMENT OF PROJECTION MODELS 159 stochastic methods to estimate the uncertainty of projections of demographic and economic variables seems imperative for the kinds of long-range forecasts that are often used in retirement-income-related policy debates. In this regard, it would be very useful to further develop the capabilities of the SSASIM model for estimating the uncertainty of the projections in the SSA model, including work on alternative methods with which to estimate the parameters of the stochastic pro cess. Recommendation 16. Models that project policy outcomes over long periods should rou- tinely assess the accuracy and uncertainty of the projections and report prediction errors in previous forecasts. To be able to evaluate a model's projections in a cost-effective manner, the model design must explicitly in- clude a capability for sensitivity analysis (e.g., how much the outputs vary with alternative assumptions and versions of model components) and a capa- bility to estimate uncertainty from such sources as sampling variability in the database and other model inputs. NEAR-TERM MODELING STRATEGIES The kinds of investments we recommend in data and research will take time to return benefits for retirement-income-related policy analysis. In addition, the development of needed employer models and a new large-scale individual-level microsimulation model, which must await advances in data and research knowl- edge, cannot be completed in the near term. In the meantime, the Department of Labor and other agencies must be able to answer decision makers' questions about the likely effects of alternative policies. They cannot respond that they are unable to provide requested information because data and knowledge are lacking with which to develop improved projection models. In order to be responsive to decision makers' legitimate information de- mands without incurring significant model development costs that could well be misdirected, agencies should try to frame policy questions as specifically as possible and use limited special-purpose models together with the best available data and research knowledge to answer those questions. Depending on the issue, a special-purpose model may already exist, or, more likely, it will be necessary to adapt one from another model or to construct one on the spot. Admittedly, limited ad hoc models may not provide very reliable answers to policy questions, and, in particular, they are not likely to be able to handle important policy interactions. Also, special-purpose models may be able to model behavioral responses to policy changes only very crudely, although they should incorporate such responses when reasonable parameters are available and feasible to implement. However, the alternative of trying at this time to develop

160 ASSESSING POLICIES FOR RETIREMENT INCOME more complex models, when needed data and research knowledge are lacking, will not likely give a better result and will very likely waste scarce resources. We acknowledge that a phased approach to retirement income modeling, as we recommend, may not provide useful answers before there is actually a retire- ment income crisis that requires extensive evaluation of alternative policy pro- posals in a comprehensive framework. However, there will be continuing need for retirement-income-related modeling capabilities (as is true for health care policy), and without investments in data and behavioral research, government agencies will never be in a position to develop markedly improved projection models. (The health care agencies have recognized this point by redesigning and expanding their surveys.) Again, we are concerned about the use of scarce resources in the near term. We believe they are best used for data and research instead of extensive development (or redevelopment) of large-scale models. To the extent that limited investments can lead to better special-purpose models for specific questions, we support those investments. The CBO Approach The modus operandi that we have in mind is that of the Congressional Budget Office (CBO), whose analysts regularly prepare ad hoc, special-purpose models, often using spreadsheet tools, to develop estimates of the likely costs and other effects of proposed policy changes. CBO staff combine data and parameter estimates in their ad hoc models from a wide range of sources, sometimes includ- ing as inputs the outputs of larger, more formal projection models with the results adjusted as seems appropriate. Sometimes they make additional "out-of-model" adjustments, for example, to provide crude estimates of the likely effects of behavioral response to policy changes. (See Citro and Hanushek, 1991:41-51, for a case study of the modeling tools used by CBO and the U.S. Department of Health and Human Services in the late 1980s for estimating the effects of welfare reform.) In general, CBO staff develop their models only to the point to which they can answer the question at hand. For example, projections for a few years might be developed with distributional detail, while longer term projections are devel- oped on a very aggregate basis. Or a model might be developed to handle only the particular policy change that is under consideration. However, in instances in which the same kind of question arises frequently, CBO will develop more for- mal models that are intended for repeated use and that can handle a wider range of variations in a particular policy area. As just one example, CBO developed a hospital-based model that is designed to estimate the cost and distributional effects of changing various provisions of the Medicare prospective payment scheme used to reimburse hospital costs. The problem with an ad hoc approach to model building is that there is virtually no incentive to provide for such desirable model features as openness or

DEVELOPMENT OF PROJECTION MODELS 161 portability or to document a model for use by others. More important, there is no incentive to perform more than the crudest type of model validation. However, the strategy of special-purpose model development, when carried out by knowl- edgeable analysts, can provide answers to specific policy questions with mini- mum staff resources and time. The answers will very likely be limited and of highly variable quality, but they can assist the policy debate, and the process of developing them can identify issues to consider in the design of new, improved large-scale models for use in future policy projections. The problem for retirement-income-related projection modeling for the ex- ecutive branch in the near term (and in the long term, too) is that there is no equivalent of CBO in a cabinet department. CBO will itself prepare estimates for retirement-income related policy proposals, but the executive branch needs its own modeling capability, particularly when proposals are being drafted and have not yet been sent to the Congress. There is no staff group in the executive branch that is in a position to take a broad view of policy analysis in this area, much less to make effective use of available data, research, and modeling tools to develop the best possible estimates in the most direct manner in response to decision makers' questions. There are clusters of analysts in such agencies as DOL and SSA, but it would be helpful to have a larger critical mass working together on a regular basis. For example, SSA analysts are currently working on models of the distribu- tion of Social Security benefits by using matched SSA earnings records and SIPP data. The SSA analysts initially developed models with which to project earnings histories for 1984-1993 for members of the 1984 SIPP panel on the basis of prior earnings histories, and they obtained reasonable results. They then used these models to project future earnings histories for members of the 1990 SIPP panel (lams and Sandell, 1996; Sandell and lams, 1996~. The SSA analysts propose to extend their models to project pension income, as well as Social Security ben- efits, on the basis of earnings histories and other characteristics of SIPP sample members. If successful, and if a savings income projection capability is also added, this approach could offer some useful policy modeling capability without requiring a full-fledged dynamic microsimulation model (assuming that policy changes can be factored into the projections in some manner). SSA analysts are also working on cross-sectional microsimulation models of SSI and the disability insurance (DI) component of the Social Security system with matched SIPP and disability program data from SSA records. As part of this effort, they are developing ways to simulate the multistage disability application and determination process (see Lahiri, Vaughan, and Wixon, 1995~. Initially they expect to be able to make estimates of the number of people in the civilian noninstitutional population who are likely to meet SSA disability criteria. Esti- mates of those categorically eligible on the basis of disability will then be com- bined with estimates of financial eligibility for SSI and DI insured status. A subsequent model of the application decision is also planned and will be used to

162 ASSESSING POLICIES FOR RETIREMENT INCOME introduce selectivity adjustments to the estimates of the pool of those eligible for disability benefits. This SSA modeling work should be well known and available to other ana- lysts, such as those in the Pension and Welfare Benefits Administration in DOL. Similarly, work that PWBA analysts may do with employer pension issues should be well known and available to analysts in other relevant agencies. It is probably not feasible to establish a cross-agency retirement income modeling group with authority to prepare estimates on behalf of the executive branch, but perhaps it is possible to establish coordinating mechanisms. For example, it would be useful to establish an interagency users group that meets regularly to review retirement- income-related projection model developments at the member agencies. Such meetings could contribute to the design of new, improved models in the future. Improvements to Existing Models In the near term, while large investments in projection model development are in abeyance, it is important to seek opportunities for cost-effective improvements to existing retirement-income-related projection models and to make them known and available across agencies. For example, it may be possible to marginally improve the pension components of such models as DYNASIM2 and PRISM, even though major improvements in pension simulation capability will depend on the development of new models that use greatly improved data. An important model for which it appears feasible to effect useful improve- ments in the near term is the SSA actuarial cost model. The SSA model is the linchpin of retirement-income-related projection modeling. It produces rela- tively few outputs (principally, the trust fund balances over a 75-year time hori- zon), but its projections are critical in setting the tone for policy debates. Also, many other models calibrate their outputs to components of the SSA projections, such as the forecasted population size by age and sex and wage growth and other economic factors. We urge attention to extending the SSA model in several ways.7 First, it is critically important to develop probabilistic estimates of uncertainty for each of the variables projected in the model and then to develop estimates of uncertainty of the model's outputs as a whole, taking account of covariances among the various time series (mortality, labor force participation, wage growth, etc.~. Such estimates should replace the current scenario approach, which has no probabilis- tic structure. Work is under way at SSA along these lines (see the previous discussion in "SSA Model Validation"), and we urge that it be continued. Second, it would be highly useful to add a capability to the model for limited 7The 1994-95 Advisory Council on Social security Technical Panel on Assumptions and Methods (1995:60-66) has made similar recommendations.

DEVELOPMENT OF PROJECTION MODELS 163 distributional analysis of the effects of current and proposed Social Security provisions. For example, it would be useful to be able to project costs and benefits of policy alternatives for low, middle, and high earners among the popu- lation. Such projections would be helpful in evaluating outputs of more disaggre- gated models, and vice versa. A distributional component of the SSA model should include a capability to simulate the effects of cyclical or random varia- tions in key aggregate variables, such as interest rates and growth in wages. Third, because of the importance of mortality projections for estimates of the retirement-age population, we urge SSA to evaluate the sensitivity of the model's outputs to mortality projections that are disaggregated by such characteristics as marital status and income levels. If the analysis suggests a strong degree of sensitivity, it would be important to undertake research on the best form of such projections for use in the model. SSA administrative data could provide the basis for a comprehensive study of the relationship of mortality to earnings for people of retirement age. (Studying the relationship of mortality to marital status with these data would require SSA to track the marital status of all beneficiaries, which could also be useful for projections of beneficiaries.) SSA also has several SIPP panel files to which it is appending year-by-year mortality records. These files could permit a rich, multivariate analysis of mortality correlates for a limited sampled Finally, it would be highly useful to develop means by which to document and provide research access to the model. Outputs of the SSA actuarial cost model are so widely used for inputs to other models that it would be very helpful if outside analysts could more readily learn about and have access to the model. Such access would also facilitate outside reviews of the model's properties to identify other improvements. Recommendations 17. To respond to immediate policy needs, agencies should use limited, special-purpose models with the best available data and research findings to answer specific policy questions. Although such models may not provide very accurate estimates, the alternative of developing complex new indi- vidual-level microsimulation or employer models in advance of needed im- provements in data and research knowledge has little prospect of producing better results and will likely represent, in the immediate future, a misuse of scarce resources. 8Panis and Lillard (1996) recently developed mortality models with data on age, race, sex, marital status, and household income of surviving sample members and decedents in the PSID. They used these models to simulate the effects of proposed Social Security changes, such as increasing the contribution rate and decreasing benefits, on the net present value of Social Security benefits for prototypical types of beneficiaries.

164 ASSESSING POLICIES FOR RETIREMENT INCOME 18. The Social Security Administration should consider enhancements to the SSA actuarial cost model in the near term. This model provides useful outputs for other models and helps frame the overall retirement income security policy debate. Priority improvements include the following: · replacing the scenario approach with probabilistic estimates of un- certainty for the model's projections that take into account covariances among the various time series (mortality, labor force participation, wage growth, etc.~; · adding a capability for limited distributional analysis of the effects of current and proposed Social Security provisions (e.g., of costs and benefits for low, middle, and high earners); · evaluating the sensitivity of the model's outputs to mortality projec- tions that are disaggregated by such characteristics as marital status and income levels and planning to develop such projections, if warranted; and · developing means to document and provide research access to the model.

Next: 6 Furthering Coordination for Data Collection, Research, and Modeling »

Assessing Policies for Retirement Income: Needs for Data, Research, and Models (1997)

Chapter: 5 Development of Projection Models

Welcome to OpenBook!

Get Email Updates