Read "Science and Judgment in Risk Assessment" at NAP.edu

Page 144 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 144

8
Data Needs

This chapter discusses the quantity, quality, and availability of data needed for conducting an adequate risk assessment in the context of the Clean Air Act Amendments of 1990 (CAAA-90). It begins by discussing the need for a priority-setting process, and the need for an iterative data-collection process. It then indicates the proper prioritization for data collection and the availability of data in each of the key risk-assessment steps. It concludes with a discussion of how data should be managed.

Context Of Data Needs

Most would agree that, given the best available model, additional relevant data will lead to a more accurate and precise risk assessment. The quality of the data is critical, no matter how excellent the model chosen, to avoid the classic ''garbage in, garbage out" problem. In the gathering of data, tradeoffs must often be made among data that are necessary, data that are desirable, and data that are affordable. Desirability must be defined in the context of the risk-management goals to be achieved, which might be the development of regulations, the setting of standards, or the screening of chemicals to set priorities.

The more precisely the risk manager frames the questions to be addressed by the risk assessment at the outset, the less ambiguity there will be as to what data are required to answer the questions, the less need for judgment in datagathering, and the lower the likelihood that inappropriate or insufficient data will be gathered. As a corollary, public input into the framing of goals and questions can help to avoid public criticism and distrust of the process of risk assessment,

Page 145 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 145

including the gathering of exposure and toxicity data. Public confidence that risk managers are addressing real concerns, as opposed to going through a process perfunctorily, is critical to the future of risk assessment as an activity capable of improving the quality of life. Risk managers need to articulate clearly from the beginning who is to be protected from what, when and where, and at what cost (including how much effort and funds are to be expended to collect appropriate data), so that risk assessors can provide relevant information.

Implications For Priority-Setting

It is not necessary, nor would it be cost-effective, to collect all the data needed for a complete health-hazard assessment on all the 189 chemicals (or mixtures) listed in CAAA-90. It is important, however, that the entire list be examined to identify chemicals that are potentially hazardous and that the later full-scale evaluation of each chemical selected for further scrutiny proceed as effectively as possible. An overall strategy is essential for setting priorities among the steps in the information-gathering process and for determining the extent of assessment needed.

Because risk is a function of exposure, as well as toxicity, determining both that a chemical is of low toxicity to all humans and that all humans have only small exposures to it would lead to an overall low priority for a full-scale risk assessment. Obviously, assigning a high priority to both would lead to an overall high priority for such assessment and argue for collection of a complete data set in all categories of exposure and toxicity. There will be various intermediate levels between low and high overall priority.

In the absence of pertinent human data, toxicological evaluation should begin with the simplest, most rapid, and most economical tests and proceed to more complex, time-consuming, and more expensive tests only as warranted by the initial steps. Similarly, emission, transport, and exposure data might be used to rank chemicals for testing, from those with relatively large exposure potential down to those with a very low likelihood of significant exposure, either for the population at large or for any substantial subset of the population. What is "substantial" in this context will of course depend on concurrent assessments of toxicity. Ordering can then be based on an evaluation of a relatively modest or limited data set.

To assess whether there is a potential for exposure, and to gauge the magnitude and duration of exposure, one needs to know:

1.	Is the chemical emitted into the air?
2.	Is the chemical stable enough to be transported from its source to a population?

If the chemical is not emitted or is so unstable that it breaks down into innocuous products before reaching a population, no further data need be col-

Page 146 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 146

lected and further risk assessment is not warranted. But if it is emitted and can be transported to a population, one needs to ask:

3.	Who is exposed, to how much, and for how long?
4.	What is the relationship between exposure (dose) and response (effect) for humans and for animals?

In an iterative data-collection process, one works through data related to questions 1-4, first collecting the most critical data within each category, then judging needs for more data within that category before moving to the next category. The process is iterative until sufficient information is gathered to draw a conclusion—e.g., on a potential threat to public health.

Section 112 of the Clean Air Act mandates that EPA consider the hazards and possible regulation of 189 specified chemicals. Considering both the effort required to carry out complete risk assessments and the resources of the agency, it is unlikely that that can be accomplished within the time constraints of the act. Consequently, in the spirit of the act and in the interest of the public welfare, it is critical that EPA assign priorities to the chemicals listed. These priorities should be based first on their potential impact on human health and welfare.

Some of the 189 chemicals appear to present major problems because of their variety of sources, large exposures, or high potency. Other chemicals present simpler problems—e.g., some have relatively few sources, some have lower potential for human exposures, and some have very low potency. It is an inefficient use of resources to invest huge amounts of money and time in research and analysis to determine factors already known to be inconsequential for final risk assessment or to confirm credible estimates on which consensus can easily be obtained. Therefore, EPA should do preliminary analyses (screenings) on all listed compounds to ascertain which chemicals merit detailed risk-assessment efforts and which do not merit such work. These preliminary analyses should be reviewed by an independent board to ensure the validity of the resulting priorities for full-scale assessments. Priorities should be continually reevaluated and changed as appropriate in response to new data. The task of setting priorities and keeping them up to date is not trivial and should be specifically included, with adequate resources, in EPA's evolving program plan to implement CAAA-90. The iterative data-collection process can then help in setting priorities for ranking needed studies to avoid the accumulation of a surfeit of data, which would result in misuse of funds and waste of time.

Data Needed For Risk Assessment

The following sections discuss the priority-setting and availability of data for each of the key data-processing steps in risk assessment: emissions, environmental fate and transport, exposure, and toxicity. The final section summarizes the data priorities in each of these areas, and indicates how this data can be used for overall priority-setting for data collection.

Page 147 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 147

Emissions

Knowledge of emissions of a chemical into the air—specifically, the quantity emitted per unit of time (flux) from each place where it is made, stored, used, or disposed of plus its physical and chemical form—is fundamental to characterizing the magnitude of expected exposure to the chemical.

Priorities for Collecting Data

The specific methods for characterizing emissions are described and evaluated in Chapter 7. On the basis of this analysis, an iterative data-collecting process for emission characterization might proceed roughly as follows:

1.	Plant-specific material balance
2.	Industry-wide emission factors
3.	Plant-specific emission factors
4.	Facility measurements, including flux determinations.

Data quality is critical, because of the wide variety of emission-estimation techniques and the many types of facilities emitting hazardous air pollutants. EPA often uses whatever data are available at the time of decision-making and has not published guidelines or standards for the quality of emission data to be used in its risk assessments.

Because the emission-characterization database is extremely important for priority-setting, EPA should review the emission estimates submitted to ensure that they meet reasonable quality standards and that emission estimates from all sources within a site are submitted.

Data Availability

EPA plans to use emission information that is available in the Toxic Release Inventory (TRI) database as required by Title III of the Superfund Amendments and Recovery Act (SARA). The information available in this database is shown in the table provided by EPA to the committee in Appendix A. The TRI database includes information on annual emissions, facility location, and categorization of emissions as fugitive, point source, or both.

These data have two serious limitations for any use in risk assessment. First, the database does not include emissions from all operations at a facility; for example, transfer operations are not reported. Second, the database does not include emissions of less than 10 tons/year, nor does it have the locations of emission points or the frequency of emissions. Some information is available in emission inventory databases that are required by state implementation plans (SIPs) that states are required to submit to EPA to indicate how they plan to control emissions relative to CAAA-90, but that information is not necessarily

Page 148 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 148

well characterized. For example, emissions of volatile organic chemicals (VOCs) might be listed as a total, instead of as emissions of separate chemicals; but risk assessments should generally be done for separate chemicals, rather than for classes of chemicals.

A study by Amoco and EPA (1992) gives an example of the differences between estimated or calculated emissions (such as those listed in the TRI database) and emissions determined via direct measurement. This study found that the "existing estimates of environmental releases were not adequate for making a chemical-specific, multi-media, facility wide assessment." The report identified several specific problems in using the TRI database to conduct an in-depth evaluation of a facility:

•	Lack of chemical characterization data.
•	Difficulty of measuring and characterizing small sources.
•	Use of estimated, rather than actual, data.
•	Lack of identification of new sources leading to underestimation.
•	Overestimation of some sources because of use of standardized industry-wide emission factors.
•	No requirement that all chemicals be reported in the TRI database (e.g., only 9% of total hydrocarbons were required to be reported).
•	Exclusion of some activities and emissions from record-keeping requirements (e.g., barge loading, which accounted for about 20% of benzene emissions).
•	Lack of data in TRI on location of nearby populations and ecosystems.

EPA should develop a mechanism to gather the information just listed in a consistent fashion. This mechanism could include changes in Title III of SARA, which requires the TRI reporting requirement or development of information for Title I or V of CAAA-90. Although development of emission characterization databases for all of the 189 chemicals might initially seem to be a major task, CAAA-90 requires states to develop more detailed emission inventories by November 1992 and to update them. Most facilities are then required to estimate their emissions on a point basis to satisfy state requirements for emission inventories. Much of this information is also required for permit purposes.

Even simple changes, such as modifying the SARA Title III requirements to include all 189 hazardous air pollutants on the list, would help. Sixteen of the 189 compounds in CAAA-90 Title III are not on the TRI list (see Table 8-1). In addition, the TRI database includes only sources that have 10 or more full-time employees and that manufacture, process, or use specified chemicals above a certain production rate. That restriction excludes smaller sources within the manufacturing sector for which risk assessments must be conducted under the Title III requirements. Instituting an emission threshold relative to the Title III requirements (e.g., 10 tpy for single compound; 25 tpy for multiple compounds) might be more appropriate for gathering information for risk-assessment purposes.

Page 149 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 149

TABLE 8-1 List of Section 112 Pollutants Not in Toxic Release Inventory Data Base

2,2,4-Trimethyl pentane

Acetophenone

Caprolactan

Dichlorodiphenyldichloroethylene (DDE)

Dimethyl formanide

Fine mineral fibers

1 texamethylene-t,t,-diisocyanate

Hexane

Isophorone

Phosphine

Polycylic organic matter

Sulfur dioxide, anhydrous

TCDD

Triethylamine

For evaluation of VOCs, many of which are on the list of 189 compounds under Title III, emission estimates developed for other regulatory purposes (such as the ozone provisions of CAAA-90) can be used. However, these data are frequently not speciated in terms of the chemical composition of the VOCs. In addition, the reporting of VOC emission information is required only in nonattainment areas, so this information may not always be available.

Environmental Fate and Transport

Emitted pollutants can move within and between environmental media and be converted to different forms. A thorough understanding of what happens to a chemical in the environment forms part of the basis for estimating human exposure and hence determining risk.

Priorities for Collecting Data

In the proposed iterative data-collection process described at the beginning of this chapter, data on environmental fate and transport would be acquired in roughly the following order:

1.	Physical properties.
2.	Physicochemical properties of environment.
3.	Chemical properties or reactivity.
4.	Rates of potential removal processes.

Once that information is available, a model calculation of expected concentra-

Page 150 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 150

tions in nearby air is relatively straightforward. If the information is not available, it must be obtained or assumed.

Data Availability

Data on emissions and physical properties are generally available or can be estimated (Lyman et al., 1982). For chemical properties and reactivity, they are available for some environmental reactions, but not all. In the case of physicochemical properties, the environment data are generally available at most locations in the United States. Information on the rates of potential removal processes are more difficult and costly to obtain.

Careful evaluation of data is necessary. For example, published vapor pressures of organic chemicals of moderate to low volatility determined under laboratory conditions can be seriously inaccurate and misleading. For all chemicals, vapor-phase reaction rate constants, when extrapolated from the laboratory to outdoor ambient air, can be seriously in error. The literature is not always for purposes of risk assessment.

Exposure

Accurate exposure data are crucial to valid risk assessment. For example, exposure data must match up temporally with the health end points of concern. Key issues in the evaluation of exposure are

•	The end points of interest (e.g., acute vs. chronic toxicity).
•	The populations at risk (i.e., the general population and defined subpopulations with potentially increased risks).
•	The routes of exposure (e.g., air, diet, or skin).
•	The duration (e.g., lifetime, annual, or instantaneous).
•	The nature and degree of simultaneous toxicant exposures.

Rarely are all those issues resolved by the exposure data available for a risk assessment. Efforts to collect the data should focus on the minimum needed to meet the goals of the assessment in its risk-management context.

Priorities for Collecting Data

In the proposed iterative data-collection process, the order of data collection might be as follows:

Ambient-air monitoring. Most commonly, ambient-air monitoring produces interval concentrations in samples averaged over a fixed time, such as 8 hr or 24 hr at fixed sampling stations. The number of stations, their times of operation, and their locations relative to known emission sources and popula-

Page 151 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 151

	tions at risk must be known, as well as concentration averages, variances or ranges (to estimate uncertainty), and a description of the methods used, including potential error. The time interval of ambient-air monitoring should be commensurate with the time needed to elicit the physiological effects of concern.
2.	Targeted fixed-point monitoring data. These data are often generated from samples placed near sources of high-volume emissions (i.e., "hot spots") or in response to some real or perceived public-health need. They should be accompanied by the same information as for ambient-air monitoring. Targeted monitoring is often more useful than monitoring at pre-existing sampling stations if it can focus on higher concentrations of a pollutant, a population at greater risk, or both.
3.	Peak-concentration data. Either ambient-air or targeted monitoring can miss peak concentrations, because the sampling interval is so long as to "average out" all peaks and valleys in the sampled air mass. Sampling with instantaneous analyzers (e.g., spectrophotometers) or interval analyzers that can accept a sample of short duration is needed to define peaks. That might be of special importance for a toxicant released intermittently.
4.	Personal monitoring. Concentration data from personal monitors are often more useful for risk assessment, because they show the exposure of individual subjects and can be used to relate activity patterns to exposure. If enough subjects are selected for monitoring, a population exposure can be constructed. Such information is not yet generally available, except for a few toxicants, because of the time and expense of a comprehensive study. This in turn is primarily due to a lack of low-cost, portable sampling devices for most chemicals. Active samplers may provide more information directly for risk assessment than passive samplers for personal monitoring, because pollutant concentrations (and thus the dose) can be estimated more directly with active sampling. Passive samplers do not provide specific concentrations; however, they are far less costly and bulky than active samplers. They are useful in screening (i.e., to determine whether exposure has occurred). Research to correlate the concentrations detected by passive samplers with exposure and dose would further enhance their potential.
5.	Biological markers. If a toxicant produces a metabolite, enzyme alteration, or other signal that exposure has occurred and so leads to a high correlation between that marker and degree of exposure, such information can reduce the uncertainty in a predicted risk and could be useful for risk assessment. In one respect, this would be the best exposure information, because it would show that the toxicant has been absorbed and has already had some biological effect (NRC, 1987); but it makes single-source exposure assessment difficult, because it reveals total uptake across all routes of exposure. Unless biologic-marker data are checked against external exposure data, they cannot be used to determine dose. Validation of the correlation between an external concentration and the magnitude of a biological marker in experimental animals can be helpful, but

Page 152 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 152

one is left with the difficulty of extrapolating to humans, who may not respond in the same quantitative way as experimental animals. In some cases, markers in humans can be established in occupational settings.

Data Availability

Some of the 189 chemicals on the Clean Air Act Amendments list have relatively abundant data on concentrations; some have virtually none. When concentration data are available, they are more likely to be from ambient-air monitoring or, at best, targeted fixed-point monitoring. For only some of the compounds are sufficient exposure data available for preliminary evaluation of relative priority for more detailed risk assessment (see Appendix A). That is a major problem that can be solved only by a much more extensive state or federal monitoring program. Some states, such as California, are moving rapidly in developing a hazardous air-pollutant monitoring program. Coordination between states and with federal agencies is necessary to keep scarce resources from being wasted in duplicative efforts.

Collection of new exposure data on humans is limited by current methods of monitoring individual exposures (which are often expensive, often of low accuracy or precision, and often nonquantitative or lacking in the ability to determine the source of exposure) and by methods of obtaining information on human behavior that might affect uptake or exposures. In addition, no reference database is available for comparing new data, that is, for determining whether new data represent exposure outside the general norm or are within the realm of acceptability defined by prior studies. Furthermore, when exposure data are gathered, they should be probability-based to allow inferences to the population and estimation of the tails of the distribution of exposures.

Toxicity

A full assessment of the inherent toxicity of an agent requires some combination of structure-activity analyses, in vitro or whole-animal short-term tests, chronic or long-term animal bioassays, human biomonitoring, clinical studies, and epidemiological investigations (NRC, 1984, 1991c,d). A complete hazard identification might entail review of information in all those categories before a determination that a quantitative risk assessment of the agent is warranted (Bailar et al., 1993).

Estimation of dose-effect relationships requires data on the effects of a wide range of doses, on factors that influence the dose delivered to critical target cells by given magnitudes and patterns of exposure (e.g., uptake, anatomic distribution, metabolism, and excretion) (NRC, 1987), on the shapes and slopes of pertinent dose-effect curves, on the relevant mechanisms of effects (NRC, 1991c),

Page 153 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 153

and on the extent to which the response to an agent can vary with species, sex, age, previous exposure, health status, exposure to extraneous agents, and other variables (NRC, 1988a).

Priorities for Collecting Data

Strategies to fill data gaps in toxicity assessment are best developed case by case, but the following priority-setting of the major types of toxicological data that may be used are listed below. In the suggested iterative data-collection process, the toxicity data listed in the first three categories below (i.e., generic and acute toxicity, acute mammalian lethality) should be collected on every chemical as a starting point, and other, more expensive, data should be collected only on chemicals that give cause for concern based on the data in those categories.

1.	Generic toxicity data (structure-activity relationships and results of other correlational analyses).
2.	Data on acute toxicity (on lethality in microorganisms or effects on mammalian cells in vitro).
3.	Acute mammalian lethality data (usually rodent).
4.	Toxicokinetics data, phase 1 (on uptake, distribution, retention, and excretion in rodents).
5.	Genotoxicity data (results of short-term in vitro tests in microorganisms, Drosophila, and mammalian cells).
6.	Data on subchronic toxicity (on 14-day or 28-day inhalation toxicity in rodents).
7.	Toxicokinetic data, phase 2 (on metabolic pathways and metabolic fate in rodents and other mammalian species, with special attention given to exposure by inhalation).
8.	Data on chronic toxicity (on carcinogenicity, neurobehavioral toxicity, reproductive and developmental toxicity, and immunotoxicity in two rodent species of both sexes, with special attention given to the exposure by inhalation).
9.	Human toxicity data (clinical, biomonitoring, and epidemiological data).
10.	Data on toxic mechanisms, dose-effect relationships, influence of modifying factors (age, sex, and other variables) on susceptibility, and interactive effects of mixtures of chemical and physical agents.

This prioritization is based on the cost and complexity of gathering such data (NRC, 1984). It is generally not possible to plan the collection of clinical and epidemiological data. Toxicological studies conducted clinically in humans are usually planned and implemented under experimental control, but very few are done, because of the attendant hazards. Epidemiological studies are relative-

Page 154 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 154

ly expensive and often produce data that are difficult to interpret as to effects of specific toxic agents. If one were to set data-collection priorities without concern for cost, ethical, or other considerations, the sequence of collection might be

1.	Toxicological human data.
2.	Clinical data.
3.	Epidemiological data.

Data Availability

Availability of requisite data varies widely among the 189 chemicals. On the one hand, some preliminary toxicity data are available on some of the chemicals, or at least can be estimated from structure-activity correlations. On the other hand, the toxicity data are incomplete on almost all 189 chemicals.

The amount of data available is highly variable and depends largely on the existence of uncontrollable chance events. Generally, better data sets exist on individual chemicals that have been used over long periods (vinyl chloride, some solvents, etc.) and on chemicals of wide use (such as pesticides) than on chemicals rarely used or chemicals that are byproducts of other chemicals (e.g., chemicals in automobile exhaust and cigarette smoke). Additional information and analysis on the Integrated Risk Information System (IRIS) used by EPA is provided in Chapter 12. Some of the partial data needed to test models are discussed in Chapter 6.

Overall Priority Setting

The data needed for each step of risk assessment are summarized in rough order of increasing complexity (see Table 8-2). In an iterative data-collection process, if information in the top one or two items of each of the four columns in Table 8-2 does not indicate increased risk potential the priority for full risk assessment should be low. Various combinations of negative information in the first few items of any two of the first three lists (e.g., emissions, environmental fate and transport, exposure) with positive information in the third list might lead to a medium priority. Positive information in the early items of two, or perhaps three, of the lists would argue for a high priority. Data for the more complex items of each list would be developed when evidence of potential hazard exceeded an agreed-on ''bright line" of concern, i.e., a decision point set either by regulation or programmatic procedures.

Although a full priority scheme probably should be on a continuous scale, several important points to develop a more detailed scheme might appear as follows:

Page 155 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 155

TABLE 8-2 Types of Data Available for Risk Assessment
Emissions	Environmental Fate and Transport	Exposure	Toxicity
1. Material balance	1. Physical properties	1. Ambient fixed-point monitoring	1. Generic toxicity
2. Industry-wide emission factors	2. Physicochemical properties of environment	2. Targeted fixed-point monitoring	2. Acute toxicity (lethality for microorganisms or mammalian cells in vitro)
3. Plant-specific emission factors (EPA protocol)	3. Chemical properties or reactivity	3. Duration and frequency of peak concentrations for populations at risk	3. Acute mammalian lethality (rodent)
4. Facility measurements, including flux determinations	4. Rates of potential removal processes	4. Personnel monitoring for average and maximally exposed people	4. Toxicokinetics, phase 1
		5. Biologic markers	5. Genotoxicity (short-term in vitro tests in micro-organisms, Drosophila, or mammalian cells)
			6. Subchronic (13-day or 28-day) inhalation toxicity (rodent)
			7. Toxicokinetics, phase 2
			8. Chronic toxicity: carcinogenicity, neurobehavioral toxicity, reproductive and developmental toxicity, or immunotoxicity
			9. Human toxicity (clinical, biomonitoring, epidemiologic)
			10. Toxic mechanisms and dose-effect relationships

Page 156 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 156

Screening risk assessment

Emissions—Items 1 and 2

Environmental fate and transport—Items 1-3

Exposure—Items 1-3

Toxicity—Items 1-3

•	If the information for all the above items (or items lower on the list, if available) indicates no potential health concerns, assign "low priority."
•	If any information on exposure (emissions, environmental fate and transport, exposure) is positive, assign the chemical "medium priority."
•	If any information on exposure is positive (i.e., emission, environmental fate and transport, or exposure measurement), and toxicity data are positive, then assign the chemical "high priority" and proceed to the full-scale risk assessment.

Full risk assessment

Emissions—Items 1-4

Environmental fate and transport—Items 1-5

Exposure—Items 1-5

Toxicity—Items 1-10

•	If the information is not positive for the higher-order items in all four lists, assign the chemical to Action Level 2 (more extended time response).
•	If the information is positive for the high-order items in all four lists, assign the chemical to Action Level 1 (short time-frame response).

Reliable positive human evidence will always result in a high priority and the full risk evaluation. Any positive clinical, toxicologic, or epidemiological human data would override a priority based on exposure and animal toxicity data alone and move a given chemical to the stage of full risk assessment.

The detailed nature of the process used to set priorities for full risk assessment needs to be addressed in a coordinated way by federal and state agencies, to ensure the best use of limited resources for this programmatic step. There might be, for example, a numerical weighting or scoring approach based on data in the four categories of emissions, environmental fate and transport, exposure, and toxicological data. EPA should consider convening a panel of experts to develop a priority-setting process and the requisite accompanying iterative approach to data collection.

Data Management

More attention needs to be paid to data management to ensure that vital data gaps are filled, that data used in risk assessments are of the best possible quality, and that relevant information (such as negative epidemiological information) is

Page 157 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 157

not overlooked. The lack of a consistent data-collection scheme makes data analysis, and thus effective risk assessment, inconsistent and unreliable for risk-management purposes.

For example, risk assessment often requires that the assessor decide whether to set aside information from old studies when newer, supposedly better information is available. The ultimate desire is for credibility; therefore, it is important to use information that is widely acknowledged as the best representation of reality. If the results of a new study contradict information from an old study and if there is only a small difference in the "bottom-line" estimate of human health risk, then both should be used, and the error bounds of the current risk assessment should be revised. However, if the studies lead to quite different conclusions, use of both might be feasible. For example, some animal evidence might show a major health hazard while there may also be weak, negative, or equivocal animal studies. Such conflicting data should be carefully reviewed in the risk-assessment document, with detailed study of possible reasons for the discrepancy. When no reconciliation of results seems feasible, the committee recommends that the voice of prudence be heard and that the risk assessment be either based on the higher ultimate risk estimate or delayed (as was done in part on formaldehyde) until additional studies can be completed.

Findings And Recommendations

The committee's findings and recommendations follow.

Insufficient Data for Risk Assessment

EPA does not have sufficient data to assess fully the health risks of the 189 chemicals in Title III within the time permitted by the Clean Air Act Amendments of 1990.

•

EPA should screen the 189 chemicals for priorities for the assessment of health risks, identify the data gaps, and develop incentives to expedite generation of the needed data by other public agencies (such as the National Toxicology Program, the Agency for Toxic Substances and Disease Registry, and state agencies) and by other organizations (industry, academia, etc.).

Need for Data-Gathering Guidelines

EPA has not defined the guidelines or process to be used for determining the types, quantities, and quality of data that are needed for conducting risk assessments for facilities emitting one or more of the 189 chemicals.

•	EPA should develop an iterative approach to gathering and evaluating data in the categories of emission, transport and fate, exposure, and toxicology

Page 158 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 158

for use in both screening and full risk assessment. The data-gathering and data-evaluation process should be set forth by EPA in guidelines for use by those who conduct data-gathering activities. To develop these guidelines, EPA should convene a panel of experts to develop a priority-setting scheme that uses a numerical weighting or scoring approach.

Inadequacy of Emission and Exposure Data

EPA has often relied on non-site-specific emission and exposure data. These data are often not sufficient to assess the risk to individuals and the affected population at large.

•	EPA should expand its efforts to gather emission and exposure data to personal monitoring and site-specific monitoring.

Inadequacy of TRI Database as a Source of Emission Data for Risk-Assessment Purposes

The SARA 313 Toxic Release Inventory data and other readily available data used by EPA for emission characterization may be adequate for screening purposes but are not adequate for developing detailed risk assessments for specific facilities. Present processes of gathering emission data do not yield information appropriate for all risk-assessment purposes under the Clean Air Act Amendments.

•	EPA should modify its data-gathering activities related to emissions to ensure that it has or will acquire the data needed to conduct screening and full risk assessments, especially of the 189 chemicals listed in CAAA-90.

Lack of Adequate Natural Background-Exposure Database

EPA does not have an adequate database on natural background exposures to the 189 air pollutants against which to evaluate total human exposure data from facilities producing or using these substances.

•	EPA should develop an ambient-outdoor-exposure database on the 189 listed hazardous air pollutants.

Inadequate Explanation of Analytical Techniques

EPA does not always explain adequately the analytical and measurement methods it uses for estimating ambient outdoor exposures.

•	EPA should collate and explain the analytical and measurement methods it uses for ambient outdoor exposures, including the errors, precision, accuracy, detection limits, etc., of all methods that it uses for risk-assessment purposes.

Page 159 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 159

Need for System of Data Management for Risk Assessment

EPA needs more adequate mechanisms to compile and maintain databases for use in health-risk screening and assessment.

•

EPA should review its data-management systems and improve them as needed to ensure that the quality and quantity of the data are routinely updated and that the data are sufficiently accessible for risk screening and risk assessment. Its responsibilities under CAAA-90 should be prominent in this review and revision.

Page 144 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 145 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 146 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 147 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 148 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 149 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 150 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 151 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 152 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 153 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 154 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 155 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 156 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 157 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 158 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Page 159 Cite

Suggested Citation:"8 Data Needs." National Research Council. 1994. Science and Judgment in Risk Assessment. Washington, DC: The National Academies Press. doi: 10.17226/2125.

Next: 9 Uncertainty »

Science and Judgment in Risk Assessment (1994)

Chapter: 8 Data Needs

8
Data Needs

Context Of Data Needs

Implications For Priority-Setting

Data Needed For Risk Assessment

Emissions

Priorities for Collecting Data

Data Availability

Environmental Fate and Transport

Priorities for Collecting Data

Data Availability

Exposure

Priorities for Collecting Data

Data Availability

Toxicity

Priorities for Collecting Data

Data Availability

Overall Priority Setting

Screening risk assessment

Full risk assessment

Data Management

Findings And Recommendations

Insufficient Data for Risk Assessment

Need for Data-Gathering Guidelines

Inadequacy of Emission and Exposure Data

Inadequacy of TRI Database as a Source of Emission Data for Risk-Assessment Purposes

Lack of Adequate Natural Background-Exposure Database

Inadequate Explanation of Analytical Techniques

Need for System of Data Management for Risk Assessment

Welcome to OpenBook!

Get Email Updates