National Academy of Sciences | 150 Year Anniversary

Questions? Call 800-624-6242

| Items in cart [0]

The National Academies Press

Rights & Permissions

topleft topright

NCHRP Report 571: Standardized Procedures for Personal Travel Surveys (2008)
National Cooperative Highway Research Program (NCHRP)

Citation Manager

Pisarski, Alan E, Alsnih, Rahaf, Zmud, Johanna P, Wilmot, Chester G, Brog, Werner, Freedman, Mark, Stopher, Peter R, Pratt, Joanne, Mix, Wende, Lee-Gosselin, Martin, Stecher, Cheryl, Axhausen, Kay W, Transportation Research Board. "2.6.3 A-3: Missing Data Imputation." NCHRP Report 571: Standardized Procedures for Personal Travel Surveys. Washington, DC: The National Academies Press, 2008.

Please select a format:

BibTeX EndNote RefMan


Page
36
bottomleft bottomright
Page
36
Front Matter (R1-R10)
Summary (1-4)
1.1 Background (5-5)
1.2 Study Objectives (6-6)
1.4 Research Approach (7-8)
1.5 Report Organization (9-9)
Chapter 2 - Summary of Recommended Standardized Procedures and Guidelines (10-10)
2.1.1 I-1: Minimum Question Specification (11-11)
2.1.2 I-2: Categories for Minimum and Other Questions (12-12)
2.2.1 D-1: Number and Type of Contacts (13-16)
2.2.2 D-3: Proxy Reporting (17-17)
2.2.3 D-4: Complete Household Definition (18-18)
2.2.4 D-6: Sample Replacement (19-19)
2.2.6 D-8: Unit Non-Response (20-20)
2.2.7 D-10: Initial Contacts (21-21)
2.2.9 D-14: Respondent Burden (22-22)
2.3.1 P-2: Requirements for Pretests or Pilot Surveys (23-23)
2.4.1 E-2: Ethics (24-25)
2.4.3 E-4: Respondent Questions (26-26)
2.4.5 E-9: Answering Machines and Repeated Call-Back Requests (27-27)
2.4.8 E-12: Time of Day to Begin and End Reporting (28-28)
2.5.1 C-1: Geocoding Standards (29-29)
2.5.3 C-4: Missing Values, Use of Zero, Etc. (30-30)
2.5.4 C-5: Coding Complex Variables (31-34)
2.6.1 A-1: Assessing Sample Bias (35-35)
2.6.3 A-3: Missing Data Imputation (36-36)
2.6.5 A-6: Documentation (37-38)
2.7.1 Q-1: Computing Response Rates (39-39)
2.7.2 Q-2: Transportation Measures of Quality (40-40)
2.7.3 Q-3: Coverage Error (41-41)
2.7.5 Q-6: Validation Statistics (42-42)
2.7.7 Q-8: Number of Missing Values (43-43)
2.7.8 Q-9: Adherence to Quality Standards and Guidelines (44-44)
3.3 National Highway Institute Course (45-45)
3.6 Potential Funding for the Promulgation and Maintenance of the Standardized Procedures and Guidelines (46-47)
4.1.1 D-11: GPS Surveys (48-48)
4.1.3 I-8: SP Data (49-49)
4.2.1 D-2: Who Should Be Surveyed? (50-50)
4.2.2 D-9: Times of Day for Contacts (51-51)
4.2.4 E-7: Cross-Checks in Data Collection and Data Review (52-52)
4.2.5 E-8: Days and Periods to Avoid Data Collection (53-53)
4.2.7 I-4: Ordering of Questions (54-54)
4.2.8 I-6: Instrument Design (55-55)
4.2.9 I-7: Multitasking of Activities (56-56)
4.2.10 S-1: Sample Sizes (57-57)
4.2.12 S-3: Collecting Augment Samples (58-58)
4.2.13 S-4: Stratification Options for Samples (59-59)
4.2.15 S-6: Development of Default Variances (60-60)
4.2.16 P-1: Focus Groups (61-61)
4.2.18 Q-4: Sampling Error (62-62)
4.3.1 Cell Phones (63-63)
4.3.3 Personalized Interview Techniques (64-64)
4.3.5 Impacts of the National Do Not Call Registry (65-65)
4.3.7 Refusal and Non-Contact Conversions (66-66)
4.3.10 Data Archiving in Transportation (67-67)
5.2.1 Scope of Work (68-90)
5.2.2 Schedule of Work (91-92)
Glossary (93-100)
References (101-103)
Abbreviations used without definitions in TRB publications (104-104)

Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.

OCR for page 36
36 Standardized Procedures for Personal Travel Surveys ­ Household income: categories corresponding to those in Table 7 in Section 2.5.4; ­ Race: categories of white, black/African American, American Indian/Alaska native, Asian, Native Hawaiian/Pacific Islander, other single race, and two or more races; ­ Age: categories of 0­5, 6­10, 11­14, 15­17, 18­64, 65­74, 75 and over; ­ Gender: male and female. 4. Total error should be measured using the Percentage RMSE statistic defined in Equation 1. 2 1 ni 1 rij - sij n ji Percent RMSE = n rij × 100 (1) ni i ji j where ni = number of variables i; nji = number of categories j in variable i; rij = reference value of variable i in category j; and sij = sample value of variable i in category j. 2.6.2 A-2: Weighting and Expansion of Data Weighting is the process of assigning weights to observations in a sample so that the weighted sample accurately represents the population. Expansion is the multiplication applied to each observation in a sample so that the expanded sample is an estimate of the population. Weight- ing is determined by comparing values of variables within the sample with values of correspond- ing variables from a reliable external source such as the census. Expansion factors are the inverse of the sampling rate. Weighting and expansion are often combined into a single factor or weight, which reflects both the relative representativeness of each observation in the sample and the number of simi- lar cases each observation in the sample represents in the population. Separate weights are usu- ally assigned to households, persons, and trips. These weights sum to the number of households, persons, and trips in the population, respectively. The reader is referred to Section 9.2 of the Technical Appendix for further elaboration. It is recommended that the following standardized procedures be adopted: 1. Each travel survey should conduct a weighting and expansion exercise to include the weights in the data set and to include a description of the weighting process in the metadata; 2. The weights should include expansion factors so that the sum of the weights match popu- lation estimates; and 3. The two-stage procedure, described in the technical appendix, Section 9.2.2, should be adopted as the standard method of calculating weights. 2.6.3 A-3: Missing Data Imputation As discussed in Section 9.3 of the Technical Appendix, imputation is the substitution of val- ues for missing data items or for values of data items that are known to be faulty. Data values are known to be faulty if they are infeasible (e.g., a 5-year old with a driver's license) or are incon- sistent with other information known of an individual or their household. There are two mech- anisms for substituting values for missing or faulty data items--deductive imputation (or infer- ence) and regular imputation. Inference involves deriving the value of a missing or faulty data item from the information known of a respondent or their household, when such a derivation can be made with relative certainty. For example, the gender of a person can often be inferred from their first name, and a person 16 years of age or older who reports making multiple trips alone by car probably has a driver's license. Imputation, on the other hand, is the generation of