References

ACCUPLACER. (2002). Available: <http://www.collegeboard.com/accuplacer/html/accupla1.html>. [May 14, 2002].

American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. (1999). Standards for educational and psychological testing. Washington, DC: Author.

Ananda, S. (2000). Equipped for the Future assessment report: How instructors can support adult learners through performance-based assessment (EX 0110P). Washington, DC: U.S. Department of Education.

Anastasi, A. (1988). Psychological testing. New York: Macmillan.


Bachman, L.F. (1990). Fundamental considerations in language testing. Oxford: Oxford University Press.

Bachman, L.F. and Palmer, A.S. (1996). Language testing in practice. Oxford: Oxford University Press.

Bond, L. (1995). Unintended consequences of performance assessment: Issues of bias and fairness. Educational Measurement: Issues and Practices, 14(4), 21-24.

Brennan, R.L. (1983). Elements of generalizability theory. Iowa City, IA: ACT Publications.

Brennan, R.L. (2001). An essay on the history and future of reliability from the perspective of replications. Journal of Educational Measurement, 38(4), 295-317.

Brennan, R.L., and Johnson, E.G. (1995). Generalizability of performance assessments. Educational Measurement: Issues and Practices, 14(4), 9-12.


Camilli, G., and Shepard, L.A. (1994). Methods for identifying biased test items. Thousand Oaks, CA: Sage.

Cole, N.S., and Moss, P.M. (1993). Bias in test use. In R.L. Linn (Ed.), Educational Measurement (3rd ed.). Phoenix, AZ: Oryx.

Comings, J., Sum, A., and Uvin, J. (2000). New skills for a new economy: Adult education’s key role in sustaining economic growth and expanding opportunity. Boston: Massachusetts Institute for a New Commonwealth.



The National Academies | 500 Fifth St. N.W. | Washington, D.C. 20001
Copyright © National Academy of Sciences. All rights reserved.
Terms of Use and Privacy Statement



Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.

OCR for page 103
References ACCUPLACER. (2002). Available: <http://www.collegeboard.com/accuplacer/html/accupla1.html>. [May 14, 2002]. American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. (1999). Standards for educational and psychological testing. Washington, DC: Author. Ananda, S. (2000). Equipped for the Future assessment report: How instructors can support adult learners through performance-based assessment (EX 0110P). Washington, DC: U.S. Department of Education. Anastasi, A. (1988). Psychological testing. New York: Macmillan. Bachman, L.F. (1990). Fundamental considerations in language testing. Oxford: Oxford University Press. Bachman, L.F. and Palmer, A.S. (1996). Language testing in practice. Oxford: Oxford University Press. Bond, L. (1995). Unintended consequences of performance assessment: Issues of bias and fairness. Educational Measurement: Issues and Practices, 14(4), 21-24. Brennan, R.L. (1983). Elements of generalizability theory. Iowa City, IA: ACT Publications. Brennan, R.L. (2001). An essay on the history and future of reliability from the perspective of replications. Journal of Educational Measurement, 38(4), 295-317. Brennan, R.L., and Johnson, E.G. (1995). Generalizability of performance assessments. Educational Measurement: Issues and Practices, 14(4), 9-12. Camilli, G., and Shepard, L.A. (1994). Methods for identifying biased test items. Thousand Oaks, CA: Sage. Cole, N.S., and Moss, P.M. (1993). Bias in test use. In R.L. Linn (Ed.), Educational Measurement (3rd ed.). Phoenix, AZ: Oryx. Comings, J., Sum, A., and Uvin, J. (2000). New skills for a new economy: Adult education’s key role in sustaining economic growth and expanding opportunity. Boston: Massachusetts Institute for a New Commonwealth.

OCR for page 103
Comrey, A.L., and Lee, H.B. (1992). A first course in factor analysis (2nd ed.). Hillsdale, NJ: Erlbaum. Crocker, L., and Algina, J. (1986). Introduction to classical and modern test theory. Orlando, FL: Harcourt Brace Jovanovich. Cureton, E.E., and D’Agostino, R.B. (1983). Factor analysis: An applied approach. Hillsdale, NJ: Erlbaum. Dunbar, S., Koretz, D., and Hoover, H.D. (1991). Quality control in the use of performance assessment. Applied Measurement in Education, 4(4), 289-303. Feldt, L.S., and Brennan, R.L. (1993). Reliability. In R.L. Linn (Ed.), Educational Measurement (3rd ed.). Phoenix, AZ: Oryx. Gorsuch, R.L. (1983). Factor analysis. Hillsdale, NJ: Erlbaum. Green, B.F. (1995). Comparability of scores from performance assessments. Educational Measurement: Issues and Practices, 14(4), 13-15. Hambleton, R.K., Swaminathan, H., and Rogers, H.J. (1991). Fundamentals of item response theory. Newbury Park, CA: Sage. Harris, C.W. (Ed.). (1963). Problems in measuring change. Madison, WI: University of Wisconsin Press. Holland, P.W., and Wainer, H. (1993). Differential item functioning. Newbury Park, NJ: Erlbaum. Kaufman, P., Kwon, J.Y., Klein, S., and Chapman, C.D. (2000). Dropout rates in the United States: 1999 (NCES 2001-22). Washington, DC: U.S. Department of Education, National Center for Education Statistics. Kolen, M.J., and Brennan, R.L. (1995). Testing equating methods and practices. New York: Springer. Koretz, D. (1994). The evolution of a portfolio program: The impact and quality of the Vermont Portfolio Program in its second year (1992-93) (ERIC #ED379301). Los Angeles: National Center for Research and Evaluation, Standards, and Student Testing. Koretz, D., Stetcher, B., Klein, S., and McCaffrey, D. (1994). The Vermont Portfolio Assessment Program: Findings and implications. Educational Measurement: Issues and Practice, 13(3), 5-16. Kunnan, A.J. (Ed.) (2000). Fairness and validation in language assessment. Cambridge: Cambridge University Press. LeMahieu, P.G., Gitomer, D.H., and Eresh, J.T. (1995). Portfolios in large-scale assessments: Difficult but not impossible. Educational Measurement: Issues and Practice, 14(3), 11-16, 25-28. Linn, R.L. (1993). Linking results of distinct assessments. Applied Measurement in Education, 6(1), 83-102. Linn, R.L., Gronlund, N.E., and Davis, K.M. (1999). Measurement and assessment in teaching (8th ed.). Upper Saddle River, NJ: Prentice-Hall. Messick, S. (1989). Validity. In R.L. Linn (Ed.), Educational Measurement (3rd ed.). New York: Macmillan. Messick, S. (1995). Standards of validity and the validity of standards in performance assessment. Educational Measurement: Issues and Practices, 14(4), 5-8. Microsoft Certification Program. (2002). Available: <http://www.Microsoft.com/traincert/mcp>. [March 28, 2002].

OCR for page 103
Millman, J., and Greene, J. (1993). The specification and development of tests of achievement and ability. In R.L. Linn (Ed.), Educational Measurement (3rd ed.). Phoenix, AZ: Oryx. Mislevy, R.J. (1992). Linking educational assessments: Concepts, issues, methods and prospects (ERIC #ED353302). Princeton, NJ: Educational Testing Service. Mislevy, R.J. (1995). Linking adult literacy assessments. Princeton, NJ: Educational Testing Service. National Board for Professional Teaching Standards. (2000). A distinction that matters: Why national teacher certification makes sense. Arlington, VA: Author. National Council of Teachers of Mathematics. (2000). Principles and standards for school mathematics. Reston, VA: Author. National Institute for Literacy. (2002). EFF standards for adult literacy and lifelong learning. Available: <http://www.nifl.gov/lincs/collections/eff/eff_standards.html>. [May 1, 2002]. National Reporting System. (2002). 6 levels of ABE or ESL. Available: <http://www.oei-tech.com/nrs/>. [April 29, 2002]. National Research Council. (1997). Educating one and all: Students with disabilities and standards-based reform. Committee on Goals 2000 and the Inclusion of Students with Disabilities, L.M. McDonnell, M.J. McLaughlin, and P. Morison (Eds.). Commission on Behavioral and Social Sciences and Education. Washington, DC: National Academy Press. National Research Council. (1999a). Embedding questions: The pursuit of a common measure in uncommon tests. Committee on Embedding Common Test Items in State and District Assessments, D.M. Koretz, M.W. Bertenthal, and B.F. Green (Eds.). Board on Testing and Assessment, Commission on Behavioral and Social Sciences and Education. Washington, DC: National Academy Press. National Research Council. (1999b). High stakes: Testing for tracking, promotion, and graduation. Committee on Appropriate Test Use, J.P. Heubert and R.M. Hauser (Eds.). Commission on Behavioral and Social Sciences and Education. Washington, DC: National Academy Press. National Research Council. (1999c). Uncommon measures: Equivalency and linkage among educational tests. Committee on Equivalency and Linkage of Educational Tests, M.J. Feuer, P.W. Holland, B.F. Green, M.W. Bertenthal, and F.C. Hemphill (Eds.). Board on Testing and Assessment, Commission on Behavioral and Social Sciences and Education. Washington, DC: National Academy Press. National Research Council. (2001a). Knowing what students know: The science and design of educational assessment. Committee on the Foundation of Assessment, J. Pellegrino, N. Chudowsky, and R. Glaser (Eds.). Division of Behavioral and Social Sciences and Education. Washington, DC: National Academy Press. National Research Council. (2001b). Testing teacher candidates: The role of licensure tests in improving teacher quality. Committee on Assessment and Teacher Quality, K.J. Mitchell, D.Z. Robinson, B.S. Plake, and K.T. Knowles (Eds.). Board on Testing and Assessment, Division of Behavioral and Social Sciences and Education. Washington, DC: National Academy Press. Nitko, A.J. (2001). Educational assessment of students (3rd ed.). Upper Saddle River, NJ: Prentice-Hall.

OCR for page 103
Popham, W.J. (1999). Classroom assessment: What teachers need to know (2nd ed.). Boston: Allyn and Bacon. Popham, W.J. (2000). Modern educational measurement: Practical guidelines for educational leaders (3rd ed.). Boston: Allyn and Bacon. Reckase, M.D. (1995). Portfolio assessment: A theoretical estimate of score reliability. Educational Measurement: Issues and Practice, 14(1), 12-14. Reckase, M.D., and Welch, C. (1999). Advances in portfolio assessment with applications to urban school populations. In M.T. Nettles and A.L. Nettles (Eds.), Measuring up: Challenges minorities face in educational assessment. Boston: Kluwer. Shavelson, R.J., Baxter, G.P., and Gao, X. (1993). Sampling variability of performance assessments. Journal of Educational Measurement, 30(3), 215-232. Shavelson, R., and Webb, N. (1991). Generalizability theory: A primer. Newbury Park, CA: Sage. St. Pierre, R.G., Swartz, J.P., Gamse, S., Murray, S., Deck, D., and Nickel, P. (1995). National evaluation of the Even Start Family Literacy Program: Final report. Cambridge, MA: Abt Associates. Test of English as a Foreign Language. (2002). Available: <http:/www.toefl.org>. [May 14, 2002]. Thissen, D. (2001). Comments on Performance Assessments for Adult Education: Exploring the Measurement Issues. Paper commissioned by the Committee on Alternatives for Assessing Adult Education and Literacy Programs. Center for Eduation. National Research Council. Thorndike, R.L., and Hagen, E.P. (1977). Measurement and evaluation in psychology and education (4th ed.). New York: Wiley. U.S. Department of Education. (2001a). Measures and methods for the National Reporting System for Adult Education: Implementation guidelines. Washington, DC: Author, Division of Adult Education and Literacy, Office of Vocational and Adult Education . U.S. Department of Education. (2001b). State-administered adult education program fiscal year 1998 expenditures (July 1, 1998-June/September 30, 2000). Washington, DC: Author, Division of Adult Education and Literacy. U.S. Department of Education. (2001c). State reported hours of attendance in adult education programs (1997-2000). Washington, DC: Author, Division of Adult Education and Literacy. U.S. Medical Licensing Examination. (2002). Available: <http://www.uslc.org>. [May 14, 2002]. Wainer, H., Dorans, N.J., Eignor, D., Flaugher, R., Green, B.F., Mislevy, R.J., Steinberg, L., and Thissen, D. (2000). Computerized adaptive testing: A primer (2nd ed.). Mahwah, NJ: Erlbaum. Workforce Investment Act of 1998 (H.R. 1385), 105th Cong., 2nd Sess. (1998). Zumbo, B.D. (1999). The simple difference score as an inherently poor measure of change. Some reality, much mythology. In B. Thompson (Ed.), Advances in Social Science Methodology, Volume 5 (pp. 269-304). Greenwich, CT: JAI Press.