The development and evaluation of several programmed testing methods. Educational and Psychological Measurement, 29, 129-146.
. (1969). Sequential testing for dichotomous decisions. College Entrance Examination Board Research and Development Report (RDR 69-70, No. 3, and Educational Testing Service RB-70-31). Princeton NJ: Educational Testing Service.
. (1970). Sequential testing for dichotomous decisions. Educational and Psychological Measurement, 32, 85-95.
. (1972). Impact of item location effects on ability estimation in CAT. In Paper presented at the annual meeting of the National Council on Measurement in Education. Seattle WA.
. (2001). On-the-Fly Constraint-Controlled Assembly Methods for Multistage Adaptive Testing for Cognitive Diagnosis. Journal of Educational Measurement, 55, 595-613. doi:10.1111/jedm.12194
. (2018). Impact of flawed items on ability estimation in CAT. In Paper presented at the annual meeting of the National Council on Measurement in Education. Montreal, Canada.
. (1999). The impact of scoring flawed items on ability estimation in CAT. In Paper presented at the annual meeting of the Psychometric Society. Urbana, IL.
. (1998). Investigation of Response Changes in the GRE Revised General Test. Educational and Psychological Measurement, 75, 1002-1020. doi:10.1177/0013164415573988
. (2015). A theoretical study of the measurement effectiveness of flexilevel tests. Educational and Psychological Measurement, 31, 805-813.
. (1971). Tailored testing, an approximation of stochastic approximation. Journal of the American Statistical Association, 66, 707-711.
. (1971). Discussion. In C. K. Clark (Ed.), Proceedings of the First Conference on Computerized Adaptive Testing (pp. 113-117). Washington DC: U.S. Government Printing Office.
. (1976). Tailored testing: An application of stochastic approximation (RM 71-2). Princeton NJ: Educational Testing Service.
. (1971). Test theory and the public interest. Proceedings of the Educational Testing Service Invitational Conference.
. (1976). . (1971).
A broad range tailored test of verbal ability. In C. K. Clark (Ed.), Proceedings of the First Conference on Computerized Adaptive Testing (pp. 75-78). Washington DC: U.S. Government Printing Office.
. (1976). . (1977).
The self-scoring flexilevel test (RB-7043). Princeton NJ: Educational Testing Service.
. (1970). Some how and which for practical tailored testing. In L. J. T. van der Kamp, W. F. Langerak, and D. N. M. de Gruijter (Eds.), Psychometrics for educational debates (pp. 189-206). New York: John Wiley and Sons. Computer-Assisted Instruction, Testing, and Guidance (pp. 139-183). New York: Harper and Row.
. (1980). A broad range test of verbal ability (RB-75-5). Princeton NJ: Educational Testing Service.
. (1975). . (1971).
Individualized testing and item characteristic curve theory. In D. H. Krantz, R. C. Atkinson, R. D. Luce, and P. Suppes (Eds.), Contemporary developments in mathematical psychology (Vol. II). San Francisco: Freeman.
. (1974). Small N justifies Rasch model. In New horizons in testing: Latent trait test theory and computerized adaptive testing (pp. 51-61). New York, NY, USA: Academic Press.
. (1983). Panel discussion: Future directions for computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 1977 Item Response Theory and Computerized adaptive conference. Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.
. (1978). Practical methods for redesigning a homogeneous test, also for designing a multilevel test (RB-74-30). Princeton NJ: Educational Testing Service.
. (1974). . (1971).
Some test theory for tailored testing. In W. H. Holtzman (Ed.), Computer-assisted instruction, testing, and guidance (pp. 139-183). New York: Harper and Row.
. (1970). . (1977).
Some likelihood functions found in tailored testing. In C. K. Clark (Ed.), Proceedings of the First Conference on Computerized Adaptive Testing (pp. 79-81). Washington DC: U.S. Government Printing Office.
. (1976). Individualized testing and item characteristic curve theory (RB-72-50). Princeton NJ: Educational Testing Service.
. (1972). Efficiency and precision in two-stage adaptive testing. West Palm Beach Florida: Eastern ERA.
. (1984). Evaluating a new approach to detect aberrant responses in CAT. In Paper presented at the annual meeting of the American Educational Research Association. Chicago IL.
. (2003). Statistics for detecting disclosed items in a CAT environment. Metodologia de Las Ciencias del Comportamiento, 5(2).
. (2004). Methods for item set selection in adaptive testing. In Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). Evaluating computerized adaptive testing design for the MCAT with realistic simulated data. In Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). Test information targeting strategies for adaptive multistage testlet designs. In Paper presented at the Annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). Some practical examples of computerized adaptive sequential testing. Journal of Educational Measurement, 35, 229-249.
. (1998). Exposure control using adaptive multi-stage item bundles. In Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago, IL, USA.
. (2003). Adaptive computer-based tasks under an assessment engineering paradigm. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.
. (2009). A few more issues to consider in multidimensional computerized adaptive testing. In Paper presented at the annual meeting of the American Educational Research Association. San Francisco.
. (1994). Heuristics based CAT: Balancing item information, content, and exposure. In Paper presented at the annual meeting of the National Council on Measurement in Education. New York NY.
. (1996). A testlet assembly design for the uniform CPA examination. In Paper presented at the Annual Meeting of the National Council on Measurement in Education. New Orleans.
. (2002). Item selection using an average growth approximation of target information functions. Applied Psychological Measurement, 16, 41-51.
. (1992). Multidimensional computerized adaptive testing in a certification or licensure context. Applied Psychological Measurement, 20, 389-404.
. (1996). Computer-adaptive testing. In B. Everitt and D. Howell (Eds.), Encyclopedia of statistics in behavioral science. New York: Wiley.
. (2004). . (1992).
Heuristic-based CAT: Balancing item information, content and exposure. In Paper presented at the annual meeting of the National Council on Measurement in Education. New York NY.
. (1996). A testlet assembly design for the uniform CPA Examination. Applied Measurement in Education, 19, 189-202. doi:10.1207/s15324818ame1903_2
. (2006). Multidimensional Computerized Adaptive Testing in a Certification or Licensure Context. Applied Psychological Measurement, 20, 389-404.
. (1996). Exposure control using adaptive multi-stage item bundles. In Paper presented at the Annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). Some practical examples of computerized adaptive sequential testing (Internal Report). Philadelphia: National Board of Medical Examiners.
. (1996). Computer-assisted test assembly using optimization heuristics. Applied Psychological Measurement, 22, 224-236.
. (1998). Overview of the USMLE Step 2 computerized field test. In Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (1997). Computer-adaptive sequential testing. In W. J. van der Linden (Ed.), Computerized Adaptive Testing: Theory and Practice (pp. 289-209). Dordrecht, The Netherlands: Kluwer.
. (2000). Implementing the computer-adaptive sequential testing (CAST) framework to mass produce high quality computer-adaptive and mastery tests. In Symposium paper presented at the Annual Meeting of the National Council on Measurement in Education. New Orleans, LA.
. (2000). CASTISEL [Computer software]. Philadelphia, PA: National Board of Medical Examiners.
. (1998). Heuristic-based CAT: Balancing item information, content, and exposure. In Paper presented at the annual meeting of the National Council on Measurement in Education. New York NY.
. (1996). Maintaining content validity in computerized adaptive testing. Advances in Health Sciences Education, 3, 29-41.
. (1998). Test models for complex computer-based testing. In C. N. Mills, M. T. Potenza, J. J. Fremer, and W. C. Ward (Eds.), Computer-based testing: Building the foundation for future assessments (pp. 67-88). Hillsdale NJ: Erlbaum.
. (2002). A framework for exploring and controlling risks associated with test item exposure over time. In Paper presented at the Annual Meeting of the National Council for Measurement in Education. San Diego, CA.
. (1998). Some alternative CAT item selection heuristics (Internal report). Philadelphia PA: National Board of Medical Examiners.
. (1995).