The development and evaluation of several programmed testing methods (Research Bulletin 68-5). Princeton NJ: Educational Testing Service.
. (1968). The development and evaluation of several programmed testing methods. Educational and Psychological Measurement, 29, 129-146.
. (1969). Discussion. In D. J. Weiss (Ed.), Computerized adaptive trait measurement: Problems and Prospects (Research Report 75-5), pp. 44-46. Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program.
. (1975). Impact of flawed items on ability estimation in CAT. Paper presented at the annual meeting of the National Council on Measurement in Education. Montreal, Canada.
. (1999). On-the-Fly Constraint-Controlled Assembly Methods for Multistage Adaptive Testing for Cognitive Diagnosis. Journal of Educational Measurement, 55, 595-613. doi:10.1111/jedm.12194
. (2018). Investigation of Response Changes in the GRE Revised General Test. Educational and Psychological Measurement, 75, 1002-1020. doi:10.1177/0013164415573988
. (2015). The impact of scoring flawed items on ability estimation in CAT. Paper presented at the annual meeting of the Psychometric Society. Urbana, IL.
. (1998). Impact of item location effects on ability estimation in CAT. Paper presented at the annual meeting of the National Council on Measurement in Education. Seattle WA.
. (2001). Some how and which for practical tailored testing. In L. J. T. van der Kamp, W. F. Langerak, and D. N. M. de Gruijter (Eds.), Psychometrics for educational debates (pp. 189-206). New York: John Wiley and Sons. Computer-Assisted Instruction, Testing, and Guidance (pp. 139-183). New York: Harper and Row.
. (1980). . (1971).
Individualized testing and item characteristic curve theory. In D. H. Krantz, R. C. Atkinson, R. D. Luce, and P. Suppes (Eds.), Contemporary developments in mathematical psychology (Vol. II). San Francisco: Freeman.
. (1974). Small N justifies Rasch model. In New horizons in testing: Latent trait test theory and computerized adaptive testing (pp. 51-61). New York, NY, USA: Academic Press.
. (1983). Practical methods for redesigning a homogeneous test, also for designing a multilevel test (RB-74-30). Princeton NJ: Educational Testing Service.
. (1974). . (1977).
Panel discussion: Future directions for computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 1977 Item Response Theory and Computerized adaptive conference. Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.
. (1978). A theoretical study of the measurement effectiveness of flexilevel tests. Educational and Psychological Measurement, 31, 805-813.
. (1971). Some test theory for tailored testing. In W. H. Holtzman (Ed.), Computer-assisted instruction, testing, and guidance (pp. 139-183). New York: Harper and Row.
. (1970). Individualized testing and item characteristic curve theory (RB-72-50). Princeton NJ: Educational Testing Service.
. (1972). Tailored testing, an approximation of stochastic approximation. Journal of the American Statistical Association, 66, 707-711.
. (1971). Some likelihood functions found in tailored testing. In C. K. Clark (Ed.), Proceedings of the First Conference on Computerized Adaptive Testing (pp. 79-81). Washington DC: U.S. Government Printing Office.
. (1976). Tailored testing: An application of stochastic approximation (RM 71-2). Princeton NJ: Educational Testing Service.
. (1971). . (1971).
Discussion. In C. K. Clark (Ed.), Proceedings of the First Conference on Computerized Adaptive Testing (pp. 113-117). Washington DC: U.S. Government Printing Office.
. (1976). Test theory and the public interest. Proceedings of the Educational Testing Service Invitational Conference.
. (1976). The self-scoring flexilevel test (RB-7043). Princeton NJ: Educational Testing Service.
. (1970). . (1971).
A broad range tailored test of verbal ability. In C. K. Clark (Ed.), Proceedings of the First Conference on Computerized Adaptive Testing (pp. 75-78). Washington DC: U.S. Government Printing Office.
. (1976). . (1977).
A broad range test of verbal ability (RB-75-5). Princeton NJ: Educational Testing Service.
. (1975). Efficiency and precision in two-stage adaptive testing. West Palm Beach Florida: Eastern ERA.
. (1984). Statistics for detecting disclosed items in a CAT environment. Metodologia de las Ciencias del Comportamiento, 5(2).
. (2004). Evaluating a new approach to detect aberrant responses in CAT. Paper presented at the annual meeting of the American Educational Research Association. Chicago IL.
. (2003). Methods for item set selection in adaptive testing. Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). Evaluating computerized adaptive testing design for the MCAT with realistic simulated data. Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). Exposure control using adaptive multi-stage item bundles. Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). CASTISEL [Computer software]. Philadelphia, PA: National Board of Medical Examiners.
. (1998). Computer-adaptive sequential testing. In W. J. van der Linden (Ed.), Computerized Adaptive Testing: Theory and Practice (pp. 289-209). Dordrecht, The Netherlands: Kluwer.
. (2000). Overview of the USMLE Step 2 computerized field test. Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (1997). Maintaining content validity in computerized adaptive testing. Advances in Health Sciences Education, 3, 29-41.
. (1998). Implementing the computer-adaptive sequential testing (CAST) framework to mass produce high quality computer-adaptive and mastery tests. Symposium paper presented at the annual meeting of the National Council on Measurement in Education. New Orleans, LA.
. (2000). Some alternative CAT item selection heuristics (Internal report). Philadelphia PA: National Board of Medical Examiners.
. (1995). Some practical examples of computerized adaptive sequential testing. Journal of Educational Measurement, 35, 229-249.
. (1998). Test models for complex computer-based testing. In C. N. Mills, M. T. Potenza, J. J. Fremer, and W. C. Ward (Eds.), Computer-based testing: Building the foundation for future assessments (pp. 67-88). Hillsdale NJ: Erlbaum.
. (2002). Heuristic-based CAT: Balancing item information, content, and exposure. Paper presented at the annual meeting of the National Council on Measurement in Education. New York NY.
. (1996). Exposure control using adaptive multi-stage item bundles. Annual meeting of the National Council on Measurement in Education. Chicago, IL, USA.
. (2003). A framework for exploring and controlling risks associated with test item exposure over time. Paper presented at the annual meeting of the National Council for Measurement in Education. San Diego, CA.
. (1998). A testlet assembly design for the uniform CPA Examination. Applied Measurement in Education, 19, 189-202. doi:10.1207/s15324818ame1903_2
. (2006). Item selection using an average growth approximation of target information functions. Applied Psychological Measurement, 16, 41-51.
. (1992). Adaptive computer-based tasks under an assessment engineering paradigm. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.
. (2009). Test information targeting strategies for adaptive multistage testlet designs. Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (2003). Multidimensional computerized adaptive testing in a certification or licensure context. Applied Psychological Measurement, 20, 389-404.
. (1996). A few more issues to consider in multidimensional computerized adaptive testing. Paper presented at the annual meeting of the American Educational Research Association. San Francisco.
. (1994). Heuristics based CAT: Balancing item information, content, and exposure. Paper presented at the annual meeting of the National Council on Measurement in Education. New York NY.
. (1996). Computer-assisted test assembly using optimization heuristics. Applied Psychological Measurement, 22, 224-236.
. (1998). Computer-adaptive testing. In B. Everitt and D. Howell (Eds.), Encyclopedia of statistics in behavioral science. New York: Wiley.
. (2004). A testlet assembly design for the uniform CPA examination. Paper presented at the annual meeting of the National Council on Measurement in Education. New Orleans.
. (2002). Some practical examples of computerized adaptive sequential testing (Internal Report). Philadelphia: National Board of Medical Examiners.
. (1996). Multidimensional Computerized Adaptive Testing in a Certification or Licensure Context. Applied Psychological Measurement, 20, 389-404.
. (1996). . (1992).
Heuristic-based CAT: Balancing item information, content and exposure. Paper presented at the annual meeting of the National Council on Measurement in Education. New York NY.
. (1996).