Some item response theory to provide scale scores based on linear combinations of testlet scores, for computerized adaptive tests. In Paper presented at the annual meeting of the Psychometric Society. Urbana, IL.
. (1998). The MEDPRO project: An SBIR project for a comprehensive IRT and CAT software system: IRT software. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.
. (2009). Reliability and measurement precision. In H. Wainer, N. J. Dorans, R. Flaugher, B. F. Green, R. J. Mislevy, L. Steinberg, and D. Thissen (Eds.), Computerized adaptive testing: A primer (pp. 161-186). Hillsdale NJ: Erlbaum.
. (1990). Trace lines for testlets: A use of multiple-categorical-response models. Journal of Educational Measurement, 26, 247-260.
. (1989). Methodological issues for building item banks and computerized adaptive scales. Quality of Life Research, 16, 109-119.
. (2007). Some Applications of Optimization Algorithms in Test Design and Adaptive Testing. Applied Psychological Measurement, 10(4), 381-389.
. (1986). Computerized adaptive testing to screen children for emotional and behavioral problems by preventive child healthcare. BMC Pediatrics, 20, Article 119. Retrieved from https://bmcpediatr.biomedcentral.com/articles/10.1186/s12887-020-2018-1
. (2020). Some applications of optimization algorithms in test design and adaptive testing. Applied Psychological Measurement, 10, 381-389.
. (1986). Overview of quantitative measurement methods. Equivalence, invariance, and differential item functioning in health applications. Medical Care, 44, S39-49. presented at the Nov.
. (2006). . (2001).
The relationship between computer familiarity and performance on computer-based TOEFL test tasks (Research Report 98-08). Princeton NJ: Educational Testing Service.
. (1998). Application of adaptive testing to a fraction test (Research Report 84-3-NIE). Urbana IL: University of Illinois, Computer-Based Education Research Laboratory.
. (1984). The danger of relying solely on diagnostic adaptive testing when prior and subsequent instructional methods are different (CERL Report E-5). Urbana IL: University of Illinois, Computer-Based Education Research Laboratory.
. (1979). Diagnostic adaptive testing: Effects of remedial instruction as empirical validation. Journal of Educational Measurement, 34, 3-20.
. (1997). A cognitive error diagnostic adaptive testing system. In the 28th ADCIS International Conference Proceedings. Washington DC: ADCIS.
. (1986). A comparison of two methods of controlling item exposure in computerized adaptive testing. In Paper presented at the meeting of the American Educational Research Association. San Diego CA.
. (1998). A comparison of the traditional maximum information method and the global information method in CAT item selection. In annual meeting of the National Council on Measurement in Education. New York, NY USA.
. (1996). A comparison of methods for adaptive estimation of a multidimensional trait. Unpublished doctoral dissertation, Columbia University.
. (1992). A multivariate experimental study of three computerized adaptive testing models for the measurement of attitude toward teaching effectiveness. Unpublished doctoral dissertation, Florida State University.
. (1973). Guess what? Score differences with rapid replies versus omissions on a computerized adaptive test. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.
. (2009). How Do Trait Change Patterns Affect the Performance of Adaptive Measurement of Change? Journal of Computerized Adaptive Testing, 10(3), 32-58. doi:10.7333/2307-1003032
. (2023). Pre-equating: A simulation study based on a large scale assessment model. Journal of Applied Measurement, 5, 301-318.
. (2004). The Philosophical Aspects of IRT Equating: Modeling Drift to Evaluate Cohort Growth in Large-Scale Assessments. Educational Measurement: Issues and Practice, 32, 2–14. doi:10.1111/emip.12000
. (2013). A model for testing with multidimensional items. In Proceedings of the 1977 Computerized Adaptive Testing Conference. Presented 06/1978, Minneapolis, MN, USA: University of Minnesota, Department of Psychology, Psychometric Methods Program.
. (1978). Estimating the reliability of adaptive tests from a single test administration. In Paper presented at the annual meeting of the American Educational Research Association. Boston.
. (1980). Evaluating the results of computerized adaptive testing. In D. J. Weiss (Ed.), Computerized adaptive trait measurement: Problems and Prospects (Research Report 75-5), pp. 26-31. Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program.
. (1975). Predictive validity of conventional and adaptive tests in an Air Force training environment (Report AFHRL-TR-81-40). Brooks Air Force Base TX: Air Force Human Resources Laboratory, Manpower and Personnel Division.
. (1982). Controlling item-exposure rates in computerized adaptive testing. In Proceedings of the 27th annual meeting of the Military Testing Association (pp. 973-977). San Diego CA: Navy Personnel Research and Development Center.
. (1985). Controlling item exposure conditional on ability in computerized adaptive testing. Journal of Educational and Behavioral Statistics, 23, 57-75.
. (1985). Criterion-related validity of conventional and adaptive tests in a military environment. In Paper presented at the 1979 Computerized Adaptive Testing Conference. Minneapolis MN.
. (1979). Item Calibrations for Computerized Adaptive Testing (CAT) Experimental Item Pools Adaptive Testing. In D. J. Weiss (Ed.), Proceedings of the 1982 Computerized Adaptive Testing Conference (pp. 290-294). Minneapolis MN: University of Minnesota, Department of Psychology, Psychometric Methods Program.
. (1982). Predictive validity of computerized adaptive testing in a military training environment. In Paper presented at the annual meeting of the American Educational Research Association. New Orleans LA.
. (1984). Estimation of latent trait status in adaptive testing. In D. J. Weiss (Ed.), Applications of computerized testing (Research Report 77-1). Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program.
. (1977). Validity of adaptive testing: A summary of research results. In Paper presented at the annual meeting of the American Psychological Association.
. (1985). A model for testing with multidimensional items. In D. J. Weiss (Ed.), Proceedings of the 1977 Computerized Adaptive Testing Conference. Minneapolis MN: University of Minnesota, Department of Psychology, Psychometric Methods Program.
. (1977). Estimation of item difficulty from restricted CAT calibration samples. In Paper presented at the annual conference of the National Council on Measurement in Education in San Francisco.
. (1995). Computerized adaptive testing in computer science: assessing student programming abilities. In Proceedings of the twenty-fourth SIGCSE Technical Symposium on Computer Science Education. Indianapolis IN.
. (1993). Detecting misbehaving items in a CAT environment. In Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago, IL.
. (1997). An examination of item-level response times from an operational CAT. In Paper presented at the annual meeting of the National Council on Measurement in Education. Urbana IL.
. (1998). A burdened CAT: Incorporating response burden with maximum Fisher's information for item selection. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.
. (2009). Relationship of response latency to test design, examinee ability, and item difficulty in computer-based test administration. In Paper presented at the annual meeting of the National Council on Measurement in Education. Chicago IL.
. (1997). Small sample estimation in dichotomous item response models: Effect of priors based on judgmental information on the accuracy of item parameter estimates. Applied Psychological Measurement, 27, 27-51.
. (2003). Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments. Journal of Educational Measurement, 56, 192-213. doi:10.1111/jedm.12206
. (2019). Development of an adaptive multimedia program to collect patient health data. American Journal of Preventive Medicine, 21, 320-324.
. (2001). Development of a multiple-component CAT for measuring foreign language proficiency (SIMTEST). In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.
. (2007). A Comparison of Constrained Item Selection Methods in Multidimensional Computerized Adaptive Testing. Applied Psychological Measurement, 40, 346-360. doi:10.1177/0146621616639305
. (2016). Linking the standard and advanced forms of the Ravens Progressive Matrices in both the paper-and-pencil and computer-adaptive-testing formats. Educational and Psychological Measurement, 53, 905-925.
. (1993). The development of a computerized version of Vandenberg's mental rotation test and the effect of visuo-spatial working memory loading. Dissertation Abstracts International Section A: Humanities and Social Sciences, 60, 3938.
. (2000). Test difficulty and stereotype threat on the GRE General Test. Journal of Applied Social Psychology, 34(3), 563-597.
. (2004). . (1998).
Multi-stage Testing for a Multi-disciplined End-of primary-school Test. In IACAT 2017 Conference. Presented 08/2017, Niigata, Japan: Niigata Seiryo University. Retrieved from https://drive.google.com/open?id=1C5ys178p_Wl9eemQuIsI56IxDTck2z8P
. (2017). Computerized adaptive testing in the Bundeswehr. Unpublished manuscript.
. (1999). Detection of misfitting item-score patterns in computerized adaptive testing. Enschede, The Netherlands: Febodruk B.
. (2001). The historical developments of fit and its assessment in the computerized adaptive testing environment. In Midwestern Education Research Association annual meeting. Presented 10/1994, Chicago, IL, USA.
. (1994). Item calibration considerations: A comparison of item calibrations on written and computerized adaptive examinations. In Paper presented at the annual meeting of the American Educational Research Association. New Orleans LA.
. (1994). The effect of review on the psychometric characteristics of computerized adaptive tests. Applied Measurement in Education, 7, 211-222.
. (1994). Testing software review: MicroCAT Version 3. Educational Measurement: Issues and Practice, 8(3), 33-38.
. (1989). The effect of review on the psychometric characteristics of computerized adaptive tests. Applied Measurement in Education, 7, 211-222.
. (1994). Equivalence of scores from computerized adaptive and paper-and-pencil ASVAB tests (No. CNR 113) (p. 100). Alexandria, VA, USA: Center for Naval Analyses.
. (1985). Equivalent-groups versus single-group equating designs for the Accelerated CAT-ASVAB Project (Research Memorandum 87-6). Alexandria VA: Center for Naval Analyses.
. (1987).