TY - JOUR T1 - Efficiency of Targeted Multistage Calibration Designs Under Practical Constraints: A Simulation Study JF - Journal of Educational Measurement Y1 - 2019 A1 - Berger, Stéphanie A1 - Verschoor, Angela J. A1 - Eggen, Theo J. H. M. A1 - Moser, Urs AB - Abstract Calibration of an item bank for computer adaptive testing requires substantial resources. In this study, we investigated whether the efficiency of calibration under the Rasch model could be enhanced by improving the match between item difficulty and student ability. We introduced targeted multistage calibration designs, a design type that considers ability-related background variables and performance for assigning students to suitable items. Furthermore, we investigated whether uncertainty about item difficulty could impair the assembling of efficient designs. The results indicated that targeted multistage calibration designs were more efficient than ordinary targeted designs under optimal conditions. Limited knowledge about item difficulty reduced the efficiency of one of the two investigated targeted multistage calibration designs, whereas targeted designs were more robust. VL - 56 UR - https://onlinelibrary.wiley.com/doi/abs/10.1111/jedm.12203 ER - TY - JOUR T1 - Latent-Class-Based Item Selection for Computerized Adaptive Progress Tests JF - Journal of Computerized Adaptive Testing Y1 - 2017 A1 - van Buuren, Nikky A1 - Eggen, Theo J. H. M. KW - computerized adaptive progress test KW - item selection method KW - Kullback-Leibler information KW - Latent class analysis KW - log-odds scoring VL - 5 UR - http://iacat.org/jcat/index.php/jcat/article/view/62/29 IS - 2 ER - TY - JOUR T1 - Multidimensional Computerized Adaptive Testing for Classifying Examinees With Within-Dimensionality JF - Applied Psychological Measurement Y1 - 2016 A1 - van Groen, Maaike M. A1 - Eggen, Theo J. H. M. A1 - Veldkamp, Bernard P. AB - A classification method is presented for adaptive classification testing with a multidimensional item response theory (IRT) model in which items are intended to measure multiple traits, that is, within-dimensionality. The reference composite is used with the sequential probability ratio test (SPRT) to make decisions and decide whether testing can be stopped before reaching the maximum test length. Item-selection methods are provided that maximize the determinant of the information matrix at the cutoff point or at the projected ability estimate. A simulation study illustrates the efficiency and effectiveness of the classification method. Simulations were run with the new item-selection methods, random item selection, and maximization of the determinant of the information matrix at the ability estimate. The study also showed that the SPRT with multidimensional IRT has the same characteristics as the SPRT with unidimensional IRT and results in more accurate classifications than the latter when used for multidimensional data. VL - 40 UR - http://apm.sagepub.com/content/40/6/387.abstract ER - TY - JOUR T1 - Item Selection Methods Based on Multiple Objective Approaches for Classifying Respondents Into Multiple Levels JF - Applied Psychological Measurement Y1 - 2014 A1 - van Groen, Maaike M. A1 - Eggen, Theo J. H. M. A1 - Veldkamp, Bernard P. AB -

Computerized classification tests classify examinees into two or more levels while maximizing accuracy and minimizing test length. The majority of currently available item selection methods maximize information at one point on the ability scale, but in a test with multiple cutting points selection methods could take all these points simultaneously into account. If for each cutting point one objective is specified, the objectives can be combined into one optimization function using multiple objective approaches. Simulation studies were used to compare the efficiency and accuracy of eight selection methods in a test based on the sequential probability ratio test. Small differences were found in accuracy and efficiency between different methods depending on the item pool and settings of the classification method. The size of the indifference region had little influence on accuracy but considerable influence on efficiency. Content and exposure control had little influence on accuracy and efficiency.

VL - 38 UR - http://apm.sagepub.com/content/38/3/187.abstract ER -