Educational and Psychological Measurement

Papers
(The median citation count of Educational and Psychological Measurement is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Iterative Item Selection of Neighborhood Clusters: A Nonparametric and Non-IRT Method for Generating Miniature Computer Adaptive Questionnaires136
Model Specification Searches in Structural Equation Modeling Using Bee Swarm Optimization29
Summary Intervals for Model-Based Classification Accuracy and Consistency Indices27
Functional Approaches for Modeling Unfolding Data27
Using Deep Reinforcement Learning to Decide Test Length19
Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models15
Assessing the Properties and Functioning of Model-Based Sum Scores in Multidimensional Measures With Local Item Dependencies: A Comprehensive Proposal15
An Illustration of an IRTree Model for Disengagement11
Detecting Differential Item Functioning Using Response Time10
Optimal Number of Replications for Obtaining Stable Dynamic Fit Index Cutoffs10
Generalized Mantel–Haenszel Estimators for Simultaneous Differential Item Functioning Tests10
Detecting Preknowledge Cheating via Innovative Measures: A Mixture Hierarchical Model for Jointly Modeling Item Responses, Response Times, and Visual Fixation Counts10
An Explanatory Multidimensional Random Item Effects Rating Scale Model9
An Omega-Hierarchical Extension Index for Second-Order Constructs With Hierarchical Measuring Instruments9
Assessing the Speed–Accuracy Tradeoff in Psychological Testing Using Experimental Manipulations9
Improving the Use of Parallel Analysis by Accounting for Sampling Variability of the Observed Correlation Matrix9
On the Benefits of Using Maximal Reliability in Educational and Behavioral Research9
Identifying Ability and Nonability Groups: Incorporating Response Times Using Mixture Modeling9
Item Parameter Recovery: Sensitivity to Prior Distribution9
Examination of ChatGPT’s Performance as a Data Analysis Tool9
Rotation Local Solutions in Multidimensional Item Response Theory Models8
What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model8
Exploratory Graph Analysis for Factor Retention: Simulation Results for Continuous and Binary Data8
Extended Multivariate Generalizability Theory With Complex Design Structures7
The Impact and Detection of Uniform Differential Item Functioning for Continuous Item Response Models7
On Bank Assembly and Block Selection in Multidimensional Forced-Choice Adaptive Assessments7
Non-iterative Conditional Pairwise Estimation for the Rating Scale Model7
Polytomous Testlet Response Models for Technology-Enhanced Innovative Items: Implications on Model Fit and Trait Inference7
Examining the Dynamic of Clustering Effects in Multilevel Designs: A Latent Variable Method Application7
A New Stopping Criterion for Rasch Trees Based on the Mantel–Haenszel Effect Size Measure for Differential Item Functioning6
A Monte Carlo Study of Confidence Interval Methods for Generalizability Coefficient6
Detecting Rating Scale Malfunctioning With the Partial Credit Model and Generalized Partial Credit Model5
Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items5
Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics5
Obtaining a Bayesian Estimate of Coefficient Alpha Using a Posterior Normal Distribution5
An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models5
Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates5
Detecting Differential Rater Functioning in Severity and Centrality: The Dual DRF Facets Model5
Examining the Instructional Sensitivity of Constructed-Response Achievement Test Item Scores5
Assessing Essential Unidimensionality of Scales and Structural Coefficient Bias5
Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention5
A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning Within a Test5
Measuring Unipolar Traits With Continuous Response Items: Some Methodological and Substantive Developments5
Evaluating Model Fit of Measurement Models in Confirmatory Factor Analysis5
Differential Item Functioning Effect Size Use for Validity Information5
Historical Measurement Information Can Be Used to Improve Estimation of Structural Parameters in Structural Equation Models With Small Samples4
Awareness Is Bliss: How Acquiescence Affects Exploratory Factor Analysis4
Investigating Heterogeneity in Response Strategies: A Mixture Multidimensional IRTree Approach4
Overestimation of Internal Consistency by Coefficient Omega in Data Giving Rise to a Centroid-Like Factor Solution4
Coefficients of Factor Score Determinacy for Mean Plausible Values of Bayesian Factor Analysis4
Evaluation of Polytomous Item Locations in Multicomponent Measuring Instruments: A Note on a Latent Variable Modeling Procedure4
A Small Sample Correction for Factor Score Regression4
Evaluating the Performance of a Regularized Differential Item Functioning Method for Testlet-Based Polytomous Items4
An Item Response Theory Model for Incorporating Response Times in Forced-Choice Measures4
Identifying Problematic Item Characteristics With Small Samples Using Mokken Scale Analysis4
Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?4
Equidistant Response Options on Likert-Type Instruments: Testing the Interval Scaling Assumption Using Mplus4
Modeling Misspecification as a Parameter in Bayesian Structural Equation Models4
Croon’s Bias-Corrected Estimation for Multilevel Structural Equation Models with Non-Normal Indicators and Model Misspecifications3
The Effect of Latent and Error Non-Normality on Measures of Fit in Structural Equation Modeling3
Discriminant Validity of Interval Response Formats: Investigating the Dimensional Structure of Interval Widths3
The Impact of Sample Size and Various Other Factors on Estimation of Dichotomous Mixture IRT Models3
Detecting Cheating in Large-Scale Assessment: The Transfer of Detectors to New Tests3
Corrigendum to The Optimal Item Pool Design in Multistage Computerized Adaptive Tests with the p-Optimality Method3
Two-Method Measurement Planned Missing Data With Purposefully Selected Samples3
Comparing Accuracy of Parallel Analysis and Fit Statistics for Estimating the Number of Factors With Ordered Categorical Data in Exploratory Factor Analysis3
Is the Area Under Curve Appropriate for Evaluating the Fit of Psychometric Models?3
A Note on Evaluation of Polytomous Item Locations With the Rating Scale Model and Testing Its Fit3
Resolving Dimensionality in a Child Assessment Tool: An Application of the Multilevel Bifactor Model3
Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment3
Field-Testing Multiple-Choice Questions With AI Examinees: English Grammar Items3
Investigating the Ordering Structure of Clustered Items Using Nonparametric Item Response Theory3
Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests2
Symptom Presence and Symptom Severity as Unique Indicators of Psychopathology: An Application of Multidimensional Zero-Inflated and Hurdle Graded Response Models2
Range Restriction Affects Factor Analysis: Normality, Estimation, Fit, Loadings, and Reliability2
Disentangling Qualitatively Different Faking Strategies in High-Stakes Personality Assessments: A Mixture Extension of the Multidimensional Nominal Response Model2
Dominance Analysis for Latent Variable Models: A Comparison of Methods With Categorical Indicators and Misspecified Models2
The Effect of Modeling Missing Data With IRTree Approach on Parameter Estimates Under Different Simulation Conditions2
On the Utility of Indirect Methods for Detecting Faking2
Added Value of Subscores for Tests With Polytomous Items2
Why Forced-Choice and Likert Items Provide the Same Information on Personality, Including Social Desirability2
Interpretation of the Standardized Mean Difference Effect Size When Distributions Are Not Normal or Homoscedastic2
A Regression Discontinuity Design Framework for Controlling Selection Bias in Evaluations of Differential Item Functioning2
Evaluating Equating Methods for Varying Levels of Form Difference2
Treating Noneffortful Responses as Missing2
Assessing Ability Recovery of the Sequential IRT Model With Unstructured Multiple-Attempt Data2
Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks2
A Comparison of the Next Eigenvalue Sufficiency Test to Other Stopping Rules for the Number of Factors in Factor Analysis2
Testing the Performance of Level-Specific Fit Evaluation in MCFA Models With Different Factor Structures Across Levels2
On the Importance of Coefficient Alpha for Measurement Research: Loading Equality Is Not Necessary for Alpha’s Utility as a Scale Reliability Index2
Evaluating Imputation-Based Fit Statistics in Structural Equation Modeling With Ordinal Data: The MI2S Approach2
Developing Situated Measures of Science Instruction Through an Innovative Electronic Portfolio App for Mobile Devices: Reliability, Validity, and Feasibility2
Comparing RMSEA-Based Indices for Assessing Measurement Invariance in Confirmatory Factor Models2
0.034780025482178